Lubomir Riha

Andrea Bartolini

Concurr. Comput. Pract. Exp., 2021

2020

Batched transpose-free ADI-type preconditioners for a Poisson solver on GPGPUs.

[BibT_eX]

[DOI]

Peter Arbenz

J. Parallel Distributed Comput., 2020

Toward an End-to-End Auto-tuning Framework in HPC PowerStack.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Cluster Computing, 2020

2019

A massively parallel and memory-efficient FEM toolbox with a hybrid total FETI solver with accelerator support.

[BibT_eX]

[DOI]

Int. J. High Perform. Comput. Appl., 2019

Domain knowledge specification for energy tuning.

[BibT_eX]

[DOI]

Concurr. Comput. Pract. Exp., 2019

Overview of Application Instrumentation for Performance Analysis and Tuning.

[BibT_eX]

[DOI]

Ondrej Vysocky

Andrea Bartolini

Proceedings of the Parallel Processing and Applied Mathematics, 2019

Performance, Power Consumption and Thermal Behavioral Evaluation of the DGX-2 Platform.

[BibT_eX]

[DOI]

Matej Spetko

Branislav Jansik

Proceedings of the Parallel Computing: Technology Trends, 2019

Evaluation of DVFS and Uncore Frequency Tuning Under Power Capping on Intel Broadwell Architecture.

[BibT_eX]

[DOI]

Ondrej Vysocky

Andrea Bartolini

Proceedings of the Parallel Computing: Technology Trends, 2019

An Approach for Parallel Loading and Pre-Processing of Unstructured Meshes Stored in Spatially Scattered Fashion.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE International Parallel and Distributed Processing Symposium, 2019

Analysis and Visualization of the Dynamic Behavior of HPC Applications.

[BibT_eX]

[DOI]

Proceedings of the High Performance Computing in Science and Engineering, 2019

HPC, Cloud and Big-Data Convergent Architectures: The LEXIS Approach.

[BibT_eX]

[DOI]

Proceedings of the Complex, Intelligent, and Software Intensive Systems, 2019

2018

Evaluation of the Intel Xeon Phi offload runtimes for domain decomposition solvers.

[BibT_eX]

[DOI]

Adv. Eng. Softw., 2018

Acceleration Techniques for FETI Solvers for GPU Accelerators.

[BibT_eX]

[DOI]

Radim Vavrík

Proceedings of the 2018 International Conference on High Performance Computing & Simulation, 2018

2017

The READEX formalism for automatic tuning for energy efficiency.

[BibT_eX]

[DOI]

Computing, 2017

Hybrid parallelization of the total FETI solver.

[BibT_eX]

[DOI]

Adv. Eng. Softw., 2017

Intel Xeon Phi acceleration of Hybrid Total FETI solver.

[BibT_eX]

[DOI]

Michal Merta

Tomás Kozubek

Vít Vondrák

Adv. Eng. Softw., 2017

Implementation of K-means segmentation algorithm on Intel Xeon Phi and GPU: Application in medical imaging.

[BibT_eX]

[DOI]

Adv. Eng. Softw., 2017

MERIC and RADAR Generator: Tools for Energy Evaluation and Runtime Tuning of HPC Applications.

[BibT_eX]

[DOI]

Proceedings of the High Performance Computing in Science and Engineering, 2017

Using ESPRESO as Linear Solver Library for Third Party FEM Tools for Solving Large Scale Problems.

[BibT_eX]

[DOI]

Tomás Kozubek

Proceedings of the High Performance Computing in Science and Engineering, 2017

READEX: Linking two ends of the computing continuum to improve energy-efficiency in dynamic applications.

[BibT_eX]

[DOI]

Per Gunnar Kjeldsberg

Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2017

2016

Optimization of Selected Remote Sensing Algorithms for Many-Core Architectures.

[BibT_eX]

[DOI]

Jacqueline Le Moigne

Tarek A. El-Ghazawi

IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens., 2016

Implementation of the efficient communication layer for the highly parallel total FETI and hybrid total FETI solvers.

[BibT_eX]

[DOI]

Parallel Comput., 2016

Massively Parallel Hybrid Total FETI (HTFETI) Solver.

[BibT_eX]

[DOI]

Tomás Kozubek

Proceedings of the Platform for Advanced Scientific Computing Conference, 2016

Energy consumption optimization of the Total-FETI solver and BLAS routines by changing the CPU frequency.

[BibT_eX]

[DOI]

Proceedings of the International Conference on High Performance Computing & Simulation, 2016

2015

Communication efficient work distributions in stencil operation based applications.

[BibT_eX]

[DOI]

Concurr. Comput. Pract. Exp., 2015

Optimization of selected remote sensing algorithms for embedded Nvidia Kepler GPU architecture.

[BibT_eX]

[DOI]

Jacqueline Le Moigne

Tarek A. El-Ghazawi

Proceedings of the 2015 IEEE International Geoscience and Remote Sensing Symposium, 2015

Efficient Implementation of Total FETI Solver for Graphic Processing Units Using Schur Complement.

[BibT_eX]

[DOI]

Proceedings of the High Performance Computing in Science and Engineering, 2015

Acceleration of Blender Cycles Path-Tracing Engine Using Intel Many Integrated Core Architecture.

[BibT_eX]

[DOI]

Proceedings of the Computer Information Systems and Industrial Management, 2015

2013

An Adaptive Hybrid OLAP Architecture with optimized memory access patterns.

[BibT_eX]

[DOI]

Maria Malik

Tarek A. El-Ghazawi

Clust. Comput., 2013

Application-specific processors for web-browsing: An exploration and evaluation of the design space.

[BibT_eX]

[DOI]

Proceedings of the 24th International Conference on Application-Specific Systems, 2013

2012

Task Scheduling for GPU Accelerated Hybrid OLAP Systems with Multi-core Support and Text-to-Integer Translation.

[BibT_eX]

[DOI]

Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium Workshops & PhD Forum, 2012

A method for communication efficient work distributions in stencil operation based applications on heterogeneous clusters.

[BibT_eX]

[DOI]

Proceedings of the 2012 International Conference on High Performance Computing & Simulation, 2012

2011

Acceleration of acoustic emission signal processing algorithms using CUDA standard.

[BibT_eX]

[DOI]

Radislav Smid

Comput. Stand. Interfaces, 2011

GPU accelerated one-pass algorithm for computing minimal rectangles of connected components.

[BibT_eX]

[DOI]

Mareboyana Manohar

Proceedings of the IEEE Workshop on Applications of Computer Vision (WACV 2011), 2011

Task scheduling for GPU accelerated OLAP systems.

[BibT_eX]

[DOI]

Proceedings of the Center for Advanced Studies on Collaborative Research, 2011

Real-time motion object tracking using GPU.

[BibT_eX]

[DOI]

Hoda El-Sayed

Proceedings of the 9th IEEE/ACS International Conference on Computer Systems and Applications, 2011

2010

Real-Time Motion Object Tracking Using GPU and Cell Processor.

[BibT_eX]

Hoda El-Sayed

Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications, 2010

2009

Real-Time Motion Tracking Using the CELL BE.

[BibT_eX]

[DOI]

Hoda El-Sayed