Lubomir Riha

Orcid: 0000-0002-1017-5766

According to our database, Lubomir Riha authored at least 48 papers between 2009 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
Workflow for high-quality visualisation of large-scale CFD simulations by volume rendering.
Adv. Eng. Softw., 2025

2024
An approach for dynamically adaptable SIMD vectorization of FEM kernels.
Comput. Phys. Commun., 2024

Automated search of an optimal configuration of FETI-based algorithms with the swarm and evolutionary algorithms.
Appl. Soft Comput., 2024

2023
Hybrid multi-model ensemble learning for reconstructing gridded runoff of Europe for 500 years.
Inf. Fusion, September, 2023

Speed Up of Volumetric Non-Local Transform-Domain Filter Utilising HPC Architecture.
J. Imaging, 2023

Scalable Flow Simulations with the Lattice Boltzmann Method.
Proceedings of the 20th ACM International Conference on Computing Frontiers, 2023

2022
On the spectrum of Schur complements of 2D elastic clusters joined by rigid edge modes and hybrid domain decomposition.
Numerische Mathematik, 2022

Toward highly parallel loading of unstructured meshes.
Adv. Eng. Softw., 2022

2021
GPU Accelerated Path Tracing of Massive Scenes.
ACM Trans. Graph., 2021

Application instrumentation for performance analysis and tuning with focus on energy efficiency.
Concurr. Comput. Pract. Exp., 2021

2020
Batched transpose-free ADI-type preconditioners for a Poisson solver on GPGPUs.
J. Parallel Distributed Comput., 2020

Toward an End-to-End Auto-tuning Framework in HPC PowerStack.
Proceedings of the IEEE International Conference on Cluster Computing, 2020

2019
A massively parallel and memory-efficient FEM toolbox with a hybrid total FETI solver with accelerator support.
Int. J. High Perform. Comput. Appl., 2019

Domain knowledge specification for energy tuning.
Concurr. Comput. Pract. Exp., 2019

Overview of Application Instrumentation for Performance Analysis and Tuning.
Proceedings of the Parallel Processing and Applied Mathematics, 2019

Performance, Power Consumption and Thermal Behavioral Evaluation of the DGX-2 Platform.
Proceedings of the Parallel Computing: Technology Trends, 2019

Evaluation of DVFS and Uncore Frequency Tuning Under Power Capping on Intel Broadwell Architecture.
Proceedings of the Parallel Computing: Technology Trends, 2019

An Approach for Parallel Loading and Pre-Processing of Unstructured Meshes Stored in Spatially Scattered Fashion.
Proceedings of the 2019 IEEE International Parallel and Distributed Processing Symposium, 2019

Analysis and Visualization of the Dynamic Behavior of HPC Applications.
Proceedings of the High Performance Computing in Science and Engineering, 2019
2018
Evaluation of the Intel Xeon Phi offload runtimes for domain decomposition solvers.
Adv. Eng. Softw., 2018

Acceleration Techniques for FETI Solvers for GPU Accelerators.
Proceedings of the 2018 International Conference on High Performance Computing & Simulation, 2018

2017
The READEX formalism for automatic tuning for energy efficiency.
Computing, 2017

Hybrid parallelization of the total FETI solver.
Adv. Eng. Softw., 2017

Intel Xeon Phi acceleration of Hybrid Total FETI solver.
Adv. Eng. Softw., 2017

Implementation of K-means segmentation algorithm on Intel Xeon Phi and GPU: Application in medical imaging.
Adv. Eng. Softw., 2017

MERIC and RADAR Generator: Tools for Energy Evaluation and Runtime Tuning of HPC Applications.
Proceedings of the High Performance Computing in Science and Engineering, 2017

Using ESPRESO as Linear Solver Library for Third Party FEM Tools for Solving Large Scale Problems.
Proceedings of the High Performance Computing in Science and Engineering, 2017

READEX: Linking two ends of the computing continuum to improve energy-efficiency in dynamic applications.
Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2017

2016
Optimization of Selected Remote Sensing Algorithms for Many-Core Architectures.
IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens., 2016

Implementation of the efficient communication layer for the highly parallel total FETI and hybrid total FETI solvers.
Parallel Comput., 2016

Massively Parallel Hybrid Total FETI (HTFETI) Solver.
Proceedings of the Platform for Advanced Scientific Computing Conference, 2016

Energy consumption optimization of the Total-FETI solver and BLAS routines by changing the CPU frequency.
Proceedings of the International Conference on High Performance Computing & Simulation, 2016

2015
Communication efficient work distributions in stencil operation based applications.
Concurr. Comput. Pract. Exp., 2015

Optimization of selected remote sensing algorithms for embedded Nvidia Kepler GPU architecture.
Proceedings of the 2015 IEEE International Geoscience and Remote Sensing Symposium, 2015

Efficient Implementation of Total FETI Solver for Graphic Processing Units Using Schur Complement.
Proceedings of the High Performance Computing in Science and Engineering, 2015

Acceleration of Blender Cycles Path-Tracing Engine Using Intel Many Integrated Core Architecture.
Proceedings of the Computer Information Systems and Industrial Management, 2015

2013
An Adaptive Hybrid OLAP Architecture with optimized memory access patterns.
Clust. Comput., 2013

Application-specific processors for web-browsing: An exploration and evaluation of the design space.
Proceedings of the 24th International Conference on Application-Specific Systems, 2013

2012
Task Scheduling for GPU Accelerated Hybrid OLAP Systems with Multi-core Support and Text-to-Integer Translation.
Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium Workshops & PhD Forum, 2012

A method for communication efficient work distributions in stencil operation based applications on heterogeneous clusters.
Proceedings of the 2012 International Conference on High Performance Computing & Simulation, 2012

2011
Acceleration of acoustic emission signal processing algorithms using CUDA standard.
Comput. Stand. Interfaces, 2011

GPU accelerated one-pass algorithm for computing minimal rectangles of connected components.
Proceedings of the IEEE Workshop on Applications of Computer Vision (WACV 2011), 2011

Task scheduling for GPU accelerated OLAP systems.
Proceedings of the Center for Advanced Studies on Collaborative Research, 2011

Real-time motion object tracking using GPU.
Proceedings of the 9th IEEE/ACS International Conference on Computer Systems and Applications, 2011

2010
Real-Time Motion Object Tracking Using GPU and Cell Processor.
Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications, 2010

2009
Real-Time Motion Tracking Using the CELL BE.
Proceedings of the NTMS 2009, 2009

Dialog Systems For Modeling, Monitoring And Control In Sensor Network Applications.
Proceedings of the 2009 International Conference on Internet Computing, 2009
