Pedro Valero-Lara
Orcid: 0000-0002-1479-4310
According to our database1,
Pedro Valero-Lara
authored at least 71 papers
between 2011 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on zbmath.org
-
on orcid.org
-
on d-nb.info
On csauthors.net:
Bibliography
2024
Concurr. Comput. Pract. Exp., November, 2024
MatRIS: Addressing the Challenges for Portability and Heterogeneity Using Tasking for Matrix Decomposition (Cholesky).
Proceedings of the Asynchronous Many-Task Systems and Applications, 2024
IRIS Reimagined: Advancements in Intelligent Runtime System for Task-Based Programming.
Proceedings of the Asynchronous Many-Task Systems and Applications, 2024
Proceedings of the SC24-W: Workshops of the International Conference for High Performance Computing, 2024
JACC: Leveraging HPC Meta-Programming and Performance Portability with the Just-in-Time and LLVM-based Julia Language.
Proceedings of the SC24-W: Workshops of the International Conference for High Performance Computing, 2024
Integrating ORNL's HPC and Neutron Facilities with a Performance-Portable CPU/GPU Ecosystem.
Proceedings of the SC24-W: Workshops of the International Conference for High Performance Computing, 2024
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2024
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2024
sKokkos: Enabling Kokkos with Transparent Device Selection on Heterogeneous Systems using OpenACC.
Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region, 2024
2023
Moment Representation of Regularized Lattice Boltzmann Methods on NVIDIA and AMD GPUs.
Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023
Mixed-Precision S/DGEMM Using the TF32 and TF64 Frameworks on Low-Precision AI Tensor Cores.
Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023
MatRIS: Multi-level Math Library Abstraction for Heterogeneity and Performance Portability using IRIS Runtime.
Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023
Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023
Proceedings of the 2nd International Workshop on Extreme Heterogeneity Solutions, 2023
Proceedings of the 9th ACM SIGPLAN International Workshop on Libraries, 2023
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2023
Evaluating performance and portability of high-level programming models: Julia, Python/Numba, and Kokkos on exascale nodes.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2023
Proceedings of the 52nd International Conference on Parallel Processing Workshops, 2023
Proceedings of the IEEE High Performance Extreme Computing Conference, 2023
2022
IEEE Trans. Parallel Distributed Syst., 2022
Proceedings of the 9th Workshop on Accelerator Programming Using Directives, 2022
LaRIS: Targeting Portability and Productivity for LAPACK Codes on Extreme Heterogeneous Systems by Using IRIS.
Proceedings of the IEEE/ACM Redefining Scalability for Diversely Heterogeneous Architectures Workshop, 2022
Proceedings of the 12th IEEE/ACM Workshop on Irregular Applications: Architectures and Algorithms, 2022
Proceedings of the 29th IEEE International Conference on High Performance Computing, 2022
Proceedings of the Euro-Par 2022: Parallel Processing Workshops, 2022
2021
Proceedings of the 28th IEEE International Conference on High Performance Computing, 2021
Proceedings of the Euro-Par 2021: Parallel Processing Workshops, 2021
2020
sLASs: A fully automatic auto-tuned linear algebra library based on OpenMP extensions implemented in OmpSs (LASs Library).
J. Parallel Distributed Comput., 2020
Proceedings of the OpenMP: Portable Multi-Level Parallelism on Modern Systems, 2020
2019
Parallel Comput., 2019
IEEE Access, 2019
IEEE Access, 2019
Proceedings of the 27th Euromicro International Conference on Parallel, 2019
Proceedings of the 20th International Conference on Parallel and Distributed Computing, 2019
Proceedings of the 20th International Conference on Parallel and Distributed Computing, 2019
2018
cuThomasBatch and cuThomasVBatch, CUDA Routines to compute batch of tridiagonal systems on NVIDIA GPUs.
Concurr. Comput. Pract. Exp., 2018
MPI+OpenMP Tasking Scalability for the Simulation of the Human Brain: Human Brain Project.
Proceedings of the 25th European MPI Users' Group Meeting, 2018
Proceedings of the 26th Euromicro International Conference on Parallel, 2018
2017
Introduction to the Special Issue on High Performance Computing Solutions for Complex Problems.
Scalable Comput. Pract. Exp., 2017
Scalable Comput. Pract. Exp., 2017
Heterogeneous CPU+GPU approaches for mesh refinement over Lattice-Boltzmann simulations.
Concurr. Comput. Pract. Exp., 2017
Concurr. Comput. Pract. Exp., 2017
NVIDIA GPUs Scalability to Solve Multiple (Batch) Tridiagonal Systems Implementation of cuThomasBatch.
Proceedings of the Parallel Processing and Applied Mathematics, 2017
Proceedings of the Advances in Computational Intelligence, 2017
cuHinesBatch: Solving Multiple Hines systems on GPUs Human Brain Project<sup>*</sup>.
Proceedings of the International Conference on Computational Science, 2017
The Design and Performance of Batched BLAS on Modern High-Performance Computing Systems.
Proceedings of the International Conference on Computational Science, 2017
2016
Introduction to the Special Issue on High Performance Computing Solutions for Complex Problems.
Scalable Comput. Pract. Exp., 2016
Proceedings of the Algorithms and Architectures for Parallel Processing, 2016
2015
Accelerating fluid-solid simulations (Lattice-Boltzmann & Immersed-Boundary) on heterogeneous architectures.
J. Comput. Sci., 2015
Proceedings of the International Conference on Computational Science, 2015
Multi-domain Grid Refinement for Lattice-Boltzmann Simulations on Heterogeneous Platforms.
Proceedings of the 18th IEEE International Conference on Computational Science and Engineering, 2015
LBM-HPC - An Open-Source Tool for Fluid Simulations. Case Study: Unified Parallel C (UPC-PGAS).
Proceedings of the 2015 IEEE International Conference on Cluster Computing, 2015
2014
Accelerating solid-fluid interaction based on the immersed boundary method on multicore and GPU architectures.
J. Supercomput., 2014
Comput. Phys. Commun., 2014
hLCS. A Hybrid GPGPU Approach for Solving Multiple Short and Unbalanced LCS Problems.
Proceedings of the Computational Science and Its Applications - ICCSA 2014 - 14th International Conference, Guimarães, Portugal, June 30, 2014
Accelerating Solid-fluid Interaction using Lattice-boltzmann and Immersed Boundary Coupled Simulations on Heterogeneous Platforms.
Proceedings of the International Conference on Computational Science, 2014
Proceedings of the 2014 IEEE International Conference on Cluster Computing, 2014
2013
A GPU approach for accelerating 3D deformable registration (DARTEL) on brain biomedical images.
Proceedings of the 20th European MPI Users's Group Meeting, 2013
Proceedings of the 42nd International Conference on Parallel Processing, 2013
Analysis in performance and new model for multiple kernels executions on many-core architectures.
Proceedings of the IEEE 12th International Conference on Cognitive Informatics and Cognitive Computing, 2013
2012
Proceedings of the 10th IEEE International Symposium on Parallel and Distributed Processing with Applications, 2012
Proceedings of the 41st International Conference on Parallel Processing Workshops, 2012
Improving the Performance for the Range Search on Metric Spaces Using a Multi-GPU Platform.
Proceedings of the Database and Expert Systems Applications, 2012
2011
J. Supercomput., 2011
Proceedings of the 2011 International Conference on High Performance Computing & Simulation, 2011
Proceedings of the International Conference on Computational Science and Its Applications, 2011
Proceedings of the Computational Science and Its Applications - ICCSA 2011, 2011