Antonio J. Peña
Orcid: 0000-0002-3575-4617
According to our database, Antonio J. Peña authored at least 71 papers between 2009 and 2024.
Bibliography
2024
Malleability in Modern HPC Systems: Current Experiences, Challenges, and Future Opportunities.
IEEE Trans. Parallel Distributed Syst., September, 2024
2023
CoRR, 2023
Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023
Proceedings of the IEEE International Conference on Cluster Computing, 2023
Proceedings of the 32nd ACM SIGPLAN International Conference on Compiler Construction, 2023
2022
IEEE Trans. Computers, 2022
Proceedings of the 9th Workshop on Accelerator Programming Using Directives, 2022
Proceedings of the PPoPP '22: 27th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Seoul, Republic of Korea, April 2, 2022
Proceedings of the IEEE International Conference on Cluster Computing, 2022
2021
IEEE Trans. Computers, 2021
Proceedings of the 28th IEEE International Conference on High Performance Computing, 2021
Proceedings of the 28th IEEE International Conference on High Performance Computing, 2021
Proceedings of the Euro-Par 2021: Parallel Processing, 2021
2020
IEEE Trans. Computers, 2020
Guest editorial: Special Issue on Applications and System Software for Hybrid Exascale Systems.
Parallel Comput., 2020
2019
Parallel Comput., 2019
Integrating blocking and non-blocking MPI primitives with task-based programming models.
Parallel Comput., 2019
Dynamic reconfiguration of noniterative scientific applications: A case study with HPG aligner.
Int. J. High Perform. Comput. Appl., 2019
IEEE Access, 2019
Proceedings of the 20th International Conference on Parallel and Distributed Computing, 2019
Proceedings of the IEEE International Parallel and Distributed Processing Symposium Workshops, 2019
2018
IEEE Trans. Parallel Distributed Syst., 2018
Exploring the interoperability of remote GPGPU virtualization using rCUDA and directive-based programming models.
J. Supercomput., 2018
Parallel Comput., 2018
Parallel Comput., 2018
Parallel Comput., 2018
On the adequacy of lightweight thread approaches for high-level parallel programming models.
Future Gener. Comput. Syst., 2018
cuThomasBatch and cuThomasVBatch, CUDA Routines to compute batch of tridiagonal systems on NVIDIA GPUs.
Concurr. Comput. Pract. Exp., 2018
MPI+OpenMP Tasking Scalability for the Simulation of the Human Brain: Human Brain Project.
Proceedings of the 25th European MPI Users' Group Meeting, 2018
Proceedings of the 25th European MPI Users' Group Meeting, 2018
Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium Workshops, 2018
Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium Workshops, 2018
2017
NVIDIA GPUs Scalability to Solve Multiple (Batch) Tridiagonal Systems Implementation of cuThomasBatch.
Proceedings of the Parallel Processing and Applied Mathematics, 2017
Proceedings of the 2017 IEEE International Symposium on Performance Analysis of Systems and Software, 2017
Supporting automatic recovery in offloaded distributed programming models through MPI-3 techniques.
Proceedings of the International Conference on Supercomputing, 2017
Proceedings of the 46th International Conference on Parallel Processing Workshops, 2017
Proceedings of the 46th International Conference on Parallel Processing Workshops, 2017
Proceedings of the 46th International Conference on Parallel Processing, 2017
Proceedings of the 46th International Conference on Parallel Processing, 2017
cuHinesBatch: Solving Multiple Hines systems on GPUs Human Brain Project.
Proceedings of the International Conference on Computational Science, 2017
Proceedings of the Euro-Par 2017: Parallel Processing - 23rd International Conference on Parallel and Distributed Computing, Santiago de Compostela, Spain, August 28, 2017
Proceedings of the 2017 IEEE International Conference on Cluster Computing, 2017
2016
A data-oriented profiler to assist in data partitioning and distribution for heterogeneous memory in HPC.
Parallel Comput., 2016
Parallel Comput., 2016
Evaluating the effect of last-level cache sharing on integrated GPU-CPU systems with heterogeneous applications.
Proceedings of the 2016 IEEE International Symposium on Workload Characterization, 2016
One-Sided Interface for Matrix Operations Using MPI-3 RMA: A Case Study with Elemental.
Proceedings of the 45th International Conference on Parallel Processing, 2016
Proceedings of the 2016 IEEE International Conference on Cluster Computing, 2016
2015
Concurr. Comput. Pract. Exp., 2015
Proceedings of the International Conference for High Performance Computing, 2015
Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium, 2015
Exploring the Suitability of Remote GPGPU Virtualization for the OpenACC Programming Model Using rCUDA.
Proceedings of the 2015 IEEE International Conference on Cluster Computing, 2015
Proceedings of the 2015 IEEE International Conference on Cluster Computing, 2015
Proceedings of the 15th IEEE/ACM International Symposium on Cluster, 2015
Proceedings of the 15th IEEE/ACM International Symposium on Cluster, 2015
Proceedings of the 15th IEEE/ACM International Symposium on Cluster, 2015
2014
Parallel Comput., 2014
Proceedings of the 2014 International Conference on Supercomputing, 2014
Proceedings of the 43rd International Conference on Parallel Processing Workshops, 2014
Boosting the performance of remote GPU virtualization using InfiniBand connect-IB and PCIe 3.0.
Proceedings of the 2014 IEEE International Conference on Cluster Computing, 2014
Proceedings of the 2014 IEEE International Conference on Cluster Computing, 2014
2013
Proceedings of the 20th European MPI Users' Group Meeting, 2013
Proceedings of the 2013 IEEE International Conference on Cluster Computing, 2013
Evaluation of Inter- and Intra-node Data Transfer Efficiencies between GPU Devices and their Impact on Scalable Applications.
Proceedings of the 13th IEEE/ACM International Symposium on Cluster, 2013
2012
Proceedings of the 19th International Conference on High Performance Computing, 2012
2011
Proceedings of the International Conference on Parallel Processing, 2011
Proceedings of the 18th International Conference on High Performance Computing, 2011
2010
Proceedings of the 2010 International Conference on High Performance Computing & Simulation, 2010
2009
Proceedings of the Euro-Par 2009, 2009