Oscar R. Hernandez

Orcid: 0000-0002-5380-6951

According to our database, Oscar R. Hernandez authored at least 81 papers between 2001 and 2024.

Bibliography

2024
Impacts of floating-point non-associativity on reproducibility for HPC and deep learning applications.
CoRR, 2024

2023
Extending OpenSHMEM with Aggregation Support for Improved Message Rate Performance.
Proceedings of the Euro-Par 2023: Parallel Processing - 29th International Conference on Parallel and Distributed Computing, Limassol, Cyprus, August 28, 2023

2022
OpenMP application experiences: Porting to accelerated nodes.
Parallel Comput., 2022

Early Application Experiences on a Modern GPU-Accelerated Arm-based HPC Platform.
CoRR, 2022

2021
High-throughput virtual laboratory for drug discovery using massive datasets.
Int. J. High Perform. Comput. Appl., 2021

Supercomputing Pipelines Search for Therapeutics Against COVID-19.
Comput. Sci. Eng., 2021

NVIDIA's Cloud Native Supercomputing.
Proceedings of the Driving Scientific and Engineering Discoveries Through the Integration of Experiment, Big Data, and Modeling and Simulation, 2021

Memory reduction using a ring abstraction over GPU RDMA for distributed quantum Monte Carlo solver.
Proceedings of the PASC '21: Platform for Advanced Scientific Computing Conference, 2021

A Case Study of LLVM-Based Analysis for Optimizing SIMD Code Generation.
Proceedings of the OpenMP: Enabling Massive Node-Level Parallelism, 2021

ARBALEST: Dynamic Detection of Data Mapping Issues in Heterogeneous OpenMP Applications.
Proceedings of the 35th IEEE International Parallel and Distributed Processing Symposium, 2021

Combining Static and Dynamic Analysis to Query Characteristics of HPC Applications.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium Workshops, 2021

Task-graph scheduling extensions for efficient synchronization and communication.
Proceedings of the ICS '21: 2021 International Conference on Supercomputing, 2021

OpenMP Target Task: Tasking and Target Offloading on Heterogeneous Systems.
Proceedings of the Euro-Par 2021: Parallel Processing Workshops, 2021

Addressing Load Imbalance in Bioinformatics and Biomedical Applications: Efficient Scheduling across Multiple GPUs.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2021

2020
Supercomputer-Based Ensemble Docking Drug Discovery Pipeline with Application to Covid-19.
J. Chem. Inf. Model., 2020

Experiences in porting mini-applications to OpenACC and OpenMP on heterogeneous systems.
Concurr. Comput. Pract. Exp., 2020

Performance Analysis of a Quantum Monte Carlo Application on Multiple Hardware Architectures Using the HPX Runtime.
Proceedings of the 11th IEEE/ACM Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems, 2020

Performance Portability of Molecular Docking Miniapp On Leadership Computing Platforms.
Proceedings of the IEEE/ACM International Workshop on Performance, 2020

A Study of Memory Anomalies in OpenMP Applications.
Proceedings of the OpenMP: Portable Multi-Level Parallelism on Modern Systems, 2020

Toward Supporting Multi-GPU Targets via Taskloop and User-Defined Schedules.
Proceedings of the OpenMP: Portable Multi-Level Parallelism on Modern Systems, 2020

Co-designing OpenMP Features Using OMPT and Simulation Tools.
Proceedings of the OpenMP: Portable Multi-Level Parallelism on Modern Systems, 2020

GPU-Accelerated Drug Discovery with Docking on the Summit Supercomputer: Porting, Optimization, and Application to COVID-19 Research.
Proceedings of the BCB '20: 11th ACM International Conference on Bioinformatics, 2020

2019
Analysis of OpenMP 4.5 Offloading in Implementations: Correctness and Overhead.
Parallel Comput., 2019

Evaluation of directive-based performance portable programming models.
Int. J. High Perform. Comput. Netw., 2019

MPI + OpenACC: Accelerating radiation transport mini-application, minisweep, on heterogeneous systems.
Comput. Phys. Commun., 2019

Are we witnessing the spectre of an HPC meltdown?
Concurr. Comput. Pract. Exp., 2019

Optimization of Condensed Matter Physics Application with OpenMP Tasking Model.
Proceedings of the OpenMP: Conquering the Full Hardware Spectrum, 2019

2018
Compile-Time Library Call Detection Using CAASCADE and XALT.
Proceedings of the High Performance Computing, 2018

Porting DMRG++ Scientific Application to OpenPOWER.
Proceedings of the High Performance Computing, 2018

Evaluating Performance Portability of Accelerator Programming Models using SPEC ACCEL 1.2 Benchmarks.
Proceedings of the High Performance Computing, 2018

Abstractions and Directives for Adapting Wavefront Algorithms to Future Architectures.
Proceedings of the Platform for Advanced Scientific Computing Conference, 2018

MemBrain: Automated Application Guidance for Hybrid Memory Systems.
Proceedings of the 2018 IEEE International Conference on Networking, 2018

OpenMP 4.5 Validation and Verification Suite for Device Offload.
Proceedings of the Evolving OpenMP for Evolving Architectures, 2018

Evaluating Support for OpenMP Offload Features.
Proceedings of the 47th International Conference on Parallel Processing, 2018

MiniApp for Density Matrix Renormalization Group Hamiltonian Application Kernel.
Proceedings of the IEEE International Conference on Cluster Computing, 2018

2017
OpenACC 2.5 Validation Testsuite Targeting Multiple Architectures.
Proceedings of the High Performance Computing, 2017

CAASCADE: A System for Static Analysis of HPC Software Application Portfolios.
Proceedings of the Programming and Performance Visualization Tools, 2017

One-Way Wave Equation Migration at Scale on GPUs Using Directive Based Programming.
Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium, 2017

Compiler technologies for understanding legacy scientific code: A case study on an ACME land module.
Proceedings of the International Conference on Computational Science, 2017

2016
Using C++ AMP to Accelerate HPC Applications on Multiple Platforms.
Proceedings of the High Performance Computing, 2016

Towards Achieving Performance Portability Using Directives for Accelerators.
Proceedings of the Third Workshop on Accelerator Programming Using Directives, 2016

Evaluating OpenMP Affinity on the POWER8 Architecture.
Proceedings of the OpenMP: Memory, Devices, and Tasks, 2016

2015
Exploring OpenSHMEM Model to Program GPU-based Extreme-Scale Systems.
Proceedings of the OpenSHMEM and Related Technologies. Experiences, Implementations, and Technologies, 2015

CUDA Grid-Level Task Progression Algorithms.
Proceedings of the 17th IEEE International Conference on High Performance Computing and Communications, 2015

2014
Extending the OpenSHMEM Memory Model to Support User-Defined Spaces.
Proceedings of the 8th International Conference on Partitioned Global Address Space Programming Models, 2014

Extending the OpenSHMEM Analyzer to Perform Synchronization and Multi-valued Analysis.
Proceedings of the OpenSHMEM and Related Technologies. Experiences, Implementations, and Tools, 2014

OpenSHMEM Extensions and a Vision for Its Future Direction.
Proceedings of the OpenSHMEM and Related Technologies. Experiences, Implementations, and Tools, 2014

A Global View Programming Abstraction for Transitioning MPI Codes to PGAS Languages.
Proceedings of the OpenSHMEM and Related Technologies. Experiences, Implementations, and Tools, 2014

Analyzing the Energy and Power Consumption of Remote Memory Accesses in the OpenSHMEM Model.
Proceedings of the OpenSHMEM and Related Technologies. Experiences, Implementations, and Tools, 2014

Hybrid Programming Using OpenSHMEM and OpenACC.
Proceedings of the OpenSHMEM and Related Technologies. Experiences, Implementations, and Tools, 2014

A Validation Testsuite for OpenACC 1.0.
Proceedings of the 2014 IEEE International Parallel & Distributed Processing Symposium Workshops, 2014

Power Consumption Due to Data Movement in Distributed Programming Models.
Proceedings of the Euro-Par 2014 Parallel Processing, 2014

HERCULES/PL: the pattern language of HERCULES.
Proceedings of the 1st Workshop on Programming Language Evolution, 2014

2013
Experiences Developing the OpenUH Compiler and Runtime Infrastructure.
Int. J. Parallel Program., 2013

KLONOS: Similarity-based planning tool support for porting scientific applications.
Concurr. Comput. Pract. Exp., 2013

Porting Applications with OpenMP Using Similarity Analysis.
Proceedings of the Languages and Compilers for Parallel Computing, 2013

Evaluating OpenMP Tasking at Scale for the Computation of Graph Hyperbolicity.
Proceedings of the OpenMP in the Era of Low Power Devices and Accelerators, 2013

Toward Abstracting the Communication Intent in Applications to Improve Portability and Productivity.
Proceedings of the 2013 IEEE International Symposium on Parallel & Distributed Processing, 2013

2012
HERCULES: A Pattern Driven Code Transformation System.
Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium Workshops & PhD Forum, 2012

A Similarity-Based Analysis Tool for Porting OpenMP Applications.
Proceedings of the Facing the Multicore-Challenge, 2012

2011
OpenSHMEM - Toward a Unified RMA Model.
Proceedings of the Encyclopedia of Parallel Computing, 2011

Experiences with High-Level Programming Directives for Porting Applications to GPUs.
Proceedings of the Facing the Multicore - Challenge II, 2011

2009
Scalability Evaluation of Barrier Algorithms for OpenMP.
Proceedings of the Evolving OpenMP in an Age of Extreme Parallelism, 2009

Open Source Software Support for the OpenMP Runtime API for Profiling.
Proceedings of the ICPPW 2009, 2009

2008
A performance tuning methodology with compiler support.
Sci. Program., 2008

Capturing performance knowledge for automated analysis.
Proceedings of the ACM/IEEE Conference on High Performance Computing, 2008

2007
OpenUH: an optimizing, portable OpenMP compiler.
Concurr. Comput. Pract. Exp., 2007

Compiler Support for Efficient Instrumentation.
Proceedings of the Parallel Computing: Architectures, 2007

Towards an Implementation of the OpenMP Collector API.
Proceedings of the Parallel Computing: Architectures, 2007

2006
Performance Instrumentation and Compiler Optimizations for MPI/OpenMP Applications.
Proceedings of the OpenMP Shared Memory Parallel Programming - International Workshops, 2006

2004
Dragon: A Static and Dynamic Tool for OpenMP.
Proceedings of the Shared Memory Parallel Programming with OpenMP, 2004

A Tool to Display Array Access Patterns in OpenMP Programs.
Proceedings of the Applied Parallel Computing, 2004

2003
Analyses for the Translation of OpenMP Codes into SPMD Style with Array Privatization.
Proceedings of the OpenMP Shared Memory Parallel Programming, 2003

2002
Improving the Performance of OpenMP by Array Privatization.
Proceedings of the OpenMP Shared Memory Parallel Programming, 2002

2001
Program Development Environment for OpenMP Programs on ccNUMA Architectures.
Proceedings of the Large-Scale Scientific Computing, Third International Conference, 2001

