2025
A Microbenchmark Framework for Performance Evaluation of OpenMP Target Offloading.
CoRR, March, 2025

2024
Towards accelerating particle-resolved direct numerical simulation with neural operators.
Stat. Anal. Data Min., June, 2024

Unpaired image translation to mitigate domain shift in liquid argon time projection chamber detector responses.
Mach. Learn. Sci. Technol., 2024

Quantum-centric supercomputing for materials science: A perspective on challenges and future directions.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Future Gener. Comput. Syst., 2024

Fourier neural operators for spatiotemporal dynamics in two-dimensional turbulence.
Proceedings of the SC24-W: Workshops of the International Conference for High Performance Computing, 2024

2023
Accelerating scientific discoveries through data-driven innovations.
Patterns, November, 2023

Evaluating Portable Parallelization Strategies for Heterogeneous Architectures in High Energy Physics.
CoRR, 2023

Unsupervised Domain Transfer for Science: Exploring Deep Learning Methods for Translation between LArTPC Detector Simulations with Differing Response Models.
CoRR, 2023

Portable Programming Model Exploration for LArTPC Simulation in a Heterogeneous Computing Environment: OpenMP vs. SYCL.
CoRR, 2023

Rethinking CycleGAN: Improving Quality of GANs for Unpaired Image-to-Image Translation.
CoRR, 2023

OpenMP Advisor.
CoRR, 2023

UVCGAN: UNet Vision Transformer cycle-consistent GAN for unpaired image-to-image translation.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

OpenMP Advisor: A Compiler Tool for Heterogeneous Architectures.
Proceedings of the OpenMP: Advanced Task-Based, Device and Compiler Programming, 2023

2022
OpenMP application experiences: Porting to accelerated nodes.
Parallel Comput., 2022

COMPOFF: A Compiler Cost model using Machine Learning to predict the Cost of OpenMP Offloading.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2022

Methods and Results for Quantum Optimal Pulse Control on Superconducting Qubit Systems.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2022

2021
Porting HEP Parameterized Calorimeter Simulation Code to GPUs.
Frontiers Big Data, 2021

Evaluation of Portable Acceleration Solutions for LArTPC Simulation Using Wire-Cell Toolkit.
CoRR, 2021

Outcomes of OpenMP Hackathon: OpenMP Application Experiences with the Offloading Model (Part II).
Proceedings of the OpenMP: Enabling Massive Node-Level Parallelism, 2021

Outcomes of OpenMP Hackathon: OpenMP Application Experiences with the Offloading Model (Part I).
Proceedings of the OpenMP: Enabling Massive Node-Level Parallelism, 2021

2020
Approximate Inverse Chain Preconditioner: Iteration Count Case Study for Spectral Support Solvers.
Proceedings of the 2020 IEEE High Performance Extreme Computing Conference, 2020

2018
Best Practices in Running Collaborative GPU Hackathons: Advancing Scientific Applications with a Sustained Impact.
Comput. Sci. Eng., 2018

High-Performance Multi-Mode Ptychography Reconstruction on Distributed GPUs.
CoRR, 2018

2017
Performance Portability Strategies for Grid C++ Expression Templates.
CoRR, 2017

2016
A Scalable Task Parallelism Approach for LU Decomposition with Multicore CPUs.
Proceedings of the Second International Workshop on Extreme Scale Programming Models and Middleware, 2016

2015
Polyhedral user mapping and assistant visualizer tool for the r-stream auto-parallelizing compiler.
Proceedings of the 3rd IEEE Working Conference on Software Visualization, 2015

2014
Accelerating Ab Initio Nucleon Structure Calculations with All-Mode-Averaging on Gordon.
Proceedings of the Annual Conference of the Extreme Science and Engineering Discovery Environment, 2014