A Microbenchmark Framework for Performance Evaluation of OpenMP Target Offloading.
CoRR, March, 2025
Towards accelerating particle-resolved direct numerical simulation with neural operators.
,
,
,
,
,
,
,
,
,
,
Stat. Anal. Data Min., June, 2024
Unpaired image translation to mitigate domain shift in liquid argon time projection chamber detector responses.
Mach. Learn. Sci. Technol., 2024
Fourier neural operators for spatiotemporal dynamics in two-dimensional turbulence.
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the SC24-W: Workshops of the International Conference for High Performance Computing, 2024
Accelerating scientific discoveries through data-driven innovations.
Patterns, November, 2023
Evaluating Portable Parallelization Strategies for Heterogeneous Architectures in High Energy Physics.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2023
Unsupervised Domain Transfer for Science: Exploring Deep Learning Methods for Translation between LArTPC Detector Simulations with Differing Response Models.
CoRR, 2023
Portable Programming Model Exploration for LArTPC Simulation in a Heterogeneous Computing Environment: OpenMP vs. SYCL.
CoRR, 2023
Rethinking CycleGAN: Improving Quality of GANs for Unpaired Image-to-Image Translation.
CoRR, 2023
UVCGAN: UNet Vision Transformer cycle-consistent GAN for unpaired image-to-image translation.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023
OpenMP Advisor: A Compiler Tool for Heterogeneous Architectures.
Proceedings of the OpenMP: Advanced Task-Based, Device and Compiler Programming, 2023
COMPOFF: A Compiler Cost model using Machine Learning to predict the Cost of OpenMP Offloading.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2022
Methods and Results for Quantum Optimal Pulse Control on Superconducting Qubit Systems.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2022
Porting HEP Parameterized Calorimeter Simulation Code to GPUs.
Frontiers Big Data, 2021
Evaluation of Portable Acceleration Solutions for LArTPC Simulation Using Wire-Cell Toolkit.
CoRR, 2021
Outcomes of OpenMP Hackathon: OpenMP Application Experiences with the Offloading Model (Part II).
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the OpenMP: Enabling Massive Node-Level Parallelism, 2021
Outcomes of OpenMP Hackathon: OpenMP Application Experiences with the Offloading Model (Part I).
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the OpenMP: Enabling Massive Node-Level Parallelism, 2021
Approximate Inverse Chain Preconditioner: Iteration Count Case Study for Spectral Support Solvers.
Proceedings of the 2020 IEEE High Performance Extreme Computing Conference, 2020
Best Practices in Running Collaborative GPU Hackathons: Advancing Scientific Applications with a Sustained Impact.
Comput. Sci. Eng., 2018
High-Performance Multi-Mode Ptychography Reconstruction on Distributed GPUs.
CoRR, 2018
Performance Portability Strategies for Grid C++ Expression Templates.
CoRR, 2017
A Scalable Task Parallelism Approach for LU Decomposition with Multicore CPUs.
Proceedings of the Second International Workshop on Extreme Scale Programming Models and Middleware, 2016
Polyhedral user mapping and assistant visualizer tool for the r-stream auto-parallelizing compiler.
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the 3rd IEEE Working Conference on Software Visualization, 2015
Accelerating Ab Initio Nucleon Structure Calculations with All-Mode-Averaging on Gordon.
Proceedings of the Annual Conference of the Extreme Science and Engineering Discovery Environment, 2014