2025
Teaching An Old Dog New Tricks: Porting Legacy Code to Heterogeneous Compute Architectures With Automated Code Translation.
CoRR, February, 2025
Monolithic Algebraic Multigrid Preconditioners for the Stokes Equations.
SIAM J. Sci. Comput., 2025
Corrigendum to "A TVD neural network closure and application to turbulent combustion" [Journal of Computational Physics 523 (2025)/113638].
J. Comput. Phys., 2025
A TVD neural network closure and application to turbulent combustion.
J. Comput. Phys., 2025
Exploiting mesh structure to improve multigrid performance for saddle-point problems.
Int. J. High Perform. Comput. Appl., 2025
2024
Generalizing reduction-based algebraic multigrid.
Numer. Linear Algebra Appl., May, 2024
Generalizing Lloyd's Algorithm for Graph Clustering.
SIAM J. Sci. Comput., 2024
Monolithic Multigrid Preconditioners for High-Order Discretizations of Stokes Equations.
CoRR, 2024
Learning from Integral Losses in Physics Informed Neural Networks.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
2023
Parallel Energy-Minimization Prolongation for Algebraic Multigrid.
SIAM J. Sci. Comput., October, 2023
Performance Analysis and Optimal Node-aware Communication for Enlarged Conjugate Gradient Methods.
ACM Trans. Parallel Comput., March, 2023
Characterizing the performance of node-aware strategies for irregular point-to-point communication on heterogeneous architectures.
Parallel Comput., 2023
PyAMG: Algebraic Multigrid Solvers in Python.
J. Open Source Softw., 2023
Coarse-grid selection using simulated annealing.
J. Comput. Appl. Math., 2023
MG-GNN: Multigrid Graph Neural Networks for Learning Multilevel Domain Decomposition Methods.
Proceedings of the International Conference on Machine Learning, 2023
2022
Tausch: A halo exchange library for large heterogeneous computing systems using MPI, OpenCL, and CUDA.
Parallel Comput., 2022
Low-order preconditioning of the Stokes equations.
Numer. Linear Algebra Appl., 2022
PyAMG: Algebraic Multigrid Solvers in Python.
J. Open Source Softw., 2022
Optimized Sparse Matrix Operations for Reverse Mode Automatic Differentiation.
CoRR, 2022
On Computing Coercivity Constants in Linear Variational Problems Through Eigenvalue Analysis.
CoRR, 2022
Performance of Low Synchronization Orthogonalization Methods in Anderson Accelerated Fixed Point Solvers.
Proceedings of the 2022 SIAM Conference on Parallel Processing for Scientific Computing, 2022
Learning Interface Conditions in Domain Decomposition Solvers.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
2021
A Least-Squares Finite Element Reduced Basis Method.
SIAM J. Sci. Comput., 2021
Reduced Basis Approximations of Parameterized Dynamical Partial Differential Equations via Neural Networks.
CoRR, 2021
Optimization-Based Algebraic Multigrid Coarsening Using Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Modeling Data Movement Performance on Heterogeneous Architectures.
Proceedings of the 2021 IEEE High Performance Extreme Computing Conference, 2021
2020
Scalable line and plane relaxation in a parallel structured multigrid solver.
Parallel Comput., 2020
FFT, FMM, and multigrid on the road to exascale: Performance challenges and opportunities.
J. Parallel Distributed Comput., 2020
Reducing communication in algebraic multigrid with multi-step node aware communication.
Int. J. High Perform. Comput. Appl., 2020
2019
Node aware sparse matrix-vector multiplication.
J. Parallel Distributed Comput., 2019
A massively scalable distributed multigrid framework for nonlinear marine hydrodynamics.
Int. J. High Perform. Comput. Appl., 2019
Exploring the feasibility of lossy compression for PDE simulations.
Int. J. High Perform. Comput. Appl., 2019
Node-Aware Improvements to Allreduce.
CoRR, 2019
FaultSight: A Fault Analysis Tool for HPC Researchers.
Proceedings of the 9th IEEE/ACM Workshop on Fault Tolerance for HPC at eXtreme Scale, 2019
Learning with Analytical Models.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium Workshops, 2019
2018
Scaling Structured Multigrid to 500K+ Cores Through Coarse-Grid Redistribution.
SIAM J. Sci. Comput., 2018
High-order finite element-integral equation coupling on embedded meshes.
J. Comput. Phys., 2018
Improving Performance Models for Irregular Point-to-Point Communication.
Proceedings of the 25th European MPI Users' Group Meeting, 2018
2017
A Root-Node-Based Algebraic Multigrid Method.
SIAM J. Sci. Comput., 2017
Efficient parallel optimization of volume meshes on heterogeneous computing systems.
Eng. Comput., 2017
Towards a More Complete Understanding of SDC Propagation.
Proceedings of the 26th International Symposium on High-Performance Parallel and Distributed Computing, 2017
2016
Reducing Parallel Communication in Algebraic Multigrid through Sparsification.
SIAM J. Sci. Comput., 2016
A Finite Element Based P<sup>3</sup>M Method for N-Body Problems.
SIAM J. Sci. Comput., 2016
A hybrid format for better performance of sparse matrix-vector multiplication on a GPU.
Int. J. High Perform. Comput. Appl., 2016
TAPSpMV: Topology-Aware Parallel Sparse Matrix Vector Multiplication.
CoRR, 2016
Modeling MPI Communication Performance on SMP Nodes: Is it Time to Retire the Ping Pong Test.
Proceedings of the 23rd European MPI Users' Group Meeting, EuroMPI 2016, 2016
IPAS: intelligent protection against silent output corruption in scientific applications.
Proceedings of the 2016 International Symposium on Code Generation and Optimization, 2016
2015
Optimizing Sparse Matrix - Matrix Multiplication for the GPU.
ACM Trans. Math. Softw., 2015
A Finite Element Based P3M Method for N-body Problems.
CoRR, 2015
Towards a more fault resilient multigrid solver.
Proceedings of the Symposium on High Performance Computing, 2015
Optimizing Sparse Matrix Operations on GPUs Using Merge Path.
Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium, 2015
Understanding the Propagation of Error Due to a Silent Data Corruption in a Sparse Matrix Vector Multiply.
Proceedings of the 2015 IEEE International Conference on Cluster Computing, 2015
2014
Enhancing Least-Squares Finite Element Methods Through a Quantity-of-Interest.
SIAM J. Numer. Anal., 2014
Theoretical bounds for algebraic multigrid performance: review and analysis.
Numer. Linear Algebra Appl., 2014
FlipIt: An LLVM Based Fault Injector for HPC.
Proceedings of the Euro-Par 2014: Parallel Processing Workshops, 2014
2013
Efficient GPU-based Optimization of Volume Meshes.
Proceedings of the Parallel Computing: Accelerating Computational Science and Engineering (CSE), 2013
2012
Exposing Fine-Grained Parallelism in Algebraic Multigrid Methods.
SIAM J. Sci. Comput., 2012
A weighted adaptive least-squares finite element method for the Poisson-Boltzmann equation.
Appl. Math. Comput., 2012
2011
A General Interpolation Strategy for Algebraic Multigrid Using Energy Minimization.
SIAM J. Sci. Comput., 2011
Algebraic Multigrid for High-Order Hierarchical H(curl) Finite Elements.
SIAM J. Sci. Comput., 2011
Finite Element Approximation to a Finite-Size Modified Poisson-Boltzmann Equation.
J. Sci. Comput., 2011
Smoothed aggregation multigrid solvers for high-order discontinuous Galerkin methods for elliptic problems.
J. Comput. Phys., 2011
2010
A new perspective on strength measures in algebraic multigrid.
Numer. Linear Algebra Appl., 2010
Smoothed aggregation for Helmholtz problems.
Numer. Linear Algebra Appl., 2010
A spectral boundary integral method for flowing blood cells.
J. Comput. Phys., 2010
A first-order system least-squares finite element method for the Poisson-Boltzmann equation.
J. Comput. Chem., 2010
2008
Algebraic multigrid for <i>k</i>-form Laplacians.
Numer. Linear Algebra Appl., 2008
2007
Algebraic Multigrid Preconditioning of High-Order Spectral Elements for Elliptic Problems on a Simplicial Mesh.
SIAM J. Sci. Comput., 2007
Parallel coarse-grid selection.
Numer. Linear Algebra Appl., 2007
2005
Numerical Conservation Properties of H(div)-Conforming Least-Squares Finite Element Methods for the Burgers Equation.
SIAM J. Sci. Comput., 2005
2004
Least-Squares Finite Element Methods and Algebraic Multigrid Solvers for Linear Hyperbolic PDEs.
SIAM J. Sci. Comput., 2004