Performance Analysis of HPC Applications on the Aurora Supercomputer: Exploring the Impact of HBM-Enabled Intel Xeon Max CPUs.
Proceedings of the ISC High Performance 2025 Research Paper Proceedings (40th International Conference), 2025
FFT, FMM, and multigrid on the road to exascale: Performance challenges and opportunities.
J. Parallel Distributed Comput., 2020
Simulation-Based Performance Prediction of HPC Applications: A Case Study of HPL.
Proceedings of the IEEE/ACM International Workshop on HPC User Support Tools and Workshop on Programming and Performance Visualization Tools, 2020
Learning with Analytical Models.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium Workshops, 2019
Fast multipole preconditioners for sparse matrices arising from elliptic equations.
Comput. Vis. Sci., 2018
Communication Reducing Algorithms for Distributed Hierarchical N-Body Problems with Boundary Distributions.
Proceedings of the High Performance Computing - 32nd International Conference, 2017
Fast Multipole-Based Elliptic PDE Solver and Preconditioner.
PhD thesis, 2016
A generic buffer occupancy expression for stop-and-wait hybrid automatic repeat request protocol over unstable channels.
Telecommun. Syst., 2016
A performance model for the communication in fast multipole methods on high-performance computing platforms.
Int. J. High Perform. Comput. Appl., 2016
Fast Multipole Method as a Matrix-Free Hierarchical Low-Rank Approximation.
CoRR, 2016
A Matrix-free Preconditioner for the Helmholtz Equation based on the Fast Multipole Method.
CoRR, 2016
Petascale molecular dynamics simulation using the fast multipole method on K computer.
,
,
,
,
,
,
,
,
,
,
Comput. Phys. Commun., 2014
A Performance Model for the Communication in Fast Multipole Methods on HPC Platforms.
CoRR, 2014