Pouya Kousha

Kawthar Shafie Khorassani

Proceedings of the Practice and Experience in Advanced Research Computing 2024: Human Powered Computing, 2024

2023

SAI: AI-Enabled Speech Assistant Interface for Science Gateways in HPC.

[BibT_eX]

[DOI]

Proceedings of the High Performance Computing - 38th International Conference, 2023

Democratizing HPC Access and Use with Knowledge Graphs.

[BibT_eX]

[DOI]

Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023

MPI-xCCL: A Portable MPI Library over Collective Communication Libraries for Various Accelerators.

[BibT_eX]

[DOI]

Chen-Chun Chen

Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023

Benchmarking Modern Databases for Storing and Profiling Very Large Scale HPC Communication Data.

[BibT_eX]

[DOI]

Proceedings of the Benchmarking, Measuring, and Optimizing, 2023

2022

Accelerating MPI All-to-All Communication with Online Compression on Modern GPU Clusters.

[BibT_eX]

[DOI]

Qinghua Zhou

Kawthar Shafie Khorassani

Quentin Anthony

Aamir Shafi

Proceedings of the High Performance Computing - 37th International Conference, 2022

"Hey CAI" - Conversational AI Enabled User Interface for HPC Tools.

[BibT_eX]

[DOI]

Proceedings of the High Performance Computing - 37th International Conference, 2022

Hy-Fi: Hybrid Five-Dimensional Parallel DNN Training on High-Performance GPU Clusters.

[BibT_eX]

[DOI]

Proceedings of the High Performance Computing - 37th International Conference, 2022

2021

Cross-layer Visualization and Profiling of Network and I/O Communication for HPC Clusters.

[BibT_eX]

[DOI]

CoRR, 2021

INAM: Cross-stack Profiling and Analysis of Communication in MPI-based Applications.

[BibT_eX]

[DOI]

Kamal Raj Sankarapandian Dayala Ganesh Ram

Proceedings of the PEARC '21: Practice and Experience in Advanced Research Computing, 2021

Designing High-Performance MPI Libraries with On-the-fly Compression for Modern GPU Clusters<sup>*</sup>.

[BibT_eX]

[DOI]

Seyedeh Mahdieh Ghazimirsaeed

Proceedings of the 35th IEEE International Parallel and Distributed Processing Symposium, 2021

DistMILE: A Distributed Multi-Level Framework for Scalable Graph Embedding.

[BibT_eX]

[DOI]

Srinivasan Parthasarathy

Proceedings of the 28th IEEE International Conference on High Performance Computing, 2021

2020

Accelerated Real-time Network Monitoring and Profiling at Scale using OSU INAM.

[BibT_eX]

[DOI]

Proceedings of the PEARC '20: Practice and Experience in Advanced Research Computing, 2020

NV-group: link-efficient reduction for distributed deep learning on modern dense GPU systems.

[BibT_eX]

[DOI]

Ching-Hsiang Chu

Kawthar Shafie Khorassani

Ammar Ahmad Awan

Dhabaleswar K. D. K. Panda

Proceedings of the ICS '20: 2020 International Conference on Supercomputing, 2020

2019

Efficient design for MPI asynchronous progress without dedicated resources.

[BibT_eX]

[DOI]

Amit Ruhela

Sourav Chakraborty

Mohammadreza Bayatpour

Parallel Comput., 2019

Designing a Profiling and Visualization Tool for Scalable and In-depth Analysis of High-Performance GPU Clusters.

[BibT_eX]

[DOI]

Bharath Ramesh

Kaushik Kandadi Suresh

Proceedings of the 26th IEEE International Conference on High Performance Computing, 2019

2018

Efficient Asynchronous Communication Progress for MPI without Dedicated Resources.

[BibT_eX]

[DOI]

Amit Ruhela

Sourav Chakraborty

Mohammadreza Bayatpour