Pouya Kousha

Orcid: 0009-0004-7507-0940

According to our database, Pouya Kousha authored at least 18 papers between 2018 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Design and Implementation of an IPC-based Collective MPI Library for Intel GPUs.
Proceedings of the Practice and Experience in Advanced Research Computing 2024: Human Powered Computing, 2024

2023
SAI: AI-Enabled Speech Assistant Interface for Science Gateways in HPC.
Proceedings of the High Performance Computing - 38th International Conference, 2023

Democratizing HPC Access and Use with Knowledge Graphs.
Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023

MPI-xCCL: A Portable MPI Library over Collective Communication Libraries for Various Accelerators.
Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023

Benchmarking Modern Databases for Storing and Profiling Very Large Scale HPC Communication Data.
Proceedings of the Benchmarking, Measuring, and Optimizing, 2023

2022
Accelerating MPI All-to-All Communication with Online Compression on Modern GPU Clusters.
Proceedings of the High Performance Computing - 37th International Conference, 2022

"Hey CAI" - Conversational AI Enabled User Interface for HPC Tools.
Proceedings of the High Performance Computing - 37th International Conference, 2022

Hy-Fi: Hybrid Five-Dimensional Parallel DNN Training on High-Performance GPU Clusters.
Proceedings of the High Performance Computing - 37th International Conference, 2022

2021
Cross-layer Visualization and Profiling of Network and I/O Communication for HPC Clusters.
CoRR, 2021

INAM: Cross-stack Profiling and Analysis of Communication in MPI-based Applications.
Proceedings of the PEARC '21: Practice and Experience in Advanced Research Computing, 2021

Designing High-Performance MPI Libraries with On-the-fly Compression for Modern GPU Clusters.
Proceedings of the 35th IEEE International Parallel and Distributed Processing Symposium, 2021

DistMILE: A Distributed Multi-Level Framework for Scalable Graph Embedding.
Proceedings of the 28th IEEE International Conference on High Performance Computing, 2021

2020
Accelerated Real-time Network Monitoring and Profiling at Scale using OSU INAM.
Proceedings of the PEARC '20: Practice and Experience in Advanced Research Computing, 2020

NV-group: link-efficient reduction for distributed deep learning on modern dense GPU systems.
Proceedings of the ICS '20: 2020 International Conference on Supercomputing, 2020

2019
Efficient design for MPI asynchronous progress without dedicated resources.
Parallel Comput., 2019

Designing a Profiling and Visualization Tool for Scalable and In-depth Analysis of High-Performance GPU Clusters.
Proceedings of the 26th IEEE International Conference on High Performance Computing, 2019

2018
Efficient Asynchronous Communication Progress for MPI without Dedicated Resources.
Proceedings of the 25th European MPI Users' Group Meeting, 2018

SALaR: Scalable and Adaptive Designs for Large Message Reduction Collectives.
Proceedings of the IEEE International Conference on Cluster Computing, 2018

