Kaushik Kandadi Suresh

Orcid: 0000-0002-3705-2387

According to our database1, Kaushik Kandadi Suresh authored at least 15 papers between 2019 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
HINT: Designing Cache-Efficient MPI_Alltoall using Hybrid Memory Copy Ordering and Non-Temporal Instructions.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2024

2023
Network-Assisted Noncontiguous Transfers for GPU-Aware MPI Libraries.
IEEE Micro, 2023

DPU-Bench: A Micro-Benchmark Suite to Measure Offload Efficiency Of SmartNICs.
Proceedings of the Practice and Experience in Advanced Research Computing, 2023

A Novel Framework for Efficient Offloading of Communication Operations to Bluefield SmartNICs.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2023

In-Depth Evaluation of a Lower-Level Direct-Verbs API on InfiniBand-based Clusters: Early Experiences.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2023

Enabling Reconfigurable HPC through MPI-based Inter-FPGA Communication.
Proceedings of the 37th International Conference on Supercomputing, 2023

Designing In-network Computing Aware Reduction Collectives in MPI.
Proceedings of the IEEE Symposium on High-Performance Interconnects, 2023

Battle of the BlueFields: An In-Depth Comparison of the BlueField-2 and BlueField-3 SmartNICs.
Proceedings of the IEEE Symposium on High-Performance Interconnects, 2023

2022
Network Assisted Non-Contiguous Transfers for GPU-Aware MPI Libraries.
Proceedings of the IEEE Symposium on High-Performance Interconnects, 2022

Efficient Personalized and Non-Personalized Alltoall Communication for Modern Multi-HCA GPU-Based Clusters.
Proceedings of the 29th IEEE International Conference on High Performance Computing, 2022

2021
Layout-aware Hardware-assisted Designs for Derived Data Types in MPI.
Proceedings of the 28th IEEE International Conference on High Performance Computing, 2021

2020
Communication-Aware Hardware-Assisted MPI Overlap Engine.
Proceedings of the High Performance Computing - 35th International Conference, 2020

Scalable MPI Collectives using SHARP: Large Scale Performance Evaluation on the TACC Frontera System.
Proceedings of the Workshop on Exascale MPI, 2020

Performance Characterization of Network Mechanisms for Non-Contiguous Data Transfers in MPI.
Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium Workshops, 2020

2019
Designing a Profiling and Visualization Tool for Scalable and In-depth Analysis of High-Performance GPU Clusters.
Proceedings of the 26th IEEE International Conference on High Performance Computing, 2019


  Loading...