Sayantan Sur
According to our database1,
Sayantan Sur
authored at least 42 papers
between 2004 and 2020.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2020
Minimizing the usage of hardware counters for collective communication using triggered operations.
Parallel Comput., 2020
2019
Parallel Comput., 2019
2017
Proceedings of the International Conference for High Performance Computing, 2017
2016
Proceedings of the OpenSHMEM and Related Technologies. Enhancing OpenSHMEM for Hybrid Environments, 2016
2015
A Brief Introduction to the OpenFabrics Interfaces - A New Network API for Maximizing High Performance Application Efficiency.
Proceedings of the 23rd IEEE Annual Symposium on High-Performance Interconnects, 2015
2014
Proceedings of the 8th International Conference on Partitioned Global Address Space Programming Models, 2014
2011
Proceedings of the Encyclopedia of Parallel Computing, 2011
Comput. Sci. Res. Dev., 2011
High-performance and scalable non-blocking all-to-all with collective offload on InfiniBand clusters: a study with parallel 3D FFT.
Comput. Sci. Res. Dev., 2011
Optimizing MPI One Sided Communication on Multi-core InfiniBand Clusters Using Shared Memory Backed Windows.
Proceedings of the Recent Advances in the Message Passing Interface, 2011
Design and Implementation of Key Proposed MPI-3 One-Sided Communication Semantics on InfiniBand.
Proceedings of the Recent Advances in the Message Passing Interface, 2011
Proceedings of the International Conference on Parallel Processing, 2011
Designing Non-blocking Broadcast with Collective Offload on InfiniBand Clusters: A Case Study with HPL.
Proceedings of the IEEE 19th Annual Symposium on High Performance Interconnects, 2011
Multi-threaded UPC runtime with network endpoints: Design alternatives and evaluation on multi-core architectures.
Proceedings of the 18th International Conference on High Performance Computing, 2011
Proceedings of the Euro-Par 2011: Parallel Processing Workshops - CCPI, CGWS, HeteroPar, HiBB, HPCVirt, HPPC, HPSS, MDGS, ProPer, Resilience, UCHPC, VHPC, Bordeaux, France, August 29, 2011
Optimized Non-contiguous MPI Datatype Communication for GPU Clusters: Design, Implementation and Evaluation with MVAPICH2.
Proceedings of the 2011 IEEE International Conference on Cluster Computing (CLUSTER), 2011
Design and Evaluation of Network Topology-/Speed- Aware Broadcast Algorithms for InfiniBand Clusters.
Proceedings of the 2011 IEEE International Conference on Cluster Computing (CLUSTER), 2011
MPI Alltoall Personalized Exchange on GPGPU Clusters: Design Alternatives and Benefit.
Proceedings of the 2011 IEEE International Conference on Cluster Computing (CLUSTER), 2011
2010
Comput. Sci. Res. Dev., 2010
Proceedings of the Fourth Conference on Partitioned Global Address Space Programming Model, 2010
Quantifying performance benefits of overlap using MPI-2 in a seismic modeling application.
Proceedings of the 24th International Conference on Supercomputing, 2010
High Performance Design and Implementation of Nemesis Communication Layer for Two-Sided and One-Sided MPI Semantics in MVAPICH2.
Proceedings of the 39th International Conference on Parallel Processing, 2010
Improving Application Performance and Predictability Using Multiple Virtual Lanes in Modern Multi-core InfiniBand Clusters.
Proceedings of the 39th International Conference on Parallel Processing, 2010
Proceedings of the 39th International Conference on Parallel Processing, 2010
Design and Evaluation of Generalized Collective Communication Primitives with Overlap Using ConnectX-2 Offload Engine.
Proceedings of the IEEE 18th Annual Symposium on High Performance Interconnects, 2010
Proceedings of the IEEE 18th Annual Symposium on High Performance Interconnects, 2010
2009
Proceedings of the 14th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2009
2007
High performance MPI design using unreliable datagram for ultra-scale InfiniBand clusters.
Proceedings of the 21th Annual International Conference on Supercomputing, 2007
Performance Analysis and Evaluation of Mellanox ConnectX InfiniBand Architecture with Multi-Core Platforms.
Proceedings of the 15th Annual IEEE Symposium on High-Performance Interconnects, 2007
Proceedings of the 2007 IEEE International Conference on Cluster Computing, 2007
Lightweight kernel-level primitives for high-performance MPI intra-node communication over multi-core systems.
Proceedings of the 2007 IEEE International Conference on Cluster Computing, 2007
2006
MPI and communication - High-performance and scalable MPI over InfiniBand with reduced memory usage: an in-depth performance analysis.
Proceedings of the ACM/IEEE SC2006 Conference on High Performance Networking and Computing, 2006
RDMA read based rendezvous protocol for MPI over InfiniBand: design alternatives and benefits.
Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2006
Proceedings of the 20th International Parallel and Distributed Processing Symposium (IPDPS 2006), 2006
2005
Int. J. High Perform. Comput. Appl., 2005
Proceedings of the 19th International Parallel and Distributed Processing Symposium (IPDPS 2005), 2005
Proceedings of the 34th International Conference on Parallel Processing (ICPP 2005), 2005
Proceedings of the 13th Annual IEEE Symposium on High Performance Interconnects (HOTIC 2005), 2005
Proceedings of the High Performance Computing, 2005
2004
Efficient and Scalable All-to-All Personalized Exchange for InfiniBand-Based Clusters.
Proceedings of the 33rd International Conference on Parallel Processing (ICPP 2004), 2004