Krishna Chaitanya Kandalla
According to our database, Krishna Chaitanya Kandalla authored at least 28 papers between 2009 and 2014.
Bibliography
2014
Initial study of multi-endpoint runtime for MPI+OpenMP hybrid programming model on multi-core systems.
Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2014
Designing Topology-Aware Communication Schedules for Alltoall Operations in Large InfiniBand Clusters.
Proceedings of the 43rd International Conference on Parallel Processing, 2014
2013
MVAPICH-PRISM: a proxy-based communication framework using InfiniBand and SCIF for Intel MIC clusters.
Proceedings of the International Conference for High Performance Computing, 2013
Proceedings of the 2013 IEEE International Symposium on Parallel & Distributed Processing, 2013
MIC-RO: enabling efficient remote offload on heterogeneous many integrated core (MIC) clusters with InfiniBand.
Proceedings of the International Conference on Supercomputing, 2013
A Novel Functional Partitioning Approach to Design High-Performance MPI-3 Non-blocking Alltoallv Collective on Multi-core Systems.
Proceedings of the 42nd International Conference on Parallel Processing, 2013
Designing Optimized MPI Broadcast and Allreduce for Many Integrated Core (MIC) InfiniBand Clusters.
Proceedings of the IEEE 21st Annual Symposium on High-Performance Interconnects, 2013
Proceedings of the 2013 IEEE International Conference on Cluster Computing, 2013
Proceedings of the 13th IEEE/ACM International Symposium on Cluster, 2013
Proceedings of the 13th IEEE/ACM International Symposium on Cluster, 2013
2012
Design of a scalable InfiniBand topology service to enable network-topology-aware placement of processes.
Proceedings of the SC Conference on High Performance Computing Networking, 2012
Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium Workshops & PhD Forum, 2012
Designing Non-blocking Allreduce with Collective Offload on InfiniBand Clusters: A Case Study with Conjugate Gradient Solvers.
Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium, 2012
Supporting Hybrid MPI and OpenSHMEM over InfiniBand: Design and Performance Evaluation.
Proceedings of the 41st International Conference on Parallel Processing, 2012
Can Network-Offload Based Non-blocking Neighborhood MPI Collectives Improve Communication Overheads of Irregular Graph Algorithms?
Proceedings of the 2012 IEEE International Conference on Cluster Computing Workshops, 2012
Proceedings of the 12th IEEE/ACM International Symposium on Cluster, 2012
2011
Proceedings of the Encyclopedia of Parallel Computing, 2011
High-performance and scalable non-blocking all-to-all with collective offload on InfiniBand clusters: a study with parallel 3D FFT.
Comput. Sci. Res. Dev., 2011
Designing Non-blocking Broadcast with Collective Offload on InfiniBand Clusters: A Case Study with HPL.
Proceedings of the IEEE 19th Annual Symposium on High Performance Interconnects, 2011
Proceedings of the Euro-Par 2011: Parallel Processing Workshops - CCPI, CGWS, HeteroPar, HiBB, HPCVirt, HPPC, HPSS, MDGS, ProPer, Resilience, UCHPC, VHPC, Bordeaux, France, August 29, 2011
Design and Evaluation of Network Topology-/Speed- Aware Broadcast Algorithms for InfiniBand Clusters.
Proceedings of the 2011 IEEE International Conference on Cluster Computing (CLUSTER), 2011
MPI Alltoall Personalized Exchange on GPGPU Clusters: Design Alternatives and Benefit.
Proceedings of the 2011 IEEE International Conference on Cluster Computing (CLUSTER), 2011
2010
Designing topology-aware collective communication algorithms for large scale InfiniBand clusters: Case studies with Scatter and Gather.
Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010
High Performance Design and Implementation of Nemesis Communication Layer for Two-Sided and One-Sided MPI Semantics in MVAPICH2.
Proceedings of the 39th International Conference on Parallel Processing, 2010
Proceedings of the 39th International Conference on Parallel Processing, 2010
Design and Evaluation of Generalized Collective Communication Primitives with Overlap Using ConnectX-2 Offload Engine.
Proceedings of the IEEE 18th Annual Symposium on High Performance Interconnects, 2010
2009
Proceedings of the 23rd IEEE International Symposium on Parallel and Distributed Processing, 2009