Pavan Balaji
ORCID: 0000-0001-7830-0001
Affiliations:
- Argonne National Laboratory
According to our database, Pavan Balaji authored at least 225 papers between 2002 and 2024.
Accelerating Communication in Deep Learning Recommendation Model Training with Dual-Level Adaptive Lossy Compression.
Proceedings of the International Conference for High Performance Computing, 2024
Proceedings of the 61st ACM/IEEE Design Automation Conference, 2024
IEEE Trans. Parallel Distributed Syst., 2023
IEEE Trans. Parallel Distributed Syst., 2021
Proceedings of the International Conference for High Performance Computing, 2021
Proceedings of the PPoPP '21: 26th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2021
Proceedings of the OpenSHMEM and Related Technologies. OpenSHMEM in the Era of Exascale and Smart Networks, 2021
Proceedings of the IEEE International Conference on Cluster Computing, 2021
Proceedings of the 21st IEEE/ACM International Symposium on Cluster, 2021
IEEE Trans. Parallel Distributed Syst., 2020
IEEE Trans. Parallel Distributed Syst., 2020
IEEE Trans. Computers, 2020
Proceedings of the International Conference for High Performance Computing, 2020
Proceedings of the Workshop on Exascale MPI, 2020
Proceedings of the ICS '20: 2020 International Conference on Supercomputing, 2020
Proceedings of the 22nd IEEE International Conference on High Performance Computing and Communications; 18th IEEE International Conference on Smart City; 6th IEEE International Conference on Data Science and Systems, 2020
ACM Trans. Parallel Comput., 2019
International workshop on programming models and applications for multicores and manycores (PMAM 2018).
Parallel Comput., 2019
Foreword to the special issue for the Workshop on Parallel Programming Models and Systems Software for High-End Computing (P2S2 2017).
Parallel Comput., 2019
Characterization of Power Usage and Performance in Data-Intensive Applications Using MapReduce over MPI.
Proceedings of the Parallel Computing: Technology Trends, 2019
Proceedings of the ACM International Conference on Supercomputing, 2019
Proceedings of the 48th International Conference on Parallel Processing, 2019
Proceedings of the 21st IEEE International Conference on High Performance Computing and Communications; 17th IEEE International Conference on Smart City; 5th IEEE International Conference on Data Science and Systems, 2019
Proceedings of the 28th International Conference on Parallel Architectures and Compilation Techniques, 2019
IEEE Trans. Parallel Distributed Syst., 2018
IEEE Trans. Parallel Distributed Syst., 2018
Exploring the interoperability of remote GPGPU virtualization using rCUDA and directive-based programming models.
J. Supercomput., 2018
On the adequacy of lightweight thread approaches for high-level parallel programming models.
Future Gener. Comput. Syst., 2018
Proceedings of the International Conference for High Performance Computing, 2018
Proceedings of the International Conference for High Performance Computing, 2018
Proceedings of the 24th IEEE International Conference on Parallel and Distributed Systems, 2018
Proceedings of the 24th IEEE International Conference on Parallel and Distributed Systems, 2018
Proceedings of the 27th International Symposium on High-Performance Parallel and Distributed Computing, 2018
Proceedings of the Big Data - BigData 2018, 2018
Enabling scalable and accurate clustering of distributed ligand geometries on supercomputers.
Parallel Comput., 2017
Int. J. High Perform. Comput. Appl., 2017
Int. J. High Perform. Comput. Appl., 2017
Foreword to the Special Issue of the workshop on the seventh international workshop on programming models and applications for multicores and manycores (PMAM 2016).
Concurr. Comput. Pract. Exp., 2017
Proceedings of the International Conference for High Performance Computing, 2017
Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium, 2017
Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium, 2017
Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium Workshops, 2017
Proceedings of the 46th International Conference on Parallel Processing, 2017
Proceedings of the 23rd IEEE International Conference on Parallel and Distributed Systems, 2017
Proceedings of the 23rd IEEE International Conference on Parallel and Distributed Systems, 2017
Proceedings of the 23rd IEEE International Conference on Parallel and Distributed Systems, 2017
Proceedings of the 23rd IEEE International Conference on Parallel and Distributed Systems, 2017
Proceedings of the 19th IEEE International Conference on High Performance Computing and Communications; 15th IEEE International Conference on Smart City; 3rd IEEE International Conference on Data Science and Systems, 2017
Proceedings of the 19th IEEE International Conference on High Performance Computing and Communications; 15th IEEE International Conference on Smart City; 3rd IEEE International Conference on Data Science and Systems, 2017
Proceedings of the 24th IEEE International Conference on High Performance Computing, 2017
Proceedings of the Euro-Par 2017: Parallel Processing - 23rd International Conference on Parallel and Distributed Computing, Santiago de Compostela, Spain, August 28, 2017
Proceedings of the 2017 IEEE International Conference on Cluster Computing, 2017
Proceedings of the 17th IEEE/ACM International Symposium on Cluster, 2017
Proceedings of the 17th IEEE/ACM International Symposium on Cluster, 2017
Proceedings of the 17th IEEE/ACM International Symposium on Cluster, 2017
IEEE Trans. Parallel Distributed Syst., 2016
IEEE Syst. J., 2016
A data-oriented profiler to assist in data partitioning and distribution for heterogeneous memory in HPC.
Parallel Comput., 2016
Special Issue on Parallel Programming Models and Systems Software for High-End Computing.
Parallel Comput., 2016
Parallel Comput., 2016
Performance analysis of data intensive cloud systems based on data management and replication: a survey.
Distributed Parallel Databases, 2016
Concurr. Comput. Pract. Exp., 2016
Concurr. Comput. Pract. Exp., 2016
Work stealing for GPU-accelerated parallel programs in a global address space framework.
Concurr. Comput. Pract. Exp., 2016
A survey and taxonomy on energy efficient resource allocation techniques for cloud computing systems.
Computing, 2016
Proceedings of the OpenMP: Memory, Devices, and Tasks, 2016
Proceedings of the 15th International Symposium on Parallel and Distributed Computing, 2016
Proceedings of the 45th International Conference on Parallel Processing, 2016
One-Sided Interface for Matrix Operations Using MPI-3 RMA: A Case Study with Elemental.
Proceedings of the 45th International Conference on Parallel Processing, 2016
Proceedings of the 2016 IEEE International Conference on Cluster Computing, 2016
Proceedings of the 2016 IEEE International Conference on Cluster Computing, 2016
Proceedings of the Handbook on Data Centers, 2015
Improving concurrency and asynchrony in multithreaded MPI applications using software offloading.
Proceedings of the International Conference for High Performance Computing, 2015
Proceedings of the International Conference for High Performance Computing, 2015
Proceedings of the International Conference for High Performance Computing, 2015
Proceedings of the 20th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2015
Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium, 2015
Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium Workshop, 2015
Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium Workshop, 2015
Proceedings of the 21st IEEE International Conference on Parallel and Distributed Systems, 2015
Versioned Distributed Arrays for Resilience in Scientific Applications: Global View Resilience.
Proceedings of the International Conference on Computational Science, 2015
Proceedings of the 17th IEEE International Conference on High Performance Computing and Communications, 2015
Exploring the Suitability of Remote GPGPU Virtualization for the OpenACC Programming Model Using rCUDA.
Proceedings of the 2015 IEEE International Conference on Cluster Computing, 2015
Proceedings of the 2015 IEEE International Conference on Cluster Computing, 2015
Proceedings of the 2015 IEEE International Conference on Cluster Computing, 2015
Proceedings of the 2015 IEEE International Conference on Cluster Computing, 2015
Analyzing MPI-3.0 Process-Level Shared Memory: A Case Study with Stencil Computations.
Proceedings of the 15th IEEE/ACM International Symposium on Cluster, 2015
Proceedings of the 15th IEEE/ACM International Symposium on Cluster, 2015
Proceedings of the 15th IEEE/ACM International Symposium on Cluster, 2015
Proceedings of the 15th IEEE/ACM International Symposium on Cluster, 2015
Proceedings of the 15th IEEE/ACM International Symposium on Cluster, 2015
Proceedings of the 15th IEEE/ACM International Symposium on Cluster, 2015
Proceedings of the 15th IEEE/ACM International Symposium on Cluster, 2015
Proceedings of the 15th IEEE/ACM International Symposium on Cluster, 2015
SWAP-Assembler 2: Scalable Genome Assembler towards Millions of Cores - Practice and Experience.
Proceedings of the 15th IEEE/ACM International Symposium on Cluster, 2015
Proceedings of the 15th IEEE/ACM International Symposium on Cluster, 2015
Characterizing MPI and Hybrid MPI+Threads Applications at Scale: Case Study with BFS.
Proceedings of the 15th IEEE/ACM International Symposium on Cluster, 2015
IEEE Trans. Parallel Distributed Syst., 2014
Special issue on programming models and applications for multicores and manycores - Guest Editors' Introduction.
Parallel Comput., 2014
Int. J. High Perform. Comput. Appl., 2014
BMC Bioinform., 2014
Proceedings of the International Conference for High Performance Computing, 2014
Proceedings of the International Conference for High Performance Computing, 2014
Proceedings of the 2014 Workshop on Exascale MPI, 2014
Proceedings of the 21st European MPI Users' Group Meeting, 2014
Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2014
Proceedings of the 2014 International Conference on Supercomputing, 2014
Proceedings of the 43rd International Conference on Parallel Processing Workshops, 2014
Proceedings of the 20th IEEE International Conference on Parallel and Distributed Systems, 2014
Proceedings of the 2014 IEEE International Conference on Cluster Computing, 2014
J. Supercomput., 2013
Parallel Comput., 2013
Special issue on programming models, systems software, and tools for High-End Computing.
Parallel Comput., 2013
Guest Editors' Introduction: Special Issue on Applications for the Heterogeneous Computing Era.
Int. J. High Perform. Comput. Appl., 2013
Future Gener. Comput. Syst., 2013
MPI + MPI: a new hybrid approach to parallel programming with MPI plus shared memory.
Computing, 2013
Clust. Comput., 2013
Proceedings of the Supercomputing - 28th International Supercomputing Conference, 2013
Proceedings of the 20th European MPI Users' Group Meeting, 2013
Proceedings of the 20th European MPI Users' Group Meeting, 2013
Proceedings of the 2013 IEEE International Symposium on Parallel & Distributed Processing, 2013
Proceedings of the 42nd International Conference on Parallel Processing, 2013
Enhancing Performance Portability of MPI Applications through Annotation-Based Transformations.
Proceedings of the 42nd International Conference on Parallel Processing, 2013
Proceedings of the 19th IEEE International Conference on Parallel and Distributed Systems, 2013
Proceedings of the 19th IEEE International Conference on Parallel and Distributed Systems, 2013
Proceedings of the IEEE 33rd International Conference on Distributed Computing Systems, 2013
Proceedings of the 22nd International Symposium on High-Performance Parallel and Distributed Computing, 2013
Proceedings of the 10th IEEE International Conference on High Performance Computing and Communications & 2013 IEEE International Conference on Embedded and Ubiquitous Computing, 2013
Proceedings of the Euro-Par 2013 Parallel Processing, 2013
Proceedings of the IEEE 11th International Conference on Dependable, 2013
Proceedings of the 13th IEEE/ACM International Symposium on Cluster, 2013
Optimizing Burrows-Wheeler Transform-Based Sequence Alignment on Multicore Architectures.
Proceedings of the 13th IEEE/ACM International Symposium on Cluster, 2013
Int. J. High Perform. Comput. Appl., 2012
Proceedings of the Recent Advances in the Message Passing Interface, 2012
Proceedings of the Recent Advances in the Message Passing Interface, 2012
Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium Workshops & PhD Forum, 2012
Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium, 2012
Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium Workshops & PhD Forum, 2012
Proceedings of the 14th IEEE International Conference on High Performance Computing and Communication & 9th IEEE International Conference on Embedded Software and Systems, 2012
MPI-ACC: An Integrated and Extensible Approach to Data Movement in Accelerator-based Systems.
Proceedings of the 14th IEEE International Conference on High Performance Computing and Communication & 9th IEEE International Conference on Embedded Software and Systems, 2012
Proceedings of the 2012 IEEE International Conference on Cluster Computing, 2012
Proceedings of the 12th IEEE/ACM International Symposium on Cluster, 2012
Int. J. High Perform. Comput. Appl., 2011
Special Issue on Programming Models and Systems Software Support for High-End Computing Applications.
Int. J. High Perform. Comput. Appl., 2011
Mapping communication layouts to network hardware characteristics on massive-scale Blue Gene systems.
Comput. Sci. Res. Dev., 2011
Poster: High-level, one-sided programming models on MPI: a case study with global arrays and NWChem.
Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis, 2011
Proceedings of the Recent Advances in the Message Passing Interface, 2011
Proceedings of the Recent Advances in the Message Passing Interface, 2011
Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011
Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011
Proceedings of the 18th International Conference on High Performance Computing, 2011
Proceedings of the 2011 International Conference on Cloud and Service Computing, 2011
Int. J. High Perform. Comput. Appl., 2010
Int. J. High Perform. Comput. Appl., 2010
Int. J. High Perform. Comput. Appl., 2010
Proceedings of the Recent Advances in the Message Passing Interface, 2010
Proceedings of the Recent Advances in the Message Passing Interface, 2010
Proceedings of the Recent Advances in the Message Passing Interface, 2010
A study of hardware assisted IP over InfiniBand and its impact on enterprise data center performance.
Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2010
Proceedings of the IEEE 18th Annual Symposium on High Performance Interconnects, 2010
Proceedings of the 2010 International Conference on High Performance Computing, 2010
Proceedings of the 2010 International Conference on High Performance Computing, 2010
Designing Energy Efficient Communication Runtime Systems for Data Centric Programming Models.
Proceedings of the 2010 IEEE/ACM Int'l Conference on Green Computing and Communications, 2010
Proceedings of the 2010 IEEE/ACM Int'l Conference on Green Computing and Communications, 2010
Proceedings of the 2010 IEEE International Conference on Cluster Computing, 2010
Proceedings of the 7th Conference on Computing Frontiers, 2010
ProOnE: a general-purpose protocol onload engine for multi- and many-core architectures.
Comput. Sci. Res. Dev., 2009
Toward message passing for a million processes: characterizing MPI on a massive scale Blue Gene/P.
Comput. Sci. Res. Dev., 2009
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2009
GePSeA: A General-Purpose Software Acceleration Framework for Lightweight Task Offloading.
Proceedings of the ICPP 2009, 2009
Improving Resource Availability by Relaxing Network Allocation Constraints on Blue Gene/P.
Proceedings of the ICPP 2009, 2009
Proceedings of the 15th IEEE International Conference on Parallel and Distributed Systems, 2009
Proceedings of the 15th IEEE International Conference on Parallel and Distributed Systems, 2009
Tutorial: Designing High-End Computing Systems with InfiniBand and 10-Gigabit Ethernet.
Proceedings of the 17th IEEE Symposium on High Performance Interconnects, 2009
Proceedings of the 17th IEEE Symposium on High Performance Interconnects, 2009
Proceedings of the 9th IEEE/ACM International Symposium on Cluster Computing and the Grid, 2009
Asymmetric interactions in symmetric multi-core systems: analysis, enhancements and evaluation.
Proceedings of the ACM/IEEE Conference on High Performance Computing, 2008
Proceedings of the ACM/IEEE Conference on High Performance Computing, 2008
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2008
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2008
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2008
Proceedings of the 13th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2008
Proceedings of the 17th International Conference on Computer Communications and Networks, 2008
Proceedings of the 17th International Symposium on High-Performance Distributed Computing (HPDC-17 2008), 2008
Proceedings of the High Performance Computing, 2008
Communication Analysis of Parallel 3D FFT for Flat Cartesian Meshes on Large Blue Gene Systems.
Proceedings of the High Performance Computing, 2008
Sockets Direct Protocol for Hybrid Network Stacks: A Case Study with iWARP over 10G Ethernet.
Proceedings of the High Performance Computing, 2008
Proceedings of the 2008 IEEE International Conference on Cluster Computing, 29 September, 2008
Analyzing the impact of supporting out-of-order communication on in-order performance with iWARP.
Proceedings of the ACM/IEEE Conference on High Performance Networking and Computing, 2007
Designing Efficient Systems Services and Primitives for Next-Generation Data-Centers.
Proceedings of the 21st International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007
Proceedings of the 21st International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007
Proceedings of the 2007 International Conference on Parallel Processing (ICPP 2007), 2007
Proceedings of the 2007 International Conference on Parallel Processing (ICPP 2007), 2007
Proceedings of the 15th Annual IEEE Symposium on High-Performance Interconnects, 2007
Proceedings of the 2007 IEEE International Conference on Cluster Computing, 2007
Designing next generation data-centers with advanced communication protocols and systems services.
Proceedings of the 20th International Parallel and Distributed Processing Symposium (IPDPS 2006), 2006
Asynchronous zero-copy communication for synchronous sockets in the sockets direct protocol (SDP) over InfiniBand.
Proceedings of the 20th International Parallel and Distributed Processing Symposium (IPDPS 2006), 2006
Exploiting NIC architectural support for enhancing IP-based protocols on high-performance networks.
J. Parallel Distributed Comput., 2005
On the provision of prioritization and soft QoS in dynamically reconfigurable shared data-centers over InfiniBand.
Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2005
Proceedings of the 13th Annual IEEE Symposium on High Performance Interconnects (HOTIC 2005), 2005
Proceedings of the 2005 IEEE International Conference on Cluster Computing (CLUSTER 2005), September 26, 2005
Proceedings of the 2005 IEEE International Conference on Cluster Computing (CLUSTER 2005), September 26, 2005
Architecture for caching responses with multiple dynamic dependencies in multi-tier data-centers over InfiniBand.
Proceedings of the 5th International Symposium on Cluster Computing and the Grid (CCGrid 2005), 2005
Proceedings of the 2004 IEEE International Symposium on Performance Analysis of Systems and Software, 2004
Proceedings of the 2004 IEEE International Conference on Cluster Computing (CLUSTER 2004), 2004
Proceedings of the Job Scheduling Strategies for Parallel Processing, 2003
Efficient Collective Operations Using Remote Memory Operations on VIA-Based Clusters.
Proceedings of the 17th International Parallel and Distributed Processing Symposium (IPDPS 2003), 2003
Proceedings of the 12th International Symposium on High-Performance Distributed Computing (HPDC-12 2003), 2003
Proceedings of the 2002 IEEE International Conference on Cluster Computing (CLUSTER 2002), 2002