Abhinav Bhatele
ORCID: 0000-0003-3069-3701
According to our database, Abhinav Bhatele authored at least 132 papers between 2007 and 2024.
Bibliography
2024
Design Concerns for Integrated Scripting and Interactive Visualization in Notebook Environments.
IEEE Trans. Vis. Comput. Graph., September, 2024
CoRR, 2024
CoRR, 2024
CoRR, 2024
Proceedings of the ISC High Performance 2024 Research Paper Proceedings (39th International Conference), 2024
Proceedings of the 32nd Euromicro International Conference on Parallel, 2024
Proceedings of the 21st IEEE/ACM International Conference on Mining Software Repositories, 2024
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2024
Proceedings of the 33rd International Symposium on High-Performance Parallel and Distributed Computing, 2024
Proceedings of the 48th IEEE Annual Computers, Software, and Applications Conference, 2024
2023
IEEE Trans. Vis. Comput. Graph., March, 2023
CoRR, 2023
CoRR, 2023
A Novel Tensor-Expert Hybrid Parallelism Approach to Scale Mixture-of-Experts Training.
CoRR, 2023
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2023
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2023
A Hybrid Tensor-Expert-Data Parallelism Approach to Optimize Mixture-of-Experts Training.
Proceedings of the 37th International Conference on Supercomputing, 2023
2022
Designing an Interactive, Notebook-Embedded, Tree Visualization to Support Exploratory Performance Analysis.
CoRR, 2022
Proceedings of the High Performance Computing - 37th International Conference, 2022
AxoNN: An asynchronous, message-driven parallel framework for extreme-scale deep learning.
Proceedings of the 2022 IEEE International Parallel and Distributed Processing Symposium, 2022
Proceedings of the 2022 IEEE International Parallel and Distributed Processing Symposium, 2022
2021
IEEE Trans. Vis. Comput. Graph., 2021
Myelin: An asynchronous, message-driven parallel framework for extreme-scale deep learning.
CoRR, 2021
Proceedings of the IEEE International Performance, 2021
2020
CoRR, 2020
Proceedings of the IEEE/ACM International Workshop on HPC User Support Tools and Workshop on Programming and Performance Visualization Tools, 2020
Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2020
Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2020
Proceedings of the ICS '20: 2020 International Conference on Supercomputing, 2020
Proceedings of the IEEE International Conference on Cluster Computing, 2020
2019
Proceedings of the International Conference for High Performance Computing, 2019
Optimizing computation-communication overlap in asynchronous task-based programs: poster.
Proceedings of the 24th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2019
Analyzing Cost-Performance Tradeoffs of HPC Network Designs under Different Constraints using Simulations.
Proceedings of the 2019 ACM SIGSIM Conference on Principles of Advanced Discrete Simulation, 2019
Proceedings of the ACM International Conference on Supercomputing, 2019
Proceedings of the 26th IEEE International Conference on High Performance Computing, 2019
2018
MemAxes: Visualization and Analytics for Characterizing Complex Memory Performance Behaviors.
IEEE Trans. Vis. Comput. Graph., 2018
Interactive Investigation of Traffic Congestion on Fat-Tree Networks Using TreeScope.
Comput. Graph. Forum, 2018
Proceedings of the International Conference for High Performance Computing, 2018
Proceedings of the International Conference for High Performance Computing, 2018
Proceedings of the Programming and Performance Visualization Tools, 2018
Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium, 2018
Proceedings of the 32nd International Conference on Supercomputing, 2018
Proceedings of the 47th International Conference on Parallel Processing, 2018
2017
J. Parallel Distributed Comput., 2017
Proceedings of the 2017 Winter Simulation Conference, 2017
Proceedings of the International Conference for High Performance Computing, 2017
Proceedings of the International Conference for High Performance Computing, 2017
Proceedings of the International Conference for High Performance Computing, 2017
Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium, 2017
Proceedings of the 37th IEEE International Conference on Distributed Computing Systems Workshops, 2017
Quantifying I/O and Communication Traffic Interference on Dragonfly Networks Equipped with Burst Buffers.
Proceedings of the 2017 IEEE International Conference on Cluster Computing, 2017
Proceedings of the 17th IEEE/ACM International Symposium on Cluster, 2017
Massively Parallel Simulations of Spread of Infectious Diseases over Realistic Social Networks.
Proceedings of the 17th IEEE/ACM International Symposium on Cluster, 2017
2016
IEEE Trans. Parallel Distributed Syst., 2016
Proceedings of the 7th International Workshop on Performance Modeling, 2016
Proceedings of the Third Workshop on Visual Performance Analysis, 2016
Characterizing parallel scientific applications on commodity clusters: an empirical study of a tapered fat-tree.
Proceedings of the International Conference for High Performance Computing, 2016
Proceedings of the International Conference for High Performance Computing, 2016
A machine learning framework for performance coverage analysis of proxy applications.
Proceedings of the International Conference for High Performance Computing, 2016
LibPowerMon: A Lightweight Profiling Framework to Profile Program Context and System-Level Metrics.
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium Workshops, 2016
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium, 2016
2015
Proceedings of the International Conference for High Performance Computing, 2015
Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium, 2015
Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium, 2015
Proceedings of the Euro-Par 2015: Parallel Processing Workshops, 2015
2014
Combing the Communication Hairball: Visualizing Parallel Execution Traces using Logical Time.
IEEE Trans. Vis. Comput. Graph., 2014
pF3D Simulations of Laser-Plasma Interactions in National Ignition Facility Experiments.
Comput. Sci. Eng., 2014
Proceedings of the 16th Eurographics Conference on Visualization, 2014
Proceedings of the First Workshop on Visual Performance Analysis, 2014
Proceedings of the International Conference for High Performance Computing, 2014
Proceedings of the International Conference for High Performance Computing, 2014
Proceedings of the International Conference for High Performance Computing, 2014
Extracting logical structure and identifying stragglers in parallel execution traces.
Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2014
Proceedings of the 2014 IEEE 28th International Parallel and Distributed Processing Symposium, 2014
Proceedings of the 21st International Conference on High Performance Computing, 2014
2013
Predicting application performance using supervised learning on communication features.
Proceedings of the International Conference for High Performance Computing, 2013
Proceedings of the International Conference for High Performance Computing, 2013
Proceedings of the Parallel Computing: Accelerating Computational Science and Engineering (CSE), 2013
Exploring Traditional and Emerging Parallel Programming Models Using a Proxy Application.
Proceedings of the 27th IEEE International Symposium on Parallel and Distributed Processing, 2013
Scalable Molecular Dynamics with NAMD.
Proceedings of the Parallel Science and Engineering Applications - The Charm++ Approach, 2013
OpenAtom: Ab initio Molecular Dynamics for Petascale Platforms.
Proceedings of the Parallel Science and Engineering Applications - The Charm++ Approach, 2013
2012
Visualizing Network Traffic to Understand the Performance of Massively Parallel Simulations.
IEEE Trans. Vis. Comput. Graph., 2012
Proceedings of the SC Conference on High Performance Computing Networking, 2012
Proceedings of the SC Conference on High Performance Computing Networking, 2012
Proceedings of the 41st International Conference on Parallel Processing, 2012
2011
Proceedings of the Encyclopedia of Parallel Computing, 2011
Proceedings of the Encyclopedia of Parallel Computing, 2011
Int. J. High Perform. Comput. Appl., 2011
Concurr. Comput. Pract. Exp., 2011
Improving communication performance in dense linear algebra via topology aware collectives.
Proceedings of the Conference on High Performance Computing Networking, 2011
Proceedings of the Conference on High Performance Computing Networking, 2011
Proceedings of the Tools for High Performance Computing 2011, 2011
Architectural Constraints to Attain 1 Exaflop/s for Three Scientific Application Classes.
Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011
Simulation-Based Performance Analysis and Tuning for a Two-Level Directly Connected System.
Proceedings of the 17th IEEE International Conference on Parallel and Distributed Systems, 2011
Heuristic-Based Techniques for Mapping Irregular Communication Graphs to Mesh Topologies.
Proceedings of the 13th IEEE International Conference on High Performance Computing & Communication, 2011
Proceedings of the 18th International Conference on High Performance Computing, 2011
2010
Understanding Application Performance via Micro-benchmarks on Three Large Supercomputers: Intrepid, Ranger and Jaguar.
Int. J. High Perform. Comput. Appl., 2010
Proceedings of the 39th International Conference on Parallel Processing, 2010
Proceedings of the 2010 International Conference on High Performance Computing, 2010
2009
Parallel Process. Lett., 2009
Proceedings of the 14th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2009
An evaluative study on the effect of contention on message latencies in large supercomputers.
Proceedings of the 23rd IEEE International Symposium on Parallel and Distributed Processing, 2009
Dynamic topology aware load balancing algorithms for molecular dynamics applications.
Proceedings of the 23rd international conference on Supercomputing, 2009
Proceedings of the ICPPW 2009, 2009
Proceedings of the Euro-Par 2009 Parallel Processing, 2009
2008
Parallel Process. Lett., 2008
IBM J. Res. Dev., 2008
Fine-grained parallelization of the Car-Parrinello ab initio molecular dynamics method on the IBM Blue Gene/L supercomputer.
IBM J. Res. Dev., 2008
Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008
Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008
2007
Proceedings of the 21st International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007