Gengbin Zheng

Dataset, October, 2021

UIUC-PPL/charm: v7.0.0-rc2.

[BibT_eX]

[DOI]

Dataset, September, 2021

UIUC-PPL/charm: v7.0.0-rc1.

[BibT_eX]

[DOI]

Dataset, June, 2021

2020

UIUC-PPL/charm: v6.11.0-beta1.

[BibT_eX]

[DOI]

Dataset, October, 2020

UIUC-PPL/charm: Charm++ version 6.10.2.

[BibT_eX]

[DOI]

Dataset, August, 2020

UIUC-PPL/charm: v6.10.1.

[BibT_eX]

[DOI]

Dataset, March, 2020

UIUC-PPL/charm: v6.10.0.

[BibT_eX]

[DOI]

Dataset, February, 2020

Minimizing the usage of hardware counters for collective communication using triggered operations.

[BibT_eX]

[DOI]

Parallel Comput., 2020

2019

UIUC-PPL/charm: v6.10.0-rc2.

[BibT_eX]

[DOI]

Dataset, October, 2019

UIUC-PPL/charm: v6.10.0-rc.

[BibT_eX]

[DOI]

Dataset, September, 2019

UIUC-PPL/charm: v6.10.0-beta1.

[BibT_eX]

[DOI]

Dataset, August, 2019

2018

Parallelizing MPI Using Tasks for Hybrid Programming Models.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium Workshops, 2018

2017

Why is MPI so slow?: analyzing the fundamental limits in implementing MPI-3.1.

[BibT_eX]

[DOI]

Proceedings of the International Conference for High Performance Computing, 2017

OpenMP<sup>®</sup> Runtime Instrumentation for Optimization.

[BibT_eX]

[DOI]

Proceedings of the Scaling OpenMP for Exascale Performance and Portability, 2017

2015

Using Migratable Objects to Enhance Fault Tolerance Schemes in Supercomputers.

[BibT_eX]

[DOI]

IEEE Trans. Parallel Distributed Syst., 2015

2014

Scaling the ISAM Land Surface Model through Parallelization of Inter-component Data Transfer.

[BibT_eX]

[DOI]

Proceedings of the 43rd International Conference on Parallel Processing, 2014

2013

Communication and topology-aware load balancing in Charm++ with TreeMatch.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE International Conference on Cluster Computing, 2013

Crack Propagation Analysis with Automatic Load Balancing.

[BibT_eX]

Proceedings of the Parallel Science and Engineering Applications - The Charm++ Approach., 2013

The Charm++ Programming Model.

[BibT_eX]

Proceedings of the Parallel Science and Engineering Applications - The Charm++ Approach., 2013

2012

Optimizing fine-grained communication in a biomolecular simulation application on Cray XK6.

[BibT_eX]

[DOI]

Proceedings of the SC Conference on High Performance Computing Networking, 2012

A uGNI-based Asynchronous Message-driven Runtime System for Cray Supercomputers with Gemini Interconnect.

[BibT_eX]

[DOI]

Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium, 2012

A scalable double in-memory checkpoint and restart scheme towards exascale.

[BibT_eX]

[DOI]

Proceedings of the IEEE/IFIP International Conference on Dependable Systems and Networks Workshops, 2012

Automated Load Balancing Invocation Based on Application Characteristics.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE International Conference on Cluster Computing, 2012

2011

Load Balancing, Distributed Memory.

[BibT_eX]

[DOI]

Aaron T. Becker

Proceedings of the Encyclopedia of Parallel Computing, 2011

Parssse: an Adaptive Parallel State Space Search Engine.

[BibT_eX]

[DOI]

Parallel Process. Lett., 2011

Periodic hierarchical load balancing for large supercomputers.

[BibT_eX]

[DOI]

Int. J. High Perform. Comput. Appl., 2011

Enabling and scaling biomolecular simulations of 100 million atoms on petascale machines with a multicore-optimized message-driven runtime.

[BibT_eX]

[DOI]

Proceedings of the Conference on High Performance Computing Networking, 2011

An Adaptive Framework for Large-Scale State Space Search.

[BibT_eX]

[DOI]

Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011

Automatic Handling of Global Variables for Multi-threaded MPI Programs.

[BibT_eX]

[DOI]

Eduardo Rocha Rodrigues

Proceedings of the 17th IEEE International Conference on Parallel and Distributed Systems, 2011

Simulation-Based Performance Analysis and Tuning for a Two-Level Directly Connected System.

[BibT_eX]

[DOI]

Proceedings of the 17th IEEE International Conference on Parallel and Distributed Systems, 2011

2010

Optimizing a parallel runtime system for multicore clusters: a case study.

[BibT_eX]

[DOI]

Proceedings of the 2010 TeraGrid Conference, 2010

Debugging Large Scale Applications in a Virtualized Environment.

[BibT_eX]

[DOI]

Filippo Gioachin

Proceedings of the Languages and Compilers for Parallel Computing, 2010

Robust non-intrusive record-replay with processor extraction.

[BibT_eX]

[DOI]

Filippo Gioachin

Proceedings of the 8th Workshop on Parallel and Distributed Systems: Testing, 2010

Hierarchical Load Balancing for Charm++ Applications on Large Supercomputers.

[BibT_eX]

[DOI]

Proceedings of the 39th International Conference on Parallel Processing, 2010

Simulating Large Scale Parallel Applications Using Statistical Models for Sequential Execution Blocks.

[BibT_eX]

[DOI]

Proceedings of the 16th IEEE International Conference on Parallel and Distributed Systems, 2010

Automatic MPI to AMPI Program Transformation Using Photran.

[BibT_eX]

[DOI]

Proceedings of the Euro-Par 2010 Parallel Processing Workshops, 2010

2008

Scalable molecular dynamics with NAMD on the IBM Blue Gene/L system.

[BibT_eX]

[DOI]

IBM J. Res. Dev., 2008

Overcoming scaling challenges in biomolecular simulations across multiple platforms.

[BibT_eX]

[DOI]

Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008

2006

Performance evaluation of automatic checkpoint-based fault tolerance for AMPI and Charm++.

[BibT_eX]

[DOI]

Chao Huang

ACM SIGOPS Oper. Syst. Rev., 2006

Scaling applications to massively parallel machines using Projections performance analysis tool.

[BibT_eX]

[DOI]

Future Gener. Comput. Syst., 2006

ParFUM: a parallel framework for unstructured meshes for scalable dynamic physics applications.

[BibT_eX]

[DOI]

Eng. Comput., 2006

A system integration framework for coupled multiphysics simulations.

[BibT_eX]

[DOI]

Eng. Comput., 2006

Poster reception - Charm++ simplifies coding for the cell processor.

[BibT_eX]

[DOI]

Proceedings of the ACM/IEEE SC2006 Conference on High Performance Networking and Computing, 2006

Performance evaluation of adaptive MPI.

[BibT_eX]

[DOI]

Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2006

Multiple Flows of Control in Migratable Parallel Programs.

[BibT_eX]

[DOI]

Orion Sky Lawlor

Proceedings of the 2006 International Conference on Parallel Processing Workshops (ICPP Workshops 2006), 2006

2005

Achieving High Performance on Extremely Large Parallel Machines: Performance Prediction and Load Balancing

[BibT_eX]

[DOI]

PhD thesis, 2005

Simulation-Based Performance Prediction for Large Parallel Machines.

[BibT_eX]

[DOI]

Terry Wilmarth

Praveen Jagadishprasad

Int. J. Parallel Program., 2005

Performance Prediction Using Simulation of Large-Scale Interconnection Networks in POSE.

[BibT_eX]

[DOI]

Praveen Jagadishprasad

Proceedings of the 19th Workshop on Parallel and Distributed Simulation, 2005

2004

Performance Modeling and Programming Environments for Petaflops Computers and the Blue Gene Machine.

[BibT_eX]

[DOI]

Proceedings of the 18th International Parallel and Distributed Processing Symposium (IPDPS 2004), 2004

BigSim: A Parallel Simulator for Performance Prediction of Extremely Large Parallel Machines.

[BibT_eX]

[DOI]

Gunavardhan Kakulapati

Proceedings of the 18th International Parallel and Distributed Processing Symposium (IPDPS 2004), 2004

FTC-Charm++: an in-memory checkpoint-based fault tolerant runtime for Charm++ and MPI.

[BibT_eX]

[DOI]

Lixia Shi

Proceedings of the 2004 IEEE International Conference on Cluster Computing (CLUSTER 2004), 2004

2003

Scaling Molecular Dynamics to 3000 Processors with Projections: A Performance Analysis Case Study.

[BibT_eX]

[DOI]

Proceedings of the Computational Science - ICCS 2003, 2003

2002

NAMD: biomolecular simulation on thousands of processors.

[BibT_eX]

[DOI]

Proceedings of the 2002 ACM/IEEE conference on Supercomputing, 2002

A Parallel-Object Programming Model for PetaFLOPS Machines and Blue Gene/Cyclops.

[BibT_eX]

[DOI]

Arun Kumar Singla

Joshua Mostkoff Unger