Gengbin Zheng

According to our database1, Gengbin Zheng authored at least 54 papers between 2002 and 2021.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2021



2020




Minimizing the usage of hardware counters for collective communication using triggered operations.
Parallel Comput., 2020

2019



2018
Parallelizing MPI Using Tasks for Hybrid Programming Models.
Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium Workshops, 2018

2017

OpenMP<sup>®</sup> Runtime Instrumentation for Optimization.
Proceedings of the Scaling OpenMP for Exascale Performance and Portability, 2017

2015
Using Migratable Objects to Enhance Fault Tolerance Schemes in Supercomputers.
IEEE Trans. Parallel Distributed Syst., 2015

2014
Scaling the ISAM Land Surface Model through Parallelization of Inter-component Data Transfer.
Proceedings of the 43rd International Conference on Parallel Processing, 2014

2013
Communication and topology-aware load balancing in Charm++ with TreeMatch.
Proceedings of the 2013 IEEE International Conference on Cluster Computing, 2013

Crack Propagation Analysis with Automatic Load Balancing.
Proceedings of the Parallel Science and Engineering Applications - The Charm++ Approach., 2013

The Charm++ Programming Model.
Proceedings of the Parallel Science and Engineering Applications - The Charm++ Approach., 2013

2012
Optimizing fine-grained communication in a biomolecular simulation application on Cray XK6.
Proceedings of the SC Conference on High Performance Computing Networking, 2012

A uGNI-based Asynchronous Message-driven Runtime System for Cray Supercomputers with Gemini Interconnect.
Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium, 2012

A scalable double in-memory checkpoint and restart scheme towards exascale.
Proceedings of the IEEE/IFIP International Conference on Dependable Systems and Networks Workshops, 2012

Automated Load Balancing Invocation Based on Application Characteristics.
Proceedings of the 2012 IEEE International Conference on Cluster Computing, 2012

2011
Load Balancing, Distributed Memory.
Proceedings of the Encyclopedia of Parallel Computing, 2011

Parssse: an Adaptive Parallel State Space Search Engine.
Parallel Process. Lett., 2011

Periodic hierarchical load balancing for large supercomputers.
Int. J. High Perform. Comput. Appl., 2011

Enabling and scaling biomolecular simulations of 100 million atoms on petascale machines with a multicore-optimized message-driven runtime.
Proceedings of the Conference on High Performance Computing Networking, 2011

An Adaptive Framework for Large-Scale State Space Search.
Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011

Automatic Handling of Global Variables for Multi-threaded MPI Programs.
Proceedings of the 17th IEEE International Conference on Parallel and Distributed Systems, 2011

Simulation-Based Performance Analysis and Tuning for a Two-Level Directly Connected System.
Proceedings of the 17th IEEE International Conference on Parallel and Distributed Systems, 2011

2010
Optimizing a parallel runtime system for multicore clusters: a case study.
Proceedings of the 2010 TeraGrid Conference, 2010

Debugging Large Scale Applications in a Virtualized Environment.
Proceedings of the Languages and Compilers for Parallel Computing, 2010

Robust non-intrusive record-replay with processor extraction.
Proceedings of the 8th Workshop on Parallel and Distributed Systems: Testing, 2010

Hierarchical Load Balancing for Charm++ Applications on Large Supercomputers.
Proceedings of the 39th International Conference on Parallel Processing, 2010

Simulating Large Scale Parallel Applications Using Statistical Models for Sequential Execution Blocks.
Proceedings of the 16th IEEE International Conference on Parallel and Distributed Systems, 2010

Automatic MPI to AMPI Program Transformation Using Photran.
Proceedings of the Euro-Par 2010 Parallel Processing Workshops, 2010

2008
Scalable molecular dynamics with NAMD on the IBM Blue Gene/L system.
IBM J. Res. Dev., 2008

Overcoming scaling challenges in biomolecular simulations across multiple platforms.
Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008

2006
Performance evaluation of automatic checkpoint-based fault tolerance for AMPI and Charm++.
ACM SIGOPS Oper. Syst. Rev., 2006

Scaling applications to massively parallel machines using Projections performance analysis tool.
Future Gener. Comput. Syst., 2006

ParFUM: a parallel framework for unstructured meshes for scalable dynamic physics applications.
Eng. Comput., 2006

A system integration framework for coupled multiphysics simulations.
Eng. Comput., 2006

Poster reception - Charm++ simplifies coding for the cell processor.
Proceedings of the ACM/IEEE SC2006 Conference on High Performance Networking and Computing, 2006

Performance evaluation of adaptive MPI.
Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2006

Multiple Flows of Control in Migratable Parallel Programs.
Proceedings of the 2006 International Conference on Parallel Processing Workshops (ICPP Workshops 2006), 2006

2005
Achieving High Performance on Extremely Large Parallel Machines: Performance Prediction and Load Balancing
PhD thesis, 2005

Simulation-Based Performance Prediction for Large Parallel Machines.
Int. J. Parallel Program., 2005

Performance Prediction Using Simulation of Large-Scale Interconnection Networks in POSE.
Proceedings of the 19th Workshop on Parallel and Distributed Simulation, 2005

2004
Performance Modeling and Programming Environments for Petaflops Computers and the Blue Gene Machine.
Proceedings of the 18th International Parallel and Distributed Processing Symposium (IPDPS 2004), 2004

BigSim: A Parallel Simulator for Performance Prediction of Extremely Large Parallel Machines.
Proceedings of the 18th International Parallel and Distributed Processing Symposium (IPDPS 2004), 2004

FTC-Charm++: an in-memory checkpoint-based fault tolerant runtime for Charm++ and MPI.
Proceedings of the 2004 IEEE International Conference on Cluster Computing (CLUSTER 2004), 2004

2003
Scaling Molecular Dynamics to 3000 Processors with Projections: A Performance Analysis Case Study.
Proceedings of the Computational Science - ICCS 2003, 2003

2002
NAMD: biomolecular simulation on thousands of processors.
Proceedings of the 2002 ACM/IEEE conference on Supercomputing, 2002

A Parallel-Object Programming Model for PetaFLOPS Machines and Blue Gene/Cyclops.
Proceedings of the 16th International Parallel and Distributed Processing Symposium (IPDPS 2002), 2002


  Loading...