2019
Efficient implementation of MPI-3 RMA over openFabrics interfaces.
Parallel Comput., 2019

Software combining to mitigate multithreaded MPI contention.
Proceedings of the ACM International Conference on Supercomputing, 2019

2017
Why is MPI so slow?: analyzing the fundamental limits in implementing MPI-3.1.
Proceedings of the International Conference for High Performance Computing, 2017

Memory Compression Techniques for Network Address Management in MPI.
Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium, 2017

2014
Early Evaluation of Scalable Fabric Interface for PGAS Programming Models.
Proceedings of the 8th International Conference on Partitioned Global Address Space Programming Models, 2014

2012
Case Study: LRZ Liquid Cooling, Energy Management, Contract Specialities.
Proceedings of the 2012 SC Companion: High Performance Computing, 2012

Network Endpoints for Clusters of SMPs.
Proceedings of the IEEE 24th International Symposium on Computer Architecture and High Performance Computing, 2012

Composable, non-blocking collective operations on power7 IH.
Proceedings of the International Conference on Supercomputing, 2012

2009
Breaking the petaflops barrier.
IBM J. Res. Dev., 2009

2008
BlueGene/L applications: Parallelism On a Massive Scale.
Int. J. High Perform. Comput. Appl., 2008

EUDOC on the IBM Blue Gene/L system: Accelerating the transfer of drug discoveries from laboratory to patient.
IBM J. Res. Dev., 2008

The deep computing messaging framework: generalized scalable message passing on the blue gene/P supercomputer.
Proceedings of the 22nd Annual International Conference on Supercomputing, 2008

2007
The Blue Gene/L Supercomputer: A Hardware and Software Story.
Int. J. Parallel Program., 2007

2006
Blue Gene system software - Design and implementation of a one-sided communication interface for the IBM eServer Blue Gene® supercomputer.
Proceedings of the ACM/IEEE SC2006 Conference on High Performance Networking and Computing, 2006

Achieving High Performance on the BlueGene/L Supercomputer.
Proceedings of the Parallel Processing for Scientific Computing, 2006

2005
Blue Gene/L programming and operating environment.
IBM J. Res. Dev., 2005

Design and implementation of message-passing services for the Blue Gene/L supercomputer.
IBM J. Res. Dev., 2005

Optimization of MPI collective communication on BlueGene/L systems.
Proceedings of the 19th Annual International Conference on Supercomputing, 2005

Scaling physics and material science applications on a massively parallel Blue Gene/L system.
Proceedings of the 19th Annual International Conference on Supercomputing, 2005

Early Experience with Scientific Applications on the Blue Gene/L Supercomputer.
Proceedings of the Euro-Par 2005, Parallel Processing, 11th International Euro-Par Conference, Lisbon, Portugal, August 30, 2005

2004
Architecture and Performance of the BlueGene/L Message Layer.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2004

Implementing MPI on the BlueGene/L Supercomputer.
Proceedings of the Euro-Par 2004 Parallel Processing, 2004

2003
MPI on BlueGene/L: Designing an Efficient General Purpose Messaging Solution for a Large Cellular System.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface,10th European PVM/MPI Users' Group Meeting, Venice, Italy, September 29, 2003