Rajeev Thakur

Proceedings of the 2012 IEEE International Conference on Cluster Computing, 2012

A Decoupled Execution Paradigm for Data-Intensive High-End Computing.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE International Conference on Cluster Computing, 2012

Boosting Application-Specific Parallel I/O Optimization Using IOSIG.

[BibT_eX]

[DOI]

Proceedings of the 12th IEEE/ACM International Symposium on Cluster, 2012

Transparent Accelerator Migration in a Virtualized GPU Environment.

[BibT_eX]

[DOI]

Proceedings of the 12th IEEE/ACM International Symposium on Cluster, 2012

2011

Mpi on millions of Cores.

[BibT_eX]

[DOI]

Parallel Process. Lett., 2011

The International Exascale Software Project roadmap.

[BibT_eX]

[DOI]

Bertrand Braunschweig

Int. J. High Perform. Comput. Appl., 2011

The scalable process topology interface of MPI 2.2.

[BibT_eX]

[DOI]

Torsten Hoefler

Rolf Rabenseifner

Hubert Ritzdorf

Bronis R. de Supinski

Jesper Larsson Träff

Concurr. Comput. Pract. Exp., 2011

Formal analysis of MPI-based parallel programs.

[BibT_eX]

[DOI]

Bronis R. de Supinski

Martin Schulz

Greg Bronevetsky

Commun. ACM, 2011

Server-side I/O coordination for parallel file systems.

[BibT_eX]

[DOI]

Proceedings of the Conference on High Performance Computing Networking, 2011

Performance Expectations and Guidelines for MPI Derived Datatypes.

[BibT_eX]

[DOI]

Proceedings of the Recent Advances in the Message Passing Interface, 2011

Scalable Memory Use in MPI: A Case Study with MPICH2.

[BibT_eX]

[DOI]

Proceedings of the Recent Advances in the Message Passing Interface, 2011

LACIO: A New Collective I/O Strategy for Parallel I/O Systems.

[BibT_eX]

[DOI]

Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011

A Segment-Level Adaptive Data Layout Scheme for Improved Load Balance in Parallel File Systems.

[BibT_eX]

[DOI]

Proceedings of the 11th IEEE/ACM International Symposium on Cluster, 2011

2010

Self-Consistent MPI Performance Guidelines.

[BibT_eX]

[DOI]

Jesper Larsson Träff

IEEE Trans. Parallel Distributed Syst., 2010

Formal methods applied to high-performance computing software design: a case study of MPI one-sided communication-based locking.

[BibT_eX]

[DOI]

Salman Pervez

Softw. Pract. Exp., 2010

A study of dynamic meta-learning for failure prediction in large-scale systems.

[BibT_eX]

[DOI]

J. Parallel Distributed Comput., 2010

A Pipelined Algorithm for Large, Irregular All-Gather Problems.

[BibT_eX]

[DOI]

Int. J. High Perform. Comput. Appl., 2010

The Importance of Non-Data-Communication Overheads in MPI.

[BibT_eX]

[DOI]

Int. J. High Perform. Comput. Appl., 2010

Fine-Grained Multithreading Support for Hybrid Threaded MPI Programming.

[BibT_eX]

[DOI]

Int. J. High Perform. Comput. Appl., 2010

Global-scale distributed I/O with ParaMEDIC.

[BibT_eX]

[DOI]

Concurr. Comput. Pract. Exp., 2010

Implementing MPI on Windows: Comparison with Common Approaches on Unix.

[BibT_eX]

[DOI]

Proceedings of the Recent Advances in the Message Passing Interface, 2010

Toward Performance Models of MPI Implementations for Understanding Application Scaling Issues.

[BibT_eX]

[DOI]

Proceedings of the Recent Advances in the Message Passing Interface, 2010

Enabling Concurrent Multithreaded MPI Communication on Multicore Petascale Systems.

[BibT_eX]

[DOI]

Proceedings of the Recent Advances in the Message Passing Interface, 2010

Dynamic Verification of Hybrid Programs.

[BibT_eX]

[DOI]

Wei-Fan Chiang

Grzegorz Szubzda

Proceedings of the Recent Advances in the Message Passing Interface, 2010

PMI: A Scalable Parallel Process-Management Interface for Extreme-Scale Systems.

[BibT_eX]

[DOI]

Proceedings of the Recent Advances in the Message Passing Interface, 2010

Enabling active storage on parallel I/O software stacks.

[BibT_eX]

[DOI]

Proceedings of the IEEE 26th Symposium on Mass Storage Systems and Technologies, 2010

A layout-aware optimization strategy for collective I/O.

[BibT_eX]

[DOI]

Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing, 2010

Minimizing MPI Resource Contention in Multithreaded Multicore Environments.

[BibT_eX]

[DOI]

Bronis R. de Supinski

Proceedings of the 2010 IEEE International Conference on Cluster Computing, 2010

Improving Parallel I/O Performance with Data Layout Awareness.

[BibT_eX]

[DOI]

Proceedings of the 2010 IEEE International Conference on Cluster Computing, 2010

Hybrid parallel programming with MPI and unified parallel C.

[BibT_eX]

[DOI]

Proceedings of the 7th Conference on Computing Frontiers, 2010

2009

Test suite for evaluating performance of multithreaded MPI communication.

[BibT_eX]

[DOI]

Parallel Comput., 2009

ProOnE: a general-purpose protocol onload engine for multi- and many-core architectures.

[BibT_eX]

[DOI]

Comput. Sci. Res. Dev., 2009

Toward message passing for a million processes: characterizing MPI on a massive scale blue gene/P.

[BibT_eX]

[DOI]

Comput. Sci. Res. Dev., 2009

A configurable algorithm for parallel image-compositing applications.

[BibT_eX]

[DOI]

Proceedings of the ACM/IEEE Conference on High Performance Computing, 2009

Hierarchical Collectives in MPICH2.

[BibT_eX]

[DOI]

Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2009

Sound and Efficient Dynamic Verification of MPI Programs with Probe Non-determinism.

[BibT_eX]

[DOI]

Jason Williams

Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2009

Static-Analysis Assisted Dynamic Verification of MPI Waitany Programs (Poster Abstract).

[BibT_eX]

[DOI]

Grzegorz Szubzda

Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2009

Conflict Detection Algorithm to Minimize Locking for MPI-IO Atomicity.

[BibT_eX]

[DOI]

Saba Sehrish

Jun Wang

Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2009

Processing MPI Datatypes Outside MPI.

[BibT_eX]

[DOI]

Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2009

MPI on a Million Processors.

[BibT_eX]

[DOI]

Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2009

How Formal Dynamic Verification Tools Facilitate Novel Concurrency Visualizations.

[BibT_eX]

[DOI]

Sriram Aananthakrishnan

Michael Delisi

Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2009

Formal verification of practical MPI programs.

[BibT_eX]

[DOI]

Michael Delisi

Gopalakrishnan Santhanaraman

Proceedings of the 14th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2009

Investigating High Performance RMA Interfaces for the MPI-3 Standard.

[BibT_eX]

[DOI]

Proceedings of the ICPP 2009, 2009

Natively Supporting True One-Sided Communication in.

[BibT_eX]

[DOI]

Proceedings of the 9th IEEE/ACM International Symposium on Cluster Computing and the Grid, 2009

2008

Hiding I/O latency with pre-execution prefetching for parallel applications.

[BibT_eX]

[DOI]

Proceedings of the ACM/IEEE Conference on High Performance Computing, 2008

Parallel I/O prefetching using MPI file caching and I/O signatures.

[BibT_eX]

[DOI]

Proceedings of the ACM/IEEE Conference on High Performance Computing, 2008

Implementing Efficient Dynamic Formal Verification Methods for MPI Programs.

[BibT_eX]

[DOI]

Michael Delisi

Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2008

A Simple, Pipelined Algorithm for Large, Irregular All-gather Problems.

[BibT_eX]

[DOI]

Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2008

A Formal Approach to Detect Functionally Irrelevant Barriers in MPI Programs.

[BibT_eX]

[DOI]

Subodh Sharma

Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2008

Self-consistent MPI-IO Performance Requirements and Expectations.

[BibT_eX]

[DOI]

Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2008

Non-data-communication Overheads in MPI: Analysis on Blue Gene/P.

[BibT_eX]

[DOI]

Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2008

Toward Efficient Support for Multithreaded MPI Communication.

[BibT_eX]

[DOI]

Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2008

Semantics-based distributed I/O for mpiBLAST.

[BibT_eX]

[DOI]

Proceedings of the 13th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2008

2008 International Conference on Parallel Processing September 8-12, 2008 Portland, Oregon Exploring Parallel I/O Concurrency with Speculative Prefetching.

[BibT_eX]

[DOI]

Proceedings of the 2008 International Conference on Parallel Processing, 2008

Communication Analysis of Parallel 3D FFT for Flat Cartesian Meshes on Large Blue Gene Systems.

[BibT_eX]

[DOI]

Proceedings of the High Performance Computing, 2008

Sockets Direct Protocol for Hybrid Network Stacks: A Case Study with iWARP over 10G Ethernet.

[BibT_eX]

[DOI]

Proceedings of the High Performance Computing, 2008

2007

Thread-safety in an MPI implementation: Requirements and analysis.

[BibT_eX]

[DOI]

Parallel Comput., 2007

Implementing MPI-IO Atomic Mode and Shared File Pointers Using MPI One-Sided Communication.

[BibT_eX]

[DOI]

Int. J. High Perform. Comput. Appl., 2007

Analyzing the impact of supporting out-of-order communication on in-order performance with iWARP.

[BibT_eX]

[DOI]

Proceedings of the ACM/IEEE Conference on High Performance Networking and Computing, 2007

Self-consistent MPI Performance Requirements.

[BibT_eX]

[DOI]

Jesper Larsson Träff

Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 14th European PVM/MPI User's Group Meeting, Paris, France, September 30, 2007

Test Suite for Evaluating Performance of MPI Implementations That Support MPI_THREAD_MULTIPLE.

[BibT_eX]

[DOI]

Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 14th European PVM/MPI User's Group Meeting, Paris, France, September 30, 2007

Practical Model-Checking Method for Verifying Correctness of MPI Programs.

[BibT_eX]

[DOI]

Salman Pervez

Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 14th European PVM/MPI User's Group Meeting, Paris, France, September 30, 2007

Extending the MPI-2 Generalized Request Interface.

[BibT_eX]

[DOI]

Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 14th European PVM/MPI User's Group Meeting, Paris, France, September 30, 2007

Revealing the Performance of MPI RMA Implementations.

[BibT_eX]

[DOI]

Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 14th European PVM/MPI User's Group Meeting, Paris, France, September 30, 2007

Parallel I/O Performance Characterization of Columbia and NEC SX-8 Superclusters.

[BibT_eX]

[DOI]

Subhash Saini

Dale Talcott

Panagiotis A. Adamidis

Rolf Rabenseifner

Robert Ciotti

Proceedings of the 21th International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007

Nonuniformly Communicating Noncontiguous Data: A Case Study with PETSc and MPI.

[BibT_eX]

[DOI]

Proceedings of the 21th International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007

A Meta-Learning Failure Predictor for Blue Gene/L Systems.

[BibT_eX]

[DOI]

Proceedings of the 2007 International Conference on Parallel Processing (ICPP 2007), 2007

Advanced Flow-control Mechanisms for the Sockets Direct Protocol over InfiniBand.

[BibT_eX]

[DOI]

Proceedings of the 2007 International Conference on Parallel Processing (ICPP 2007), 2007

Open Issues in MPI Implementation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Computer Systems Architecture, 2007

2006

Discretionary Caching for I/O on Clusters.

[BibT_eX]

[DOI]

Murali Vilayannur

Anand Sivasubramaniam

Mahmut T. Kandemir

Clust. Comput., 2006

M02 - Parallel I/O in practice.

[BibT_eX]

[DOI]

Proceedings of the ACM/IEEE SC2006 Conference on High Performance Networking and Computing, 2006

S01 - Advanced MPI: I/O and one-sided communication.

[BibT_eX]

[DOI]

Proceedings of the ACM/IEEE SC2006 Conference on High Performance Networking and Computing, 2006

Formal Verification of Programs That Use MPI One-Sided Communication.

[BibT_eX]

[DOI]

Salman Pervez

Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2006

Can MPI Be Used for Persistent Parallel Services?

[BibT_eX]

[DOI]

Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2006

Issues in Developing a Thread-Safe MPI Implementation.

[BibT_eX]

[DOI]

Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2006

Automatic Memory Optimizations for Improving MPI Derived Datatype Performance.

[BibT_eX]

[DOI]

Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2006

Collective communication on architectures that support simultaneous communication over multiple links.

[BibT_eX]

[DOI]

Ernie Chan

Robert A. van de Geijn

Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2006

MPI-IO/L: efficient remote I/O for MPI-IO via logistical networking.

[BibT_eX]

[DOI]

Proceedings of the 20th International Parallel and Distributed Processing Symposium (IPDPS 2006), 2006

High performance file I/O for the Blue Gene/L supercomputer.

[BibT_eX]

[DOI]

Proceedings of the 12th International Symposium on High-Performance Computer Architecture, 2006

A New Flexible MPI Collective I/O Implementation.

[BibT_eX]

[DOI]

Proceedings of the 2006 IEEE International Conference on Cluster Computing, 2006

2005

Optimization of Collective Communication Operations in MPICH.

[BibT_eX]

[DOI]

Rolf Rabenseifner

Int. J. High Perform. Comput. Appl., 2005

Optimizing the Synchronization Operations in Message Passing Interface One-Sided Communication.

[BibT_eX]

[DOI]

Brian R. Toonen

Int. J. High Perform. Comput. Appl., 2005

Implementing Byte-Range Locks Using MPI One-Sided Communication.

[BibT_eX]

[DOI]

Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2005

Implementing MPI-IO Shared File Pointers Without File System Support.

[BibT_eX]

[DOI]

Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2005

An Evaluation of Implementation Options for MPI One-Sided Communication.

[BibT_eX]

[DOI]

Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2005

Implementing MPI-IO atomic mode without file system support.

[BibT_eX]

[DOI]

Proceedings of the 5th International Symposium on Cluster Computing and the Grid (CCGrid 2005), 2005

2004

Minimizing Synchronization Overhead in the Implementation of MPI One-Sided Communication.

[BibT_eX]

[DOI]

Brian R. Toonen

Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2004

The Impact of File Systems on MPI-IO Scalability.

[BibT_eX]

[DOI]

Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2004

Efficient Implementation of MPI-2 Passive One-Sided Communication on InfiniBand Clusters.

[BibT_eX]

[DOI]

Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2004

On the Performance of the POSIX I/O Interface to PVFS.

[BibT_eX]

[DOI]

Anand Sivasubramaniam

Mahmut T. Kandemir

Proceedings of the 12th Euromicro Workshop on Parallel, 2004

RFS: efficient and flexible remote file access for MPI-IO.

[BibT_eX]

[DOI]

Proceedings of the 2004 IEEE International Conference on Cluster Computing (CLUSTER 2004), 2004

Predicting memory-access cost based on data-access patterns.

[BibT_eX]

[DOI]

Proceedings of the 2004 IEEE International Conference on Cluster Computing (CLUSTER 2004), 2004

High performance MPI-2 one-sided communication over InfiniBand.

[BibT_eX]

[DOI]

Proceedings of the 4th IEEE/ACM International Symposium on Cluster Computing and the Grid (CCGrid 2004), 2004

2003

High-performance scientific data management system.

[BibT_eX]

[DOI]

Jaechun No

J. Parallel Distributed Comput., 2003

Parallel netCDF: A Scientific High-Performance I/O Interface

[BibT_eX]

[DOI]

CoRR, 2003

Parallel netCDF: A High-Performance Scientific I/O Interface.

[BibT_eX]

[DOI]

Proceedings of the ACM/IEEE SC2003 Conference on High Performance Networking and Computing, 2003

Improving the Performance of Collective Operations in MPICH.

[BibT_eX]

[DOI]

Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface,10th European PVM/MPI Users' Group Meeting, Venice, Italy, September 29, 2003

Using MPI-2: Advanced Features of the Message Passing Interface.

[BibT_eX]

[DOI]

Proceedings of the 2003 IEEE International Conference on Cluster Computing (CLUSTER 2003), 2003

Improving the Performance of MPI Derived Datatypes by Optimizing Memory-Access Cost.

[BibT_eX]

[DOI]

Proceedings of the 2003 IEEE International Conference on Cluster Computing (CLUSTER 2003), 2003

2002

Optimizing noncontiguous accesses in MPI-IO.

[BibT_eX]

[DOI]

Parallel Comput., 2002

2001

Evaluation of Collective I/O Implementations on Parallel Architectures.

[BibT_eX]

[DOI]

J. Parallel Distributed Comput., 2001

High-performance file I/O in Java: Existing approaches and bulk I/O extensions.

[BibT_eX]

[DOI]

Dan Bonachea

Concurr. Comput. Pract. Exp., 2001

A Scientific Data Management System for Irregular Applications.

[BibT_eX]

[DOI]

Proceedings of the 15th International Parallel & Distributed Processing Symposium (IPDPS-01), 2001

2000

Data management for large-scale scientific computations in high performance distributed systems.

[BibT_eX]

[DOI]

Clust. Comput., 2000

Integrating Parallel File I/O and Database Support for High-Performance Scientific Data Management.

[BibT_eX]

[DOI]

Jaechun No

Proceedings of the Proceedings Supercomputing 2000, 2000

An evaluation of Java's I/O capabilities for high-performance computing.

[BibT_eX]

[DOI]

Proceedings of the ACM 2000 Java Grande Conference, San Francisco, CA, USA, 2000

Parallel I/O and Storage Technology.

[BibT_eX]

[DOI]

Rolf Hempel

Elizabeth A. M. Shriver

Peter Brezany

Proceedings of the Euro-Par 2000, Parallel Processing, 6th International Euro-Par Conference, Munich, Germany, August 29, 2000

PVFS: A Parallel File System for Linux Clusters.

[BibT_eX]

[DOI]

Proceedings of the 4th Annual Linux Showcase & Conference 2000, 2000

1999

Improving Collective I/O Performance Using Threads.

[BibT_eX]

[DOI]

Proceedings of the 13th International Parallel Processing Symposium / 10th Symposium on Parallel and Distributed Processing (IPPS / SPDP '99), 1999

On Implementing MPI-IO Portably and with High Performance.

[BibT_eX]

[DOI]

Proceedings of the Sixth Workshop on I/O in Parallel and Distributed Systems, 1999

Data Management for Large-Scale Scientific Computations in High Performance Distributed Systems.

[BibT_eX]

[DOI]

Proceedings of the Eighth IEEE International Symposium on High Performance Distributed Computing, 1999

1998

I/O in Parallel Applications: the Weakest Link.

[BibT_eX]

[DOI]

Int. J. High Perform. Comput. Appl., 1998

A Case for Using MPI's Derived Datatypes to Improve I/O Performance.

[BibT_eX]

[DOI]

Proceedings of the ACM/IEEE Conference on Supercomputing, 1998

1996

Efficient Algorithms for Array Redistribution.

[BibT_eX]

[DOI]

J. Ramanujam

IEEE Trans. Parallel Distributed Syst., 1996

An Extended Two-Phase Method for Accessing Sections of Out-of-Core Arrays.

[BibT_eX]

[DOI]

Sivaramakrishna Kuditipudi

Sci. Program., 1996

Passion: Optimized I/O for Parallel Applications.

[BibT_eX]

[DOI]

Computer, 1996

An Experimental Evaluation of the Parallel I/O Systems of the IBM SP and Intel Paragon Using a Production Application.

[BibT_eX]

[DOI]

Proceedings of the Parallel Computation, 1996

Runtime Support for Out-of-Core Parallel Programs.

[BibT_eX]

[DOI]

Proceedings of the Input/Output in Parallel and Distributed Computer Systems., 1996

1995

Complete exchange on the CM-5 and Touchstone Delta.

[BibT_eX]

[DOI]

J. Supercomput., 1995

1994

Compilation of out-of-core data parallel programs for distributed memory machines.

[BibT_eX]

[DOI]

Rajesh Bordawekar

SIGARCH Comput. Archit. News, 1994

Connected Component Labeling on Coarse Grain Parallel Computers: An Experimental Study.

[BibT_eX]

[DOI]

J. Parallel Distributed Comput., 1994

Complete Exchange on a Wormhole Routed Mesh.

[BibT_eX]

[DOI]

Geoffrey C. Fox

Proceedings of the MASCOTS '94, Proceedings of the Second International Workshop on Modeling, Analysis, and Simulation On Computer and Telecommunication Systems, January 31, 1994

All-to-All Communication on Meshes with Wormhole Routing.

[BibT_eX]

[DOI]

Proceedings of the 8th International Symposium on Parallel Processing, 1994

Compiler and runtime support for out-of-core HPF programs.

[BibT_eX]

[DOI]

Rajesh Bordawekar