Darius Buntinas

According to our database, Darius Buntinas authored at least 48 papers between 2000 and 2016.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.



Bibliography

2016
An implementation and evaluation of the MPI 3.0 one-sided communication interface.
Concurr. Comput. Pract. Exp., 2016

2013
MPI + MPI: a new hybrid approach to parallel programming with MPI plus shared memory.
Computing, 2013

Exascale workload characterization and architecture implications.
Proceedings of the 2012 IEEE International Symposium on Performance Analysis of Systems & Software, 2013

Toward Asynchronous and MPI-Interoperable Active Messages.
Proceedings of the 13th IEEE/ACM International Symposium on Cluster, 2013

2012
Poster: An Exascale Workload Study.
Proceedings of the 2012 SC Companion: High Performance Computing, 2012

Abstract: An Exascale Workload Study.
Proceedings of the 2012 SC Companion: High Performance Computing, 2012

Leveraging MPI's One-Sided Communication Interface for Shared-Memory Programming.
Proceedings of the Recent Advances in the Message Passing Interface, 2012

Efficient Intranode Communication in GPU-Accelerated Systems.
Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium Workshops & PhD Forum, 2012

DMA-Assisted, Intranode Communication in GPU Accelerated Systems.
Proceedings of the 14th IEEE International Conference on High Performance Computing and Communication & 9th IEEE International Conference on Embedded Software and Systems, 2012

MPI-ACC: An Integrated and Extensible Approach to Data Movement in Accelerator-based Systems.
Proceedings of the 14th IEEE International Conference on High Performance Computing and Communication & 9th IEEE International Conference on Embedded Software and Systems, 2012

2011
MPI on Millions of Cores.
Parallel Process. Lett., 2011

A uGNI-Based MPICH2 Nemesis Network Module for the Cray XE.
Proceedings of the Recent Advances in the Message Passing Interface, 2011

Run-Through Stabilization: An MPI Proposal for Process Fault Tolerance.
Proceedings of the Recent Advances in the Message Passing Interface, 2011

Scalable Distributed Consensus to Support MPI Fault Tolerance.
Proceedings of the Recent Advances in the Message Passing Interface, 2011

Building algorithmically nonstop fault tolerant MPI programs.
Proceedings of the 18th International Conference on High Performance Computing, 2011

2010
Efficient generated libraries for asynchronous derivative computation.
Proceedings of the International Conference on Computational Science, 2010

Multithreaded derivative computation with generated libraries.
J. Comput. Sci., 2010

Fine-Grained Multithreading Support for Hybrid Threaded MPI Programming.
Int. J. High Perform. Comput. Appl., 2010

Enabling Concurrent Multithreaded MPI Communication on Multicore Petascale Systems.
Proceedings of the Recent Advances in the Message Passing Interface, 2010

PMI: A Scalable Parallel Process-Management Interface for Extreme-Scale Systems.
Proceedings of the Recent Advances in the Message Passing Interface, 2010

Minimizing MPI Resource Contention in Multithreaded Multicore Environments.
Proceedings of the 2010 IEEE International Conference on Cluster Computing, 2010

2009
MPI on a Million Processors.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2009

NewMadeleine: An efficient support for high-performance networks in MPICH2.
Proceedings of the 23rd IEEE International Symposium on Parallel and Distributed Processing, 2009

Improving Resource Availability by Relaxing Network Allocation Constraints on Blue Gene/P.
Proceedings of the ICPP 2009, 2009

Cache-Efficient, Intranode, Large-Message MPI Communication with MPICH2-Nemesis.
Proceedings of the ICPP 2009, 2009

2008
Blocking vs. non-blocking coordinated checkpointing for large-scale fault tolerant MPI Protocols.
Future Gener. Comput. Syst., 2008

Toward Efficient Support for Multithreaded MPI Communication.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2008

A Scalable Tools Communications Infrastructure.
Proceedings of the 22nd Annual International Symposium on High Performance Computing Systems and Applications (HPCS 2008), 2008

2007
Implementation and evaluation of shared-memory communication and synchronization operations in MPICH2 using the Nemesis communication subsystem.
Parallel Comput., 2007

Nonuniformly Communicating Noncontiguous Data: A Case Study with PETSc and MPI.
Proceedings of the 21st International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007

2006
Implementation and Shared-Memory Evaluation of MPICH2 over the Nemesis Communication Subsystem.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2006

Data Transfers between Processes in an SMP System: Performance Study and Application to MPI.
Proceedings of the 2006 International Conference on Parallel Processing (ICPP 2006), 2006

Design and Evaluation of Nemesis, a Scalable, Low-Latency, Message-Passing Communication Subsystem.
Proceedings of the Sixth IEEE International Symposium on Cluster Computing and the Grid (CCGrid 2006), 2006

2005
Designing a Common Communication Subsystem.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2005

2004
Microbenchmark Performance Comparison of High-Speed Cluster Interconnects.
IEEE Micro, 2004

Application-bypass reduction for large-scale clusters.
Int. J. High Perform. Comput. Netw., 2004

Efficient Implementation of MPI-2 Passive One-Sided Communication on InfiniBand Clusters.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2004

Efficient and Scalable Barrier over Quadrics and Myrinet with a New NIC-Based Collective Message Passing Protocol.
Proceedings of the 18th International Parallel and Distributed Processing Symposium (IPDPS 2004), 2004

Design and Implementation of MPICH2 over InfiniBand with RDMA Support.
Proceedings of the 18th International Parallel and Distributed Processing Symposium (IPDPS 2004), 2004

Scalable, high-performance NIC-based all-to-all broadcast over Myrinet/GM.
Proceedings of the 2004 IEEE International Conference on Cluster Computing (CLUSTER 2004), 2004

2003
Performance Comparison of MPI Implementations over InfiniBand, Myrinet and Quadrics.
Proceedings of the ACM/IEEE SC2003 Conference on High Performance Networking and Computing, 2003

Optimizing Synchronization Operations for Remote Memory Communication Systems.
Proceedings of the 17th International Parallel and Distributed Processing Symposium (IPDPS 2003), 2003

High Performance and Reliable NIC-Based Multicast over Myrinet/GM-2.
Proceedings of the 32nd International Conference on Parallel Processing (ICPP 2003), 2003

Micro-benchmark level performance comparison of high-speed cluster interconnects.
Proceedings of the 11th Annual IEEE Symposium on High Performance Interconnects, 2003

Application-Bypass Broadcast in MPICH over GM.
Proceedings of the 3rd IEEE International Symposium on Cluster Computing and the Grid (CCGrid 2003), 2003

2001
Performance Benefits of NIC-Based Barrier on Myrinet/GM.
Proceedings of the 15th International Parallel & Distributed Processing Symposium (IPDPS-01), 2001

Fast NIC-Based Barrier over Myrinet/GM.
Proceedings of the 15th International Parallel & Distributed Processing Symposium (IPDPS-01), 2001

2000
Broadcast/Multicast over Myrinet Using NIC-Assisted Multidestination Messages.
Proceedings of the Network-Based Parallel Computing: Communication, 2000

