James Dinan

Orcid: 0000-0002-4840-7737

According to our database1, James Dinan authored at least 64 papers between 2006 and 2021.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2021
Optimizing Work Stealing Communication with Structured Atomic Operations.
Proceedings of the ICPP 2021: 50th International Conference on Parallel Processing, Lemont, IL, USA, August 9, 2021

2020
Simplifying Communication Overlap in OpenSHMEM Through Integrated User-Level Thread Scheduling.
Proceedings of the High Performance Computing - 35th International Conference, 2020

2019
Designing, Implementing, and Evaluating the Upcoming OpenSHMEM Teams API.
Proceedings of the 2019 IEEE/ACM Parallel Applications Workshop, Alternatives To MPI, 2019

Accelerated Work Stealing.
Proceedings of the 48th International Conference on Parallel Processing, 2019

2018
Lightweight Instrumentation and Analysis Using OpenSHMEM Performance Counters.
Proceedings of the OpenSHMEM and Related Technologies. OpenSHMEM in the Era of Extreme Heterogeneity, 2018

Design and Optimization of OpenSHMEM 1.4 for the Intel<sup>®</sup> Omni-Path Fabric 100 Series.
Proceedings of the OpenSHMEM and Related Technologies. OpenSHMEM in the Era of Extreme Heterogeneity, 2018

Efficient Runtime Support for a Partitioned Global Logical Address Space.
Proceedings of the 47th International Conference on Parallel Processing, 2018

2017
Application-Level Optimization of On-Node Communication in OpenSHMEM.
Proceedings of the OpenSHMEM and Related Technologies. Big Compute and Big Data Convergence, 2017

Symmetric Memory Partitions in OpenSHMEM: A Case Study with Intel KNL.
Proceedings of the OpenSHMEM and Related Technologies. Big Compute and Big Data Convergence, 2017

Implementation and Evaluation of OpenSHMEM Contexts Using OFI Libfabric.
Proceedings of the OpenSHMEM and Related Technologies. Big Compute and Big Data Convergence, 2017

Fast Networks and Slow Memories: A Mechanism for Mitigating Bandwidth Mismatches.
Proceedings of the 25th IEEE Annual Symposium on High-Performance Interconnects, 2017

2016
MPI-ACC: Accelerator-Aware MPI for Scientific Applications.
IEEE Trans. Parallel Distributed Syst., 2016

Global-view coefficients: a data management solution for parallel quantum Monte Carlo applications.
Concurr. Comput. Pract. Exp., 2016

An implementation and evaluation of the MPI 3.0 one-sided communication interface.
Concurr. Comput. Pract. Exp., 2016

Work stealing for GPU-accelerated parallel programs in a global address space framework.
Concurr. Comput. Pract. Exp., 2016

Mitigating MPI Message Matching Misery.
Proceedings of the High Performance Computing - 31st International Conference, 2016

Extending a Message Passing Runtime to Support Partitioned, Global Logical Address Spaces.
Proceedings of the First International Workshop on Communication Optimizations in HPC, 2016

Design and Implementation of OpenSHMEM Using OFI on the Aries Interconnect.
Proceedings of the OpenSHMEM and Related Technologies. Enhancing OpenSHMEM for Hybrid Environments, 2016

AsHES Introduction and Committees.
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium Workshops, 2016

2015
Remote Memory Access Programming in MPI-3.
ACM Trans. Parallel Comput., 2015

AsHES Introduction and Committees.
Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium Workshop, 2015

Versioned Distributed Arrays for Resilience in Scientific Applications: Global View Resilience.
Proceedings of the International Conference on Computational Science, 2015

2014
Processing MPI Derived Datatypes on Noncontiguous GPU-Resident Data.
IEEE Trans. Parallel Distributed Syst., 2014

Enabling communication concurrency through flexible MPI endpoints.
Int. J. High Perform. Comput. Appl., 2014

Enabling Efficient Multithreaded MPI Communication through a Library-Based Implementation of MPI Endpoints.
Proceedings of the International Conference for High Performance Computing, 2014

MC-Checker: Detecting Memory Consistency Errors in MPI One-Sided Applications.
Proceedings of the International Conference for High Performance Computing, 2014

Multi-Threaded OpenSHMEM: A Bad Idea?
Proceedings of the 8th International Conference on Partitioned Global Address Space Programming Models, 2014

One-Sided Append: A New Communication Paradigm For PGAS Models.
Proceedings of the 8th International Conference on Partitioned Global Address Space Programming Models, 2014

Contexts: A Mechanism for High Throughput Communication in OpenSHMEM.
Proceedings of the 8th International Conference on Partitioned Global Address Space Programming Models, 2014

Reducing Synchronization Overhead Through Bundled Communication.
Proceedings of the OpenSHMEM and Related Technologies. Experiences, Implementations, and Tools, 2014

2013
MPI + MPI: a new hybrid approach to parallel programming with MPI plus shared memory.
Computing, 2013

Dataflow coordination of data-parallel tasks via MPI 3.0.
Proceedings of the 20th European MPI Users's Group Meeting, 2013

Analysis of topology-dependent MPI performance on Gemini networks.
Proceedings of the 20th European MPI Users's Group Meeting, 2013

Enabling MPI interoperability through flexible communication endpoints.
Proceedings of the 20th European MPI Users's Group Meeting, 2013

Synchronization and Ordering Semantics in Hybrid MPI+GPU Programming.
Proceedings of the 2013 IEEE International Symposium on Parallel & Distributed Processing, 2013

Inspector-Executor Load Balancing Algorithms for Block-Sparse Tensor Contractions.
Proceedings of the 42nd International Conference on Parallel Processing, 2013

Enhancing Performance Portability of MPI Applications through Annotation-Based Transformations.
Proceedings of the 42nd International Conference on Parallel Processing, 2013

pVOCL: Power-Aware Dynamic Placement and Migration in Virtualized GPU Environments.
Proceedings of the IEEE 33rd International Conference on Distributed Computing Systems, 2013

On the efficacy of GPU-integrated MPI for scientific applications.
Proceedings of the 22nd International Symposium on High-Performance Parallel and Distributed Computing, 2013

Toward Asynchronous and MPI-Interoperable Active Messages.
Proceedings of the 13th IEEE/ACM International Symposium on Cluster, 2013

2012
Leveraging MPI's One-Sided Communication Interface for Shared-Memory Programming.
Proceedings of the Recent Advances in the Message Passing Interface, 2012

Efficient Multithreaded Context ID Allocation in MPI.
Proceedings of the Recent Advances in the Message Passing Interface, 2012

On the Usability of the MPI Shared File Pointer Routines.
Proceedings of the Recent Advances in the Message Passing Interface, 2012

PARDA: A Fast Parallel Reuse Distance Analysis Algorithm.
Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium, 2012

Efficient Intranode Communication in GPU-Accelerated Systems.
Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium Workshops & PhD Forum, 2012

Supporting the Global Arrays PGAS Model Using MPI One-Sided Communication.
Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium, 2012

Load Balancing of Dynamical Nucleation Theory Monte Carlo Simulations through Resource Sharing Barriers.
Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium, 2012

DMA-Assisted, Intranode Communication in GPU Accelerated Systems.
Proceedings of the 14th IEEE International Conference on High Performance Computing and Communication & 9th IEEE International Conference on Embedded Software and Systems, 2012

MPI-ACC: An Integrated and Extensible Approach to Data Movement in Accelerator-based Systems.
Proceedings of the 14th IEEE International Conference on High Performance Computing and Communication & 9th IEEE International Conference on Embedded Software and Systems, 2012

A global address space approach to automated data management for parallel Quantum Monte Carlo applications.
Proceedings of the 19th International Conference on High Performance Computing, 2012

Enabling Fast, Noncontiguous GPU Data Movement in Hybrid MPI+GPU Environments.
Proceedings of the 2012 IEEE International Conference on Cluster Computing, 2012

Transparent Accelerator Migration in a Virtualized GPU Environment.
Proceedings of the 12th IEEE/ACM International Symposium on Cluster, 2012

2011
Poster: High-level, one-sided programming models on MPI: a case study with global arrays and NWChem.
Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis, 2011

Noncollective Communicator Creation in MPI.
Proceedings of the Recent Advances in the Message Passing Interface, 2011

2010
Parichute: Generalized Turbocode-Based Error Correction for Near-Threshold Caches.
Proceedings of the 43rd Annual IEEE/ACM International Symposium on Microarchitecture, 2010

Hybrid parallel programming with MPI and unified parallel C.
Proceedings of the 7th Conference on Computing Frontiers, 2010

Selective Recovery from Failures in a Task Parallel Programming Model.
Proceedings of the 10th IEEE/ACM International Conference on Cluster, 2010

2009
Scalable work stealing.
Proceedings of the ACM/IEEE Conference on High Performance Computing, 2009

2008
A message passing benchmark for unbalanced applications.
Simul. Model. Pract. Theory, 2008

Global trees: a framework for linked data structures on distributed memory parallel systems.
Proceedings of the ACM/IEEE Conference on High Performance Computing, 2008

Scioto: A Framework for Global-View Task Parallelism.
Proceedings of the 2008 International Conference on Parallel Processing, 2008

2007
Dynamic Load Balancing of Unbalanced Computations Using Message Passing.
Proceedings of the 21th International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007

2006
UTS: An Unbalanced Tree Search Benchmark.
Proceedings of the Languages and Compilers for Parallel Computing, 2006

Hardware/Software Integration for FPGA-based All-Pairs Shortest-Paths.
Proceedings of the 14th IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM 2006), 2006


  Loading...