2024

Automatic Tracing in Task-Based Runtime Systems.

[DOI]

Rohan Yadav

Michael Bauer

CoRR, 2024

Composing Distributed Computations Through Task and Kernel Fusion.

[DOI]

CoRR, 2024

2023

Legate Sparse: Distributed Sparse Computing in Python.

[DOI]

Proceedings of the International Conference for High Performance Computing, 2023

Visibility Algorithms for Dynamic Dependence Analysis and Distributed Coherence.

[DOI]

Proceedings of the 28th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, 2023

2021

Supercomputing in Python With Legate.

[DOI]

Comput. Sci. Eng., 2021

Index launches: scalable, flexible representation of parallel task groups.

[DOI]

Proceedings of the International Conference for High Performance Computing, 2021

Scaling implicit parallelism via dynamic control replication.

[DOI]

Proceedings of the PPoPP '21: 26th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2021

2019

Legate NumPy: accelerated and distributed array computing.

[DOI]

Michael Bauer

Michael Garland

Proceedings of the International Conference for High Performance Computing, 2019

2018

Dynamic tracing: memoization of task graphs for dynamic task-based runtimes.

[DOI]

Proceedings of the International Conference for High Performance Computing, 2018

2017

Control replication: compiling implicit parallelism to efficient SPMD with logical regions.

[DOI]

Proceedings of the International Conference for High Performance Computing, 2017

Integrating External Resources with a Task-Based Programming Model.

[DOI]

Proceedings of the 24th IEEE International Conference on High Performance Computing, 2017

2016

Dependent partitioning.

[DOI]

Proceedings of the 2016 ACM SIGPLAN International Conference on Object-Oriented Programming, 2016

2015

Regent: a high-productivity programming language for HPC with logical regions.

[DOI]

Proceedings of the International Conference for High Performance Computing, 2015

Verification of producer-consumer synchronization in GPU programs.

[DOI]

Rahul Sharma

Michael Bauer

Alex Aiken

Proceedings of the 36th ACM SIGPLAN Conference on Programming Language Design and Implementation, 2015

2014

Legion: programming distributed heterogeneous architectures with logical regions.

[DOI]

Michael Bauer

PhD thesis, 2014

Structure Slicing: Extending Logical Regions with Fields.

[DOI]

Proceedings of the International Conference for High Performance Computing, 2014

Singe: leveraging warp specialization for high performance on GPUs.

[DOI]

Michael Bauer

Sean Treichler

Alex Aiken

Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2014

Realm: an event-based low-level runtime for distributed memory architectures.

[DOI]

Sean Treichler

Michael Bauer

Alex Aiken

Proceedings of the International Conference on Parallel Architectures and Compilation, 2014

2013

Language support for dynamic, hierarchical data partitioning.

[DOI]

Sean Treichler

Michael Bauer

Alex Aiken

Proceedings of the 2013 ACM SIGPLAN International Conference on Object Oriented Programming Systems Languages & Applications, 2013

2012

Legion: expressing locality and independence with logical regions.

[DOI]

Proceedings of the SC Conference on High Performance Computing Networking, 2012

2011

CudaDMA: optimizing GPU memory bandwidth via warp specialization.

[DOI]

Michael Bauer

Henry Cook

Brucek Khailany

Proceedings of the Conference on High Performance Computing Networking, 2011

Programming the memory hierarchy revisited: supporting irregular parallelism in sequoia.

[DOI]

Proceedings of the 16th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2011