Thomas B. Rolinger

Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2024

JITSPMM: Just-in-Time Instruction Generation for Accelerated Sparse Matrix-Matrix Multiplication.

[BibT_eX]

[DOI]

Qiang Fu

H. Howie Huang

Proceedings of the IEEE/ACM International Symposium on Code Generation and Optimization, 2024

2023

Compiler Optimizations for Irregular Memory Access Patterns in the PGAS Programming Model.

[BibT_eX]

[DOI]

PhD thesis, 2023

Decontentioned Stochastic Block Partition.

[BibT_eX]

[DOI]

Ahsen J. Uppal

H. Howie Huang

Proceedings of the IEEE High Performance Extreme Computing Conference, 2023

2022

Compiler Optimization for Irregular Memory Access Patterns in PGAS Programs.

[BibT_eX]

[DOI]

Proceedings of the Languages and Compilers for Parallel Computing, 2022

2021

Towards High Productivity and Performance for Irregular Applications in Chapel.

[BibT_eX]

[DOI]

Joseph Craft

Proceedings of the 2021 SC Workshops Supplementary Proceedings, 2021

Optimizing Memory-Compute Colocation for Irregular Applications on a Migratory Thread Architecture.

[BibT_eX]

[DOI]

Proceedings of the 35th IEEE International Parallel and Distributed Processing Symposium, 2021

Faster Stochastic Block Partition Using Aggressive Initial Merging, Compressed Representation, and Parallelism Control.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE High Performance Extreme Computing Conference, 2021

2020

Performance Strategies for Parallel Bitonic Sort on a Migratory Thread Architecture.

[BibT_eX]

[DOI]

Kaushik Velusamy

Janice McMahon

Proceedings of the 2020 IEEE High Performance Extreme Computing Conference, 2020

2019

Performance considerations for scalable parallel tensor decomposition.

[BibT_eX]

[DOI]

J. Parallel Distributed Comput., 2019

Optimizing Data Layouts for Irregular Applications on a Migratory Thread Architecture.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/ACM Workshop on Memory Centric High Performance Computing, 2019

2018

Impact of Traditional Sparse Optimizations on a Migratory Thread Architecture.

[BibT_eX]

[DOI]

Proceedings of the 8th IEEE/ACM Workshop on Irregular Applications: Architectures and Algorithms, 2018

Parallel Sparse Tensor Decomposition in Chapel.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium Workshops, 2018

Exploring Parallel Bitonic Sort on a Migratory Thread Architecture.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE High Performance Extreme Computing Conference, 2018

An Empirical Evaluation of Allgatherv on Multi-GPU Systems.

[BibT_eX]

[DOI]

Proceedings of the 18th IEEE/ACM International Symposium on Cluster, 2018

2017

Performance challenges for heterogeneous distributed tensor decompositions.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE High Performance Extreme Computing Conference, 2017

2016

Performance Evaluation of Parallel Sparse Tensor Decomposition Implementations.

[BibT_eX]

[DOI]