Proceedings of the 2021 IEEE High Performance Extreme Computing Conference, 2021

2020

Computing Bottleneck Structures at Scale for High-Precision Network Performance Analysis.

[DOI]

Proceedings of the IEEE/ACM Innovating the Network for Data-Intensive Science, 2020

Multiscale Data Analysis Using Binning, Tensor Decompositions, and Backtracking.

[DOI]

Dimitri Leggas

Proceedings of the 2020 IEEE High Performance Extreme Computing Conference, 2020

Approximate Inverse Chain Preconditioner: Iteration Count Case Study for Spectral Support Solvers.

[DOI]

Proceedings of the 2020 IEEE High Performance Extreme Computing Conference, 2020

2019

On the Bottleneck Structure of Congestion-Controlled Networks.

[DOI]

Jordi Ros-Giralt

Atul Bohara

Sruthi Yellamraju

Malathi Veeraraghavan

Proc. ACM Meas. Anal. Comput. Syst., 2019

Enhancing Network Visibility and Security through Tensor Analysis.

[DOI]

Future Gener. Comput. Syst., 2019

PUMA-V: Optimizing Parallel Code Performance Through Interactive Visualization.

[DOI]

Eric Papenhausen

Klaus Mueller

IEEE Computer Graphics and Applications, 2019

G2: A Network Optimization Framework for High-Precision Analysis of Bottleneck and Flow Performance.

[DOI]

Malathi Veeraraghavan

Yuang Jiang

Proceedings of the 2019 IEEE/ACM Innovating the Network for Data-Intensive Science, 2019

Combinatorial Multigrid: Advanced Preconditioners For Ill-Conditioned Linear Systems.

[DOI]

Mitchell Tong Harris

Proceedings of the 2019 IEEE High Performance Extreme Computing Conference, 2019

Fast Large-Scale Algorithm for Electromagnetic Wave Propagation in 3D Media.

[DOI]

Mitchell Tong Harris

Pierre-David Létourneau

George Papanicolaou

Proceedings of the 2019 IEEE High Performance Extreme Computing Conference, 2019

Combining Tensor Decompositions and Graph Analytics to Provide Cyber Situational Awareness at HPC Scale.

[DOI]

Proceedings of the 2019 IEEE High Performance Extreme Computing Conference, 2019

2018

Algorithms and data structures to accelerate network analysis.

[DOI]

Future Gener. Comput. Syst., 2018

Fast Detection of Elephant Flows with Dirichlet-Categorical Inference.

[DOI]

Proceedings of the 5th IEEE/ACM International Workshop on Innovating the Network for Data-Intensive Science, 2018

Accelerating Dijkstra's Algorithm Using Multiresolution Priority Queues.

[DOI]

Proceedings of the 2018 IEEE High Performance Extreme Computing Conference, 2018

Computationally Efficient CP Tensor Decomposition Update Framework for Emerging Component Discovery in Streaming Data.

[DOI]

Tom Henretty

Proceedings of the 2018 IEEE High Performance Extreme Computing Conference, 2018

All-at-once Decomposition of Coupled Billion-scale Tensors in Apache Spark.

[DOI]

Aditya Gudibanda

Tom Henretty

Proceedings of the 2018 IEEE High Performance Extreme Computing Conference, 2018

2017

Multiresolution Priority Queues.

[DOI]

CoRR, 2017

Report of the HPC Correctness Summit, Jan 25-26, 2017, Washington, DC.

[DOI]

Ganesh Gopalakrishnan

Paul D. Hovland

Costin Iancu

Sriram Krishnamoorthy

CoRR, 2017

Polyhedral Optimization of TensorFlow Computation Graphs.

[DOI]

Jonathan Springer

Proceedings of the Programming and Performance Visualization Tools, 2017

Memory-efficient parallel tensor decompositions.

[DOI]

Tom Henretty

David Bruns-Smith

Proceedings of the 2017 IEEE High Performance Extreme Computing Conference, 2017

2016

Efficient Compilation to Event-Driven Task Programs.

[DOI]

CoRR, 2016

A sparse multidimensional FFT for real positive vectors.

[DOI]

CoRR, 2016

Highly Scalable Near Memory Processing with Migrating Threads on the Emu System Architecture.

[DOI]

Proceedings of the 6th Workshop on Irregular Applications: Architecture and Algorithms, 2016

Automatic Code Generation and Data Management for an Asynchronous Task-Based Runtime.

[DOI]

Proceedings of the 5th Workshop on Extreme-Scale Programming Tools, 2016

An Interactive Visual Tool for Code Optimization and Parallelization Based on the Polyhedral Model.

[DOI]

Proceedings of the 45th International Conference on Parallel Processing Workshops, 2016

Scalable Hierarchical Polyhedral Compilation.

[DOI]

Proceedings of the 45th International Conference on Parallel Processing, 2016

High-performance algorithms and data structures to catch elephant flows.

[DOI]

Malathi Veeraraghavan

Proceedings of the 2016 IEEE High Performance Extreme Computing Conference, 2016

Polyhedral compilation for energy efficiency.

[DOI]

Proceedings of the 2016 IEEE High Performance Extreme Computing Conference, 2016

A sparse multi-dimensional Fast Fourier Transform with stability to noise in the context of image processing and change detection.

[DOI]

Proceedings of the 2016 IEEE High Performance Extreme Computing Conference, 2016

Accelerated low-rank updates to tensor decompositions.

[DOI]

Proceedings of the 2016 IEEE High Performance Extreme Computing Conference, 2016

A unified Coq framework for verifying C programs with floating-point computations.

[DOI]

Proceedings of the 5th ACM SIGPLAN Conference on Certified Programs and Proofs, 2016

2015

Polyhedral user mapping and assistant visualizer tool for the r-stream auto-parallelizing compiler.

[DOI]

Eric Papenhausen

Bing Wang

Proceedings of the 3rd IEEE Working Conference on Software Visualization, 2015

High-performance many-core networking: design and implementation.

[DOI]

Proceedings of the Second Workshop on Innovating the Network for Data-Intensive Science, 2015

Automatic cluster parallelization and minimizing communication via selective data replication.

[DOI]

Sanket Tavarageri

Ann Johnson

Proceedings of the 2015 IEEE High Performance Extreme Computing Conference, 2015

Embedded second-order cone programming with radar applications.

[DOI]

Proceedings of the 2015 IEEE High Performance Extreme Computing Conference, 2015

Optimization of symmetric tensor computations.

[DOI]

Jonathon Cai

Proceedings of the 2015 IEEE High Performance Extreme Computing Conference, 2015

2014

A Tale of Three Runtimes.

[DOI]

CoRR, 2014

Parallelizing and optimizing sparse tensor computations.

[DOI]

Proceedings of the 2014 International Conference on Supercomputing, 2014

Lockless hash tables with low false negatives.

[DOI]

Proceedings of the IEEE High Performance Extreme Computing Conference, 2014

Low-overhead load-balanced scheduling for sparse tensor computations.

[DOI]

Proceedings of the IEEE High Performance Extreme Computing Conference, 2014

2013

Re-Introduction of communication-avoiding FMM-accelerated FFTs with GPU acceleration.

[DOI]

Proceedings of the IEEE High Performance Extreme Computing Conference, 2013

Runnemede: An architecture for Ubiquitous High-Performance Computing.

[DOI]

Proceedings of the 19th IEEE International Symposium on High Performance Computer Architecture, 2013

Memory reuse optimizations in the R-Stream compiler.

[DOI]

Proceedings of the 6th Workshop on General Purpose Processor Using Graphics Processing Units, 2013

2012

Scalable Cyber-Security for Terabit Cloud Computing.

[DOI]

Jordi Ros-Giralt

Péter Szilágyi

Proceedings of the 2012 SC Companion: High Performance Computing, 2012

Automatic communication optimizations through memory reuse strategies.

[DOI]

Proceedings of the 17th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2012

Efficient and scalable computations with sparse tensors.

[DOI]

Proceedings of the IEEE Conference on High Performance Extreme Computing, 2012

2011

R-Stream Compiler.

[DOI]

David Wohlford

Allen Leung

Proceedings of the Encyclopedia of Parallel Computing, 2011

2010

A mapping path for multi-GPGPU accelerated computers from a portable high level programming abstraction.

[DOI]

Allen Leung

David Wohlford

Cédric Bastoul

Proceedings of 3rd Workshop on General Purpose Processing on Graphics Processing Units, 2010

2007

Evaluation of Stream Virtual Machine on Raw Processor.

[DOI]

Proceedings of the 21th International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007

2006

Poster reception - Alef parallel SAT solver for HPC hardware.

[DOI]

Proceedings of the ACM/IEEE SC2006 Conference on High Performance Networking and Computing, 2006

2003

Introduction.

[DOI]

Commun. ACM, 2003

1998

Retrospective: the J-machine.

[DOI]

Proceedings of the 25 Years of the International Symposia on Computer Architecture (Selected Papers)., 1998

1997

Message-driven dynamics.

[DOI]

PhD thesis, 1997

1992

The message-driven processor: a multicomputer processing node with efficient mechanisms.

[DOI]

IEEE Micro, 1992

MDP Design Tools and Methods.

[DOI]