Sivasankaran Rajamanickam

SIAM J. Sci. Comput., 2017

Basker: Parallel sparse LU factorization utilizing hierarchical parallelism and data layouts.

[BibT_eX]

[DOI]

Joshua Dennis Booth

Nathan D. Ellingwood

Heidi K. Thornquist

Parallel Comput., 2017

Distributed Graph Layout for Scalable Small-world Network Analysis.

[BibT_eX]

[DOI]

CoRR, 2017

Designing vector-friendly compact BLAS and LAPACK kernels.

[BibT_eX]

[DOI]

Proceedings of the International Conference for High Performance Computing, 2017

Order or Shuffle: Empirically Evaluating Vertex Order Impact on Parallel Graph Computations.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium Workshops, 2017

Partitioning Trillion-Edge Graphs in Minutes.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium, 2017

Performance-Portable Sparse Matrix-Matrix Multiplication for Many-Core Architectures.

[BibT_eX]

[DOI]

Christian Trott

Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium Workshops, 2017

Fast linear algebra-based triangle counting with KokkosKernels.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE High Performance Extreme Computing Conference, 2017

2016

Multi-Jagged: A Scalable Parallel Spatial Partitioning Algorithm.

[BibT_eX]

[DOI]

Ümit V. Çatalyürek

IEEE Trans. Parallel Distributed Syst., 2016

Complex Network Partitioning Using Label Propagation.

[BibT_eX]

[DOI]

SIAM J. Sci. Comput., 2016

Task Parallel Incomplete Cholesky Factorization using 2D Partitioned-Block Layout.

[BibT_eX]

[DOI]

Kyungjoo Kim

George Stelle

H. Carter Edwards

Stephen L. Olivier

CoRR, 2016

A survey of direct methods for sparse linear systems.

[BibT_eX]

[DOI]

Timothy A. Davis

Wissam M. Sid-Lakhdar

Acta Numer., 2016

A Case Study of Complex Graph Analysis in Distributed Memory: Implementation and Optimization.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium, 2016

Parallel Graph Coloring for Manycore Architectures.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium, 2016

Basker: A Threaded Sparse LU Factorization Utilizing Hierarchical Parallelism and Data Layouts.

[BibT_eX]

[DOI]

Joshua Dennis Booth

Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium Workshops, 2016

A Comparison of High-Level Programming Choices for Incomplete Sparse Factorization Across Different Architectures.

[BibT_eX]

[DOI]

Joshua Dennis Booth

Kyungjoo Kim

Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium Workshops, 2016

2015

High-Performance Graph Analytics on Manycore Processors.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium, 2015

2014

Towards Extreme-Scale Simulations for Low Mach Fluids with Second-Generation Trilinos.

[BibT_eX]

[DOI]

Paul T. Lin

Matthew T. Bettencourt

Christopher M. Siefert

Stephen Kennon

Parallel Process. Lett., 2014

A Hybrid Approach for Parallel Transistor-Level Full-Chip Circuit Simulation.

[BibT_eX]

[DOI]

Heidi K. Thornquist

Proceedings of the High Performance Computing for Computational Science - VECPAR 2014 - 11th International Conference, Eugene, OR, USA, June 30, 2014

Domain Decomposition Preconditioners for Communication-Avoiding Krylov Methods on a Hybrid CPU/GPU Cluster.

[BibT_eX]

[DOI]

Ichitaro Yamazaki

Proceedings of the International Conference for High Performance Computing, 2014

BFS and Coloring-Based Parallel Algorithms for Strongly Connected Components and Related Problems.

[BibT_eX]

[DOI]

Proceedings of the 2014 IEEE 28th International Parallel and Distributed Processing Symposium, 2014

Towards Extreme-Scale Simulations with Next-Generation Trilinos: A Low Mach Fluid Application Case Study.

[BibT_eX]

[DOI]

Paul T. Lin

Matthew T. Bettencourt

Christopher M. Siefert

Eric C. Cyr

Stephen Kennon

Proceedings of the 2014 IEEE International Parallel & Distributed Processing Symposium Workshops, 2014

Exploiting Geometric Partitioning in Task Mapping for Parallel Computers.

[BibT_eX]

[DOI]

Proceedings of the 2014 IEEE 28th International Parallel and Distributed Processing Symposium, 2014

Building blocks for graph based network analysis.

[BibT_eX]

[DOI]

Vladimir Ufimtsev

Sanjukta Bhowmick

Proceedings of the IEEE High Performance Extreme Computing Conference, 2014

PuLP: Scalable multi-objective multi-constraint partitioning for small-world networks.

[BibT_eX]

[DOI]

Proceedings of the 2014 IEEE International Conference on Big Data (IEEE BigData 2014), 2014

2013

Electrical modeling and simulation for stockpile stewardship.

[BibT_eX]

[DOI]

Eric R. Keiter

XRDS, 2013

Scalable matrix computations on large scale-free graphs using 2D graph partitioning.

[BibT_eX]

[DOI]

Proceedings of the International Conference for High Performance Computing, 2013

2012

Amesos2 and Belos: Direct and iterative solvers for large sparse linear systems.

[BibT_eX]

[DOI]

Eric Bavier

Mark Hoemmen

Sci. Program., 2012

ShyLU: A Hybrid-Hybrid Solver for Multicore Platforms.

[BibT_eX]

[DOI]

Michael A. Heroux

Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium, 2012

Multithreaded Algorithms for Maxmum Matching in Bipartite Graphs.

[BibT_eX]

[DOI]

Ariful Azad

Mahantesh Halappanavar

Arif M. Khan

Alex Pothen

Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium, 2012

Parallel partitioning with Zoltan: Is hypergraph partitioning worth it?

[BibT_eX]

[DOI]

Proceedings of the Graph Partitioning and Graph Clustering, 2012

2011

Poster: a hybrid-hybrid solver for manycore platforms.

[BibT_eX]

[DOI]

Michael A. Heroux

Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis, 2011

Enabling Next-Generation Parallel Circuit Simulation with Trilinos.

[BibT_eX]

[DOI]

Rich Schiek

Proceedings of the Euro-Par 2011: Parallel Processing Workshops - CCPI, CGWS, HeteroPar, HiBB, HPCVirt, HPPC, HPSS, MDGS, ProPer, Resilience, UCHPC, VHPC, Bordeaux, France, August 29, 2011

2008

Algorithm 887: CHOLMOD, Supernodal Sparse Cholesky Factorization and Update/Downdate.

[BibT_eX]

[DOI]

Yanqing Chen

Timothy A. Davis

William W. Hager