Grey Ballard
Orcid: 0000-0003-1557-8027
According to our database1,
Grey Ballard
authored at least 77 papers
between 2006 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2024
Communication Lower Bounds and Optimal Algorithms for Multiple Tensor-Times-Matrix Computation.
SIAM J. Matrix Anal. Appl., March, 2024
CoRR, 2024
Proceedings of the 2024 SIAM Conference on Parallel Processing for Scientific Computing, 2024
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2024
2023
Numer. Linear Algebra Appl., December, 2023
SIAM J. Sci. Comput., February, 2023
Proceedings of the 35th ACM Symposium on Parallelism in Algorithms and Architectures, 2023
Proceedings of the 37th International Conference on Supercomputing, 2023
2022
CoRR, 2022
Brief Announcement: Tight Memory-Independent Parallel Matrix Multiplication Communication Lower Bounds.
Proceedings of the SPAA '22: 34th ACM Symposium on Parallelism in Algorithms and Architectures, Philadelphia, PA, USA, July 11, 2022
Proceedings of the 2022 IEEE International Parallel and Distributed Processing Symposium, 2022
2021
ACM Trans. Math. Softw., 2021
Accelerating Neural Network Training using Arbitrary Precision Approximating Matrix Multiplication Algorithms.
Proceedings of the ICPP Workshops 2021: 50th International Conference on Parallel Processing, 2021
Proceedings of the ICPP 2021: 50th International Conference on Parallel Processing, Lemont, IL, USA, August 9, 2021
Proceedings of the 9th IEEE/ACM Workshop on Education for High Performance Computing, 2021
2020
TuckerMPI: A Parallel C++/MPI Software Package for Large-scale Data Compression via the Tucker Tensor Decomposition.
ACM Trans. Math. Softw., 2020
Proceedings of the International Conference for High Performance Computing, 2020
Proceedings of the 2020 SIAM Conference on Parallel Processing for Scientific Computing, 2020
Proceedings of the 27th IEEE International Conference on High Performance Computing, 2020
2019
Joint 3D Localization and Classification of Space Debris using a Multispectral Rotating Point Spread Function.
CoRR, 2019
Dynamic Functional Magnetic Resonance Imaging Connectivity Tensor Decomposition: A New Approach to Analyze and Interpret Dynamic Brain Connectivity.
Brain Connect., 2019
2018
MPI-FAUN: An MPI-Based Framework for Alternating-Updating Nonnegative Matrix Factorization.
IEEE Trans. Knowl. Data Eng., 2018
CoRR, 2018
Proceedings of the 30th on Symposium on Parallelism in Algorithms and Architectures, 2018
Proceedings of the 23rd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2018
Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium, 2018
Partitioning and Communication Strategies for Sparse Non-negative Matrix Factorization.
Proceedings of the 47th International Conference on Parallel Processing, 2018
Proceedings of the 25th IEEE International Conference on High Performance Computing, 2018
2017
ACM Trans. Parallel Comput., 2017
Proceedings of the 29th ACM Symposium on Parallelism in Algorithms and Architectures, 2017
2016
ACM Trans. Parallel Comput., 2016
Reducing Communication Costs for Sparse Matrix Multiplication within Algebraic Multigrid.
SIAM J. Sci. Comput., 2016
SIAM J. Sci. Comput., 2016
SIAM J. Matrix Anal. Appl., 2016
Proceedings of the First International Workshop on Communication Optimizations in HPC, 2016
Proceedings of the 21st ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2016
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium, 2016
2015
ACM Trans. Parallel Comput., 2015
J. Parallel Distributed Comput., 2015
CoRR, 2015
Brief Announcement: Hypergraph Partitioning for Parallel Sparse Matrix-Matrix Multiplication.
Proceedings of the 27th ACM on Symposium on Parallelism in Algorithms and Architectures, 2015
Proceedings of the 20th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2015
Proceedings of the 2015 IEEE International Conference on Data Mining, 2015
2014
SIAM J. Matrix Anal. Appl., 2014
Acta Numer., 2014
Proceedings of the 2014 IEEE 28th International Parallel and Distributed Processing Symposium, 2014
2013
Communication efficient gaussian elimination with partial pivoting using a shape morphing data layout.
Proceedings of the 25th ACM Symposium on Parallelism in Algorithms and Architectures, 2013
Proceedings of the 25th ACM Symposium on Parallelism in Algorithms and Architectures, 2013
Implementing a Blocked Aasen's Algorithm with a Dynamic Scheduler on Multicore Architectures.
Proceedings of the 27th IEEE International Symposium on Parallel and Distributed Processing, 2013
2012
Strong Scaling of Matrix Multiplication Algorithms and Memory-Independent Communication Lower Bounds
CoRR, 2012
Proceedings of the 24th ACM Symposium on Parallelism in Algorithms and Architectures, 2012
Brief announcement: strong scaling of matrix multiplication algorithms and memory-independent communication lower bounds.
Proceedings of the 24th ACM Symposium on Parallelism in Algorithms and Architectures, 2012
Proceedings of the SC Conference on High Performance Computing Networking, 2012
Proceedings of the 17th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2012
Graph Expansion Analysis for Communication Costs of Fast Rectangular Matrix Multiplication.
Proceedings of the Design and Analysis of Algorithms, 2012
2011
SIAM J. Matrix Anal. Appl., 2011
Graph expansion and communication costs of fast matrix multiplication: regular submission.
Proceedings of the SPAA 2011: Proceedings of the 23rd Annual ACM Symposium on Parallelism in Algorithms and Architectures, 2011
Proceedings of the SPAA 2011: Proceedings of the 23rd Annual ACM Symposium on Parallelism in Algorithms and Architectures, 2011
Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011
Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011
2010
SIAM J. Sci. Comput., 2010
CoRR, 2010
2009
Communication-optimal parallel and sequential Cholesky decomposition: extended abstract.
Proceedings of the SPAA 2009: Proceedings of the 21st Annual ACM Symposium on Parallelism in Algorithms and Architectures, 2009
2006