Hartwig Anzt
Orcid: 0000-0003-2177-952X
Affiliations:
- University of Tennessee, Knoxville, TN, USA
According to our database, Hartwig Anzt authored at least 127 papers between 2010 and 2025.
Bibliography
2025
Future Gener. Comput. Syst., 2025
Multifacets of lossy compression for scientific data in the Joint-Laboratory of Extreme Scale Computing.
Future Gener. Comput. Syst., 2025
2024
Ginkgo - A math library designed to accelerate Exascale Computing Project science applications.
Int. J. High Perform. Comput. Appl., 2024
Batched sparse and mixed-precision linear algebra interface for efficient use of GPU hardware accelerators in scientific applications.
Future Gener. Comput. Syst., 2024
Comput. Sci. Eng., 2024
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2024
2023
Future Gener. Comput. Syst., December, 2023
Integrating batched sparse iterative solvers for the collision operator in fusion plasma simulations on GPUs.
J. Parallel Distributed Comput., August, 2023
Int. J. High Perform. Comput. Appl., July, 2023
Int. J. High Perform. Comput. Appl., March, 2023
Using Ginkgo's memory accessor for improving the accuracy of memory-bound low precision BLAS.
Softw. Pract. Exp., 2023
GPU-Resident Sparse Direct Linear Solvers for Alternating Current Optimal Power Flow Analysis.
CoRR, 2023
Concurr. Comput. Pract. Exp., 2023
Sparse matrix-vector and matrix-multivector products for the truncated SVD on graphics processors.
Concurr. Comput. Pract. Exp., 2023
Proceedings of the High Performance Computing - 38th International Conference, 2023
Task-Based Polar Decomposition Using SLATE on Massively Parallel Systems with Hardware Accelerators.
Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023
Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023
Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023
Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2023
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2023
Proceedings of the Euro-Par 2023: Parallel Processing Workshops - Euro-Par 2023 International Workshops, Limassol, Cyprus, August 28, 2023
2022
ACM Trans. Math. Softw., 2022
Int. J. High Perform. Comput. Appl., 2022
Compression and load balancing for efficient sparse matrix-vector product on multicore processors and graphics processing units.
Concurr. Comput. Pract. Exp., 2022
Proceedings of the Accelerating Science and Engineering Discoveries Through Integrated Research Infrastructure for Experiment, Big Data, Modeling and Simulation, 2022
Proceedings of the IEEE/ACM Workshop on Latest Advances in Scalable Algorithms for Large-Scale Heterogeneous Systems, 2022
Proceedings of the 2022 SIAM Conference on Parallel Processing for Scientific Computing, 2022
Proceedings of the Parallel Processing and Applied Mathematics, 2022
Batched sparse iterative solvers on GPU for the collision operator for fusion plasma simulations.
Proceedings of the 2022 IEEE International Parallel and Distributed Processing Symposium, 2022
Proceedings of the IEEE/ACM International Workshop on Education for High Performance Computing, 2022
2021
Adaptive Precision Block-Jacobi for High Performance Preconditioning in the Ginkgo Linear Algebra Software.
ACM Trans. Math. Softw., 2021
Crediting pull requests to open source research software as an academic contribution.
J. Comput. Sci., 2021
Int. J. High Perform. Comput. Appl., 2021
Int. J. High Perform. Comput. Appl., 2021
Proceedings of the 12th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems, 2021
Proceedings of the Computational Science - ICCS 2021, 2021
Proceedings of the Euro-Par 2021: Parallel Processing Workshops, 2021
Mixed Precision Incomplete and Factorized Sparse Approximate Inverse Preconditioning on GPUs.
Proceedings of the Euro-Par 2021: Parallel Processing, 2021
2020
Dataset, March, 2020
ACM Trans. Parallel Comput., 2020
ACM Trans. Parallel Comput., 2020
J. Open Source Softw., 2020
An environment for sustainable research software in Germany and beyond: current state, open challenges, and call for action.
F1000Research, 2020
Evaluating the Performance of NVIDIA's A100 Ampere GPU for Sparse Linear Algebra Computations.
CoRR, 2020
An Environment for Sustainable Research Software in Germany and Beyond: Current State, Open Challenges, and Call for Action.
CoRR, 2020
A customized precision format based on mantissa segmentation for accelerating sparse linear algebra.
Concurr. Comput. Pract. Exp., 2020
Proceedings of the High Performance Computing - 35th International Conference, 2020
Proceedings of the 11th IEEE/ACM Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems, 2020
Evaluating the Performance of NVIDIA's A100 Ampere GPU for Sparse and Batched Computations.
Proceedings of the 2020 IEEE/ACM Performance Modeling, 2020
Proceedings of the 2020 IEEE High Performance Extreme Computing Conference, 2020
Proceedings of the Euro-Par 2020: Parallel Processing Workshops, 2020
Proceedings of the Euro-Par 2020: Parallel Processing, 2020
Balanced and Compressed Coordinate Layout for the Sparse Matrix-Vector Product on GPUs.
Proceedings of the Euro-Par 2020: Parallel Processing Workshops, 2020
2019
Variable-size batched Gauss-Jordan elimination for block-Jacobi preconditioning on graphics processors.
Parallel Comput., 2019
Int. J. High Perform. Comput. Appl., 2019
Int. J. High Perform. Comput. Appl., 2019
Adaptive precision in block-Jacobi preconditioning for iterative sparse linear system solvers.
Concurr. Comput. Pract. Exp., 2019
Towards Continuous Benchmarking: An Automated Performance Evaluation Framework for High Performance Software.
Proceedings of the Platform for Advanced Scientific Computing Conference, 2019
Proceedings of the IEEE International Parallel and Distributed Processing Symposium Workshops, 2019
Proceedings of the 2019 IEEE International Parallel and Distributed Processing Symposium, 2019
Proceedings of the IEEE International Parallel and Distributed Processing Symposium Workshops, 2019
2018
Parallel Comput., 2018
Using Jacobi iterations and blocking for solving sparse triangular systems in incomplete factorization preconditioning.
J. Parallel Distributed Comput., 2018
Int. J. High Perform. Comput. Appl., 2018
Residual Replacement in Mixed-Precision Iterative Refinement for Sparse Linear Systems.
Proceedings of the High Performance Computing, 2018
High-Performance GPU Implementation of PageRank with Reduced Precision Based on Mantissa Segmentation.
Proceedings of the 8th IEEE/ACM Workshop on Irregular Applications: Architectures and Algorithms, 2018
Proceedings of the 30th International Symposium on Computer Architecture and High Performance Computing, 2018
Proceedings of the 30th International Symposium on Computer Architecture and High Performance Computing, 2018
Proceedings of the Euro-Par 2018: Parallel Processing Workshops, 2018
2017
Int. J. High Perform. Comput. Appl., 2017
Proceedings of the Seventh Workshop on Irregular Applications: Architectures and Algorithms, 2017
Proceedings of the 8th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems, 2017
Proceedings of the 8th International Workshop on Programming Models and Applications for Multicores and Manycores, 2017
Variable-Size Batched LU for Small Matrices and Its Integration into Block-Jacobi Preconditioning.
Proceedings of the 46th International Conference on Parallel Processing, 2017
Proceedings of the International Conference on Computational Science, 2017
Proceedings of the Handbook of Big Data Technologies, 2017
2016
Proceedings of the Software for Exascale Computing - SPPEXA 2013-2015, 2016
Implementation and Tuning of Batched Cholesky Factorization and Solve for NVIDIA GPUs.
IEEE Trans. Parallel Distributed Syst., 2016
Numer. Algorithms, 2016
Acta Numer., 2016
Proceedings of the High Performance Computing for Computational Science - VECPAR 2016, 2016
Proceedings of the 7th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems, 2016
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium Workshops, 2016
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium Workshops, 2016
2015
Int. J. High Perform. Comput. Appl., 2015
Concurr. Comput. Pract. Exp., 2015
Unveiling the performance-energy trade-off in iterative linear system solvers for multithreaded processors.
Concurr. Comput. Pract. Exp., 2015
Proceedings of the High Performance Computing - 30th International Conference, 2015
Proceedings of the Symposium on High Performance Computing, 2015
GPU-accelerated co-design of induced dimension reduction: algorithmic fusion and kernel overlap.
Proceedings of the 2nd International Workshop on Hardware-Software Co-Design for High Performance Computing, 2015
Proceedings of the 6th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems, 2015
Proceedings of the 3rd International Workshop on Energy Efficient Supercomputing, 2015
Energy efficiency and performance frontiers for sparse computations on GPU supercomputers.
Proceedings of the Sixth International Workshop on Programming Models and Applications for Multicores and Manycores, 2015
Proceedings of the Euro-Par 2015: Parallel Processing, 2015
Proceedings of the 2015 IEEE International Conference on Big Data (IEEE BigData 2015), Santa Clara, CA, USA, October 29, 2015
2014
Self-adaptive Multiprecision Preconditioners on Multicore and Manycore Architectures.
Proceedings of the High Performance Computing for Computational Science - VECPAR 2014 - 11th International Conference, Eugene, OR, USA, June 30, 2014
Proceedings of the 2014 IEEE 28th International Parallel and Distributed Processing Symposium, 2014
Proceedings of the 2014 IEEE International Parallel & Distributed Processing Symposium Workshops, 2014
Proceedings of the 2014 IEEE International Parallel & Distributed Processing Symposium Workshops, 2014
2013
J. Parallel Distributed Comput., 2013
Performance and Energy Analysis of the Iterative Solution of Sparse Linear Systems on Multicore and Manycore Architectures.
Proceedings of the Parallel Processing and Applied Mathematics, 2013
Reformulated Conjugate Gradient for the Energy-Aware Solution of Linear Systems on GPUs.
Proceedings of the 42nd International Conference on Parallel Processing, 2013
2012
Proceedings of the International Conference on Computational Science, 2012
Optimization of power consumption in the iterative solution of sparse linear systems on graphics processors.
Comput. Sci. Res. Dev., 2012
Proceedings of the Euro-Par 2012: Parallel Processing Workshops, 2012
GPU-Accelerated Asynchronous Error Correction for Mixed Precision Iterative Refinement.
Proceedings of the Euro-Par 2012 Parallel Processing - 18th International Conference, 2012
2011
Proceedings of the Tools for High Performance Computing 2011, 2011
Power Consumption of Mixed Precision in the Iterative Solution of Sparse Linear Systems.
Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011
Analysis and optimization of power consumption in the iterative solution of sparse linear systems on multi-core and many-core platforms.
Proceedings of the 2011 International Green Computing Conference and Workshops, 2011
2010
Energy efficiency of mixed precision iterative refinement methods using hybrid hardware platforms - An evaluation of different solver and hardware configurations.
Comput. Sci. Res. Dev., 2010
An Error Correction Solver for Linear Systems: Evaluation of Mixed Precision Implementations.
Proceedings of the High Performance Computing for Computational Science - VECPAR 2010, 2010
Mixed Precision Iterative Refinement Methods for Linear Systems: Convergence Analysis Based on Krylov Subspace Methods.
Proceedings of the Applied Parallel and Scientific Computing, 2010