Holger Fehske
Orcid: 0000-0003-2146-8203
According to our database1,
Holger Fehske
authored at least 28 papers
between 2002 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2024
2023
ACM Trans. Parallel Comput., September, 2023
CoRR, 2023
2021
A domain-specific language and matrix-free stencil code for investigating electronic properties of Dirac and topological materials.
Int. J. High Perform. Comput. Appl., 2021
2020
Proceedings of the Software for Exascale Computing - SPPEXA 2016-2019, 2020
A Recursive Algebraic Coloring Technique for Hardware-efficient Symmetric Sparse Matrix-vector Multiplication.
ACM Trans. Parallel Comput., 2020
Understanding HPC Benchmark Performance on Intel Broadwell and Cascade Lake Processors.
Proceedings of the High Performance Computing - 35th International Conference, 2020
2018
CoRR, 2018
Proceedings of the High Performance Computing - 33rd International Conference, 2018
2017
GHOST: Building Blocks for High Performance Sparse Linear Algebra on Heterogeneous Systems.
Int. J. Parallel Program., 2017
PVSC-DTM: A domain-specific language and matrix-free stencil code for investigating electronic properties of Dirac and topological materials.
CoRR, 2017
2016
Proceedings of the Software for Exascale Computing - SPPEXA 2013-2015, 2016
Performance Engineering and Energy Efficiency of Building Blocks for Large, Sparse Eigenvalue Computations on Heterogeneous Supercomputers.
Proceedings of the Software for Exascale Computing - SPPEXA 2013-2015, 2016
High-performance implementation of Chebyshev filter diagonalization for interior eigenvalue computations.
J. Comput. Phys., 2016
2015
SIAM J. Sci. Comput., 2015
Performance Engineering of the Kernel Polynomal Method on Large-Scale CPU-GPU Systems.
Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium, 2015
2014
A Unified Sparse Matrix Data Format for Efficient General Sparse Matrix-Vector Multiplication on Modern Processors with Wide SIMD Units.
SIAM J. Sci. Comput., 2014
Performance Engineering of the Kernel Polynomial Method on Large-Scale CPU-GPU Systems.
CoRR, 2014
Proceedings of the Euro-Par 2014: Parallel Processing Workshops, 2014
2013
CoRR, 2013
2012
Sparse Matrix-vector Multiplication on GPGPU Clusters: A New Storage Format and a Scalable Implementation.
Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium Workshops & PhD Forum, 2012
2011
Hybrid-Parallel Sparse Matrix-Vector Multiplication with Explicit Communication Overlap on Current Multicore-Based Systems.
Parallel Process. Lett., 2011
J. Comput. Phys., 2011
Parallel Sparse Matrix-Vector Multiplication as a Test Case for Hybrid MPI+OpenMP Programming.
Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011
2009
Performance limitations for sparse matrix-vector multiplications on current multicore environments
CoRR, 2009
Efficient Temporal Blocking for Stencil Computations by Multicore-Aware Wavefront Parallelization.
Proceedings of the 33rd Annual IEEE International Computer Software and Applications Conference, 2009
2003
Exact Numerical Treatment of Finite Quantum Systems Using Leading-Edge Supercomputers.
Proceedings of the Modeling, 2003
2002
Proceedings of the High Performance Computing for Computational Science, 2002