Holger Fehske

Orcid: 0000-0003-2146-8203

According to our database, Holger Fehske authored at least 28 papers between 2002 and 2024.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2024
Cache Blocking of Distributed-Memory Parallel Matrix Power Kernels.
CoRR, 2024

2023
Orthogonal Layers of Parallelism in Large-Scale Eigenvalue Computations.
ACM Trans. Parallel Comput., September, 2023

Algebraic Temporal Blocking for Sparse Iterative Solvers on Multi-Core CPUs.
CoRR, 2023

2021
A domain-specific language and matrix-free stencil code for investigating electronic properties of Dirac and topological materials.
Int. J. High Perform. Comput. Appl., 2021

2020

A Recursive Algebraic Coloring Technique for Hardware-efficient Symmetric Sparse Matrix-vector Multiplication.
ACM Trans. Parallel Comput., 2020

Understanding HPC Benchmark Performance on Intel Broadwell and Cascade Lake Processors.
Proceedings of the High Performance Computing - 35th International Conference, 2020

2018
Chebyshev Filter Diagonalization on Modern Manycore Processors and GPGPUs.
CoRR, 2018

Chebyshev Filter Diagonalization on Modern Manycore Processors and GPGPUs.
Proceedings of the High Performance Computing - 33rd International Conference, 2018

2017
GHOST: Building Blocks for High Performance Sparse Linear Algebra on Heterogeneous Systems.
Int. J. Parallel Program., 2017

PVSC-DTM: A domain-specific language and matrix-free stencil code for investigating electronic properties of Dirac and topological materials.
CoRR, 2017

2016
Towards an Exascale Enabled Sparse Solver Repository.
Proceedings of the Software for Exascale Computing - SPPEXA 2013-2015, 2016

Performance Engineering and Energy Efficiency of Building Blocks for Large, Sparse Eigenvalue Computations on Heterogeneous Supercomputers.
Proceedings of the Software for Exascale Computing - SPPEXA 2013-2015, 2016

High-performance implementation of Chebyshev filter diagonalization for interior eigenvalue computations.
J. Comput. Phys., 2016

2015
Increasing the Performance of the Jacobi-Davidson Method by Blocking.
SIAM J. Sci. Comput., 2015

Performance Engineering of the Kernel Polynomial Method on Large-Scale CPU-GPU Systems.
Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium, 2015

2014
A Unified Sparse Matrix Data Format for Efficient General Sparse Matrix-Vector Multiplication on Modern Processors with Wide SIMD Units.
SIAM J. Sci. Comput., 2014

Performance Engineering of the Kernel Polynomial Method on Large-Scale CPU-GPU Systems.
CoRR, 2014

ESSEX: Equipping Sparse Solvers for Exascale.
Proceedings of the Euro-Par 2014: Parallel Processing Workshops, 2014

2013
A unified sparse matrix data format for modern processors with wide SIMD units.
CoRR, 2013

2012
Sparse Matrix-vector Multiplication on GPGPU Clusters: A New Storage Format and a Scalable Implementation.
Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium Workshops & PhD Forum, 2012

2011
Hybrid-Parallel Sparse Matrix-Vector Multiplication with Explicit Communication Overlap on Current Multicore-Based Systems.
Parallel Process. Lett., 2011

High-order commutator-free exponential time-propagation of driven quantum systems.
J. Comput. Phys., 2011

Parallel Sparse Matrix-Vector Multiplication as a Test Case for Hybrid MPI+OpenMP Programming.
Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011

2009
Performance limitations for sparse matrix-vector multiplications on current multicore environments.
CoRR, 2009

Efficient Temporal Blocking for Stencil Computations by Multicore-Aware Wavefront Parallelization.
Proceedings of the 33rd Annual IEEE International Computer Software and Applications Conference, 2009

2003
Exact Numerical Treatment of Finite Quantum Systems Using Leading-Edge Supercomputers.
Proceedings of the Modeling, 2003

2002
Fast Sparse Matrix-Vector Multiplication for TeraFlop/s Computers.
Proceedings of the High Performance Computing for Computational Science, 2002

