Hans Pabst

According to our database1, Hans Pabst authored at least 20 papers between 2010 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Towards a high-performance AI compiler with upstream MLIR.
CoRR, 2024

Harnessing Deep Learning and HPC Kernels via High-Level Loop and Tensor Abstractions on CPU Architectures.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2024

2023
Harnessing Deep Learning and HPC Kernels via High-Level Loop and Tensor Abstractions on CPU Architectures.
CoRR, 2023

2022
Towards electronic structure-based <i>ab-initio</i> molecular dynamics simulations with hundreds of millions of atoms.
Parallel Comput., 2022

Tensor Processing Primitives: A Programming Abstraction for Efficiency and Portability in Deep Learning and HPC Workloads.
Frontiers Appl. Math. Stat., 2022

2021
Enabling Electronic Structure-Based Ab-Initio Molecular Dynamics Simulations with Hundreds of Millions of Atoms.
CoRR, 2021

Tensor Processing Primitives: A Programming Abstraction for Efficiency and Portability in Deep Learning Workloads.
CoRR, 2021

Tensor processing primitives: a programming abstraction for efficiency and portability in deep learning workloads.
Proceedings of the International Conference for High Performance Computing, 2021

Reduced Precision Strategies for Deep Learning: A High Energy Physics Generative Adversarial Network Use Case.
Proceedings of the 10th International Conference on Pattern Recognition Applications and Methods, 2021

2020
Harnessing Deep Learning via a Single Building Block.
Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2020

2019
High-Performance Deep Learning via a Single Building Block.
CoRR, 2019

2018
Machine Learning in High Energy Physics Community White Paper.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
CoRR, 2018

Distributed Training of Generative Adversarial Networks for Fast Detector Simulation.
Proceedings of the High Performance Computing, 2018

Anatomy of high-performance deep learning convolutions on SIMD architectures.
Proceedings of the International Conference for High Performance Computing, 2018

2017
Porting of the DBCSR Library for Sparse Matrix-Matrix Multiplications to Intel Xeon Phi Systems.
Proceedings of the Parallel Computing is Everywhere, 2017

2016
Efficiency of High Order Spectral Element Methods on Petascale Architectures.
Proceedings of the High Performance Computing - 31st International Conference, 2016

LIBXSMM: accelerating small matrix multiplications by runtime code generation.
Proceedings of the International Conference for High Performance Computing, 2016

2012
Performance of a Structure-Detecting SpMV Using the CSR Matrix Representation.
Proceedings of the 11th International Symposium on Parallel and Distributed Computing, 2012

2011
Towards High-Performance Implementations of a Custom HPC Kernel Using ® Array Building Blocks.
Proceedings of the Facing the Multicore - Challenge II, 2011

2010
Multigrid optical flow for deformable medical volume registration.
Proceedings of the International Conference on Computer Graphics and Interactive Techniques, 2010


  Loading...