Jan Eitzinger

According to our database, Jan Eitzinger authored at least 46 papers between 2006 and 2023.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2023
MD-Bench: A performance-focused prototyping harness for state-of-the-art short-range molecular dynamics algorithms.
Future Gener. Comput. Syst., December, 2023

MD-Bench: Engineering the in-core performance of short-range molecular dynamics kernels from state-of-the-art simulation packages.
CoRR, 2023

2022
MD-Bench: A Generic Proxy-App Toolbox for State-of-the-Art Molecular Dynamics Algorithms.
Proceedings of the Parallel Processing and Applied Mathematics, 2022

2021
An instrumentation framework for performance analysis of Halide schedules.
J. Comput. Lang., 2021

tinyMD: Mapping molecular dynamics simulations to heterogeneous hardware using partial evaluation.
J. Comput. Sci., 2021

2020
tinyMD: A Portable and Scalable Implementation for Pairwise Interactions Simulations.
CoRR, 2020

2019
ClusterCockpit - A web application for job-specific performance monitoring.
Proceedings of the 2019 IEEE International Conference on Cluster Computing, 2019

2018
Unified Code Generation for the Parallel Computation of Pairwise Interactions Using Partial Evaluation.
Proceedings of the 17th International Symposium on Parallel and Distributed Computing, 2018

2017
Validation of hardware events for successful performance pattern identification in High Performance Computing.
CoRR, 2017

Kerncraft: A Tool for Analytic Performance Modeling of Loop Kernels.
CoRR, 2017

Performance analysis of the Kahan-enhanced scalar product on current multi-core and many-core processors.
Concurr. Comput. Pract. Exp., 2017

LIKWID Monitoring Stack: A Flexible Framework Enabling Job Specific Performance monitoring for the masses.
Proceedings of the 2017 IEEE International Conference on Cluster Computing, 2017

2016
Performance analysis of the Kahan-enhanced scalar product on current multi- and manycore processors.
CoRR, 2016

Chip-level and multi-node analysis of energy-optimized lattice Boltzmann CFD simulations.
Concurr. Comput. Pract. Exp., 2016

Exploring performance and power properties of modern multi-core chips via simple machine models.
Concurr. Comput. Pract. Exp., 2016

Analysis of Intel's Haswell Microarchitecture Using the ECM Model and Microbenchmarks.
Proceedings of the Architecture of Computing Systems - ARCS 2016, 2016

2015
Performance analysis of the Kahan-enhanced scalar product on current multicore processors.
CoRR, 2015

Execution-Cache-Memory Performance Model: Introduction and Validation.
CoRR, 2015

Automatic loop kernel analysis and performance modeling with Kerncraft.
Proceedings of the 6th International Workshop on Performance Modeling, 2015

Performance Analysis of the Kahan-Enhanced Scalar Product on Current Multicore Processors.
Proceedings of the Parallel Processing and Applied Mathematics, 2015

Quantifying Performance Bottlenecks of Stencil Computations Using the Execution-Cache-Memory Model.
Proceedings of the 29th ACM on International Conference on Supercomputing, 2015

2014
Tools and methods for measuring and tuning the energy efficiency of HPC systems.
Sci. Program., 2014

Comparing the performance of different x86 SIMD instruction sets for a medical imaging application on modern multi- and manycore chips.
Proceedings of the 2014 Workshop on Programming models for SIMD/Vector processing, 2014

Overhead Analysis of Performance Counter Measurements.
Proceedings of the 43rd International Conference on Parallel Processing Workshops, 2014

Performance Engineering for a Medical Imaging Application on the Intel Xeon Phi Accelerator.
Proceedings of the ARCS 2014, 2014

2013
Pushing the limits for medical image reconstruction on recent standard multicore processors.
Int. J. High Perform. Comput. Appl., 2013

Optimization of FASTEST-3D for Modern Multicore Systems.
CoRR, 2013

Optimizing IBM algorithmics' mark-to-future aggregation engine for real-time counterparty credit risk scoring.
Proceedings of WHPCF'13: 6th Workshop on High Performance Computational Finance, 2013

Topic 11: Multicore and Manycore Programming - (Introduction).
Proceedings of the Euro-Par 2013 Parallel Processing, 2013

2012
Expression Templates Revisited: A Performance Analysis of Current Methodologies.
SIAM J. Sci. Comput., 2012

Exploring performance and power properties of modern multicore chips via simple machine models.
CoRR, 2012

Best practices for HPM-assisted performance engineering on modern multicore processors.
CoRR, 2012

High performance smart expression template math libraries.
Proceedings of the 2012 International Conference on High Performance Computing & Simulation, 2012

Performance Patterns and Hardware Metrics on Modern Multicore Processors: Best Practices for Performance Engineering.
Proceedings of the Euro-Par 2012: Parallel Processing Workshops, 2012

2011
Efficient multicore-aware parallelization strategies for iterative stencil computations.
J. Comput. Sci., 2011

Expression Templates Revisited: A Performance Analysis of the Current ET Methodology.
CoRR, 2011

Poster: LIKWID: lightweight performance tools.
Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis, 2011

likwid-bench: An Extensible Microbenchmarking Platform for x86 Multicore Compute Nodes.
Proceedings of the Tools for High Performance Computing 2011, 2011

2010
Leveraging Shared Caches for Parallel Temporal Blocking of Stencil Codes on Multicore Processors and Clusters.
Parallel Process. Lett., 2010

LIKWID: A Lightweight Performance-Oriented Tool Suite for x86 Multicore Environments.
Proceedings of the 39th International Conference on Parallel Processing, 2010

LIKWID: Lightweight Performance Tools.
Proceedings of the Competence in High Performance Computing 2010, 2010

2009
Multi-core architectures: Complexities of performance prediction and the impact of cache topology.
CoRR, 2009

Introducing a Performance Model for Bandwidth-Limited Loop Kernels.
Proceedings of the Parallel Processing and Applied Mathematics, 2009

2008
Efficiency improvements of iterative numerical algorithms on modern architectures.
PhD thesis, 2008

Optimising a 3D multigrid algorithm for the IA-64 architecture.
Int. J. Comput. Sci. Eng., 2008

2006
ORCAN: A platform for complex parallel simulation software.
Proceedings of the ARCS 2006, 2006

