Bernd Mohr

Orcid: 0000-0001-9960-5867

Affiliations:
  • Jülich Supercomputing Centre, Germany


According to our database1, Bernd Mohr authored at least 107 papers between 1987 and 2022.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2022
Conquering Noise With Hardware Counters on HPC Systems.
Proceedings of the IEEE/ACM Workshop on Programming and Performance Visualization Tools, 2022

Early Experiences of Noise-Sensitivity Performance Analysis of a Distributed Deep Learning Framework.
Proceedings of the IEEE International Conference on Cluster Computing, 2022

2020
Tools for GPU Computing - Debugging and Performance Analysis of Heterogenous HPC Applications.
Supercomput. Front. Innov., 2020

2019
Score-P and OMPT: Navigating the Perils of Callback-Driven Parallel Runtime Introspection.
Proceedings of the OpenMP: Conquering the Full Hardware Spectrum, 2019

2018
Scalasca analysis report of the ASCI Sweep3D benchmark on 294,912 processes in virtual-node mode on IBM Blue Gene/P with manually annotated iterations.
Dataset, August, 2018

Scalasca analysis report of the ASCI Sweep3D benchmark on 65,536 processes in virtual-node mode on IBM Blue Gene/P.
Dataset, April, 2018

Scalasca analysis report of the ASCI Sweep3D benchmark on 294,912 processes in virtual-node mode on IBM Blue Gene/P.
Dataset, April, 2018

The High-Q Club: Experience with Extreme-scaling Application Codes.
Supercomput. Front. Innov., 2018

Big data and extreme-scale computing.
Int. J. High Perform. Comput. Appl., 2018

2017
Performance Portability in Extreme Scale Computing (Dagstuhl Seminar 17431).
Dagstuhl Reports, 2017

2016
Automatic Performance Modeling of HPC Applications.
Proceedings of the Software for Exascale Computing - SPPEXA 2013-2015, 2016

Using Directed Variance to Identify Meaningful Views in Call-Path Performance Profiles.
Proceedings of the Third Workshop on Visual Performance Analysis, 2016

Correlating sub-phenomena in performance data in the frequency domain.
Proceedings of the 6th IEEE Symposium on Large Data Analysis and Visualization, 2016

Performance Optimization of Parallel Applications in Diverse On-Demand Development Teams.
Proceedings of the High-Performance Scientific Computing, 2016

2015
Connecting Performance Analysis and Visualization (Dagstuhl Perspectives Workshop 14022).
Dagstuhl Manifestos, 2015

The Mont-Blanc Project: First Phase Successfully Finished.
CoRR, 2015

Separating the wheat from the chaff: identifying relevant and similar performance data with visual analytics.
Proceedings of the 2nd Workshop on Visual Performance Analysis, 2015

Cube v4: From Performance Report Explorer to Performance Analysis Tool.
Proceedings of the International Conference on Computational Science, 2015

2014
Scalable parallel performance measurement and analysis tools - state-of-the-art and future challenges.
Supercomput. Front. Innov., 2014

Connecting Performance Analysis and Visualization to Advance Extreme Scale Computing (Dagstuhl Perspectives Workshop 14022).
Dagstuhl Reports, 2014

Modeling CPU Energy Consumption of HPC Applications on the IBM POWER7.
Proceedings of the 22nd Euromicro International Conference on Parallel, 2014

Catwalk: A Quick Development Path for Performance Models.
Proceedings of the Euro-Par 2014: Parallel Processing Workshops, 2014

Performance Measurement and Analysis of Transactional Memory and Speculative Execution on IBM Blue Gene/Q.
Proceedings of the Euro-Par 2014 Parallel Processing, 2014

2013
Early experiences with scientific applications on the IBM Blue Gene/Q supercomputer.
IBM J. Res. Dev., 2013

Profiling Hybrid HMPP Applications with Score-P on Heterogeneous Hardware.
Proceedings of the Parallel Computing: Accelerating Computational Science and Engineering (CSE), 2013

Binary Instrumentation for Scalable Performance Measurement of OpenMP Applications.
Proceedings of the Parallel Computing: Accelerating Computational Science and Engineering (CSE), 2013

2012
Scalable detection of MPI-2 remote memory access inefficiency patterns.
Int. J. High Perform. Comput. Appl., 2012

Determine energy-saving potential in wait-states of large-scale parallel programs.
Comput. Sci. Res. Dev., 2012

The HOPSA Workflow and Tools.
Proceedings of the Tools for High Performance Computing 2012, 2012

Extending Scalasca's Analysis Features.
Proceedings of the Tools for High Performance Computing 2012, 2012

Performance Analysis Techniques for Task-Based OpenMP Applications.
Proceedings of the OpenMP in a Heterogeneous World - 8th International Workshop on OpenMP, 2012

2011
PMPI Tools.
Proceedings of the Encyclopedia of Parallel Computing, 2011

The International Exascale Software Project roadmap.
Int. J. High Perform. Comput. Appl., 2011

Electronic poster: eeclust: energy-efficient cluster computing.
Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis, 2011

Managing hardware power saving modes for high performance computing.
Proceedings of the 2011 International Green Computing Conference and Workshops, 2011

Score-P.
Proceedings of the Entwicklung und Evolution von Forschungssoftware: Tagungsband des Workshops, 2011

2010
Large-Scale Performance Analysis of Sweep3D with the Scalasca Toolset.
Parallel Process. Lett., 2010

Performance measurement and analysis tools for extremely scalable systems.
Concurr. Comput. Pract. Exp., 2010

The Scalasca performance toolset architecture.
Concurr. Comput. Pract. Exp., 2010

How to Reconcile Event-Based Performance Analysis with Tasking in OpenMP.
Proceedings of the Beyond Loop Level Parallelism in OpenMP: Accelerators, 2010

Performance analysis of Sweep3D on Blue Gene/P with the Scalasca toolset.
Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

Guided Performance Analysis Combining Profile and Trace Tools.
Proceedings of the Euro-Par 2010 Parallel Processing Workshops, 2010

10181 Executive Summary - Program Development for Extreme-Scale Computing.
Proceedings of the Program Development for Extreme-Scale Computing, 02.05. - 07.05.2010, 2010

10181 Abstracts Collection - Program Development for Extreme-Scale Computing.
Proceedings of the Program Development for Extreme-Scale Computing, 02.05. - 07.05.2010, 2010

eeClust: Energy-Efficient Cluster Computing.
Proceedings of the Competence in High Performance Computing 2010, 2010

2009
A scalable tool architecture for diagnosing wait states in massively parallel applications.
Parallel Comput., 2009

Performance at Exascale.
Int. J. High Perform. Comput. Appl., 2009

Summary of the IESP White Papers.
Int. J. High Perform. Comput. Appl., 2009

Recent Developments in the Scalasca Toolset.
Proceedings of the Tools for High Performance Computing 2009, 2009

Enhanced Performance Analysis of Multi-core Applications with an Integrated Tool-chain - Using Scalasca and Vampir to Optimise the Metal Forming Simulation FE Software INDEED.
Proceedings of the Parallel Computing: From Multicores and GPU's to Petascale, 2009

How to Accelerate an Application: a Practical Case Study in Combustion Modelling.
Proceedings of the Parallel Computing: From Multicores and GPU's to Petascale, 2009

Parallel Programming Tools for Multi-core Architectures.
Proceedings of the Parallel Computing: From Multicores and GPU's to Petascale, 2009

Tools for scalable performance analysis on Petascale systems.
Proceedings of the 23rd IEEE International Symposium on Parallel and Distributed Processing, 2009

2008
Usage of the SCALASCA toolset for scalable performance analysis of large-scale parallel applications.
Proceedings of the Tools for High Performance Computing, 2008

2007
Selected papers from EuroPVM/MPI 2006.
Parallel Comput., 2007

Automatic analysis of inefficiency patterns in parallel applications.
Concurr. Comput. Pract. Exp., 2007

A framework for performance analysis of Co-Array Fortran.
Concurr. Comput. Pract. Exp., 2007

A test suite for parallel performance analysis tools.
Concurr. Comput. Pract. Exp., 2007

To Infinity and Beyond?! On Scaling Performance Measurement and Analysis Tools for Parallel Programming.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 14th European PVM/MPI User's Group Meeting, Paris, France, September 30, 2007

Automatic Trace-Based Performance Analysis of Metacomputing Applications.
Proceedings of the 21th International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007

07341 Abstracts Collection - Code Instrumentation and Modeling for Parallel Performance Analysis.
Proceedings of the Code Instrumentation and Modeling for Parallel Performance Analysis, 19.08., 2007

07341 Executive Summary - Code Instrumentation and Modeling for Parallel Performance Analysis.
Proceedings of the Code Instrumentation and Modeling for Parallel Performance Analysis, 19.08., 2007

2006
M09 - Program analysis tools for massively parallel applications: how to achieve highest performance.
Proceedings of the ACM/IEEE SC2006 Conference on High Performance Networking and Computing, 2006

S11 - Principles and practice of experimental performance measurement and analysis of parallel applications.
Proceedings of the ACM/IEEE SC2006 Conference on High Performance Networking and Computing, 2006

Performance Tools for Parallel Programming.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2006

Scalable Parallel Trace-Based Performance Analysis.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2006

Integrated Runtime Measurement Summarisation and Selective Event Tracing for Scalable Parallel Execution Performance Diagnosis.
Proceedings of the Applied Parallel Computing. State of the Art in Scientific Computing, 2006

A Parallel Trace-Data Interface for Scalable Performance Analysis.
Proceedings of the Applied Parallel Computing. State of the Art in Scientific Computing, 2006

Performance Instrumentation and Compiler Optimizations for MPI/OpenMP Applications.
Proceedings of the OpenMP Shared Memory Parallel Programming - International Workshops, 2006

Topic 2: Performance Prediction and Evaluation.
Proceedings of the Euro-Par 2006, Parallel Processing, 12th International Euro-Par Conference, Dresden, Germany, August 28, 2006

Specification of Inefficiency Patterns for MPI-2 One-Sided Communication.
Proceedings of the Euro-Par 2006, Parallel Processing, 12th International Euro-Par Conference, Dresden, Germany, August 28, 2006

Large Event Traces in Parallel Performance Analysis.
Proceedings of the ARCS 2006, 2006

2005
A Scalable Approach to MPI Application Performance Analysis.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2005

Holistic Hardware Counter Performance Analysis of Parallel Programs.
Proceedings of the Parallel Computing: Current & Future Issues of High-End Computing, 2005

Performance Analysis of One-sided Communication Mechanisms.
Proceedings of the Parallel Computing: Current & Future Issues of High-End Computing, 2005

Performance Analysis and Visualization of the N-Body Tree Code PEPC on Massively Parallel Computers.
Proceedings of the Parallel Computing: Current & Future Issues of High-End Computing, 2005

Performance Analysis of Large-Scale OpenMP and Hybrid MPI/OpenMP Applications with Vampir NG.
Proceedings of the OpenMP Shared Memory Parallel Programming - International Workshops, 2005

Automatic Experimental Analysis of Communication Patterns in Virtual Topologies.
Proceedings of the 34th International Conference on Parallel Processing (ICPP 2005), 2005

A Performance Measurement Infrastructure for Co-array Fortran.
Proceedings of the Euro-Par 2005, Parallel Processing, 11th International Euro-Par Conference, Lisbon, Portugal, August 30, 2005

Event-Based Measurement and Analysis of One-Sided Communication.
Proceedings of the Euro-Par 2005, Parallel Processing, 11th International Euro-Par Conference, Lisbon, Portugal, August 30, 2005

2004
Efficient Pattern Search in Large Traces Through Successive Refinement.
Proceedings of the Euro-Par 2004 Parallel Processing, 2004

Profiling and Tracing OpenMP Applications with POMP Based Monitoring Libraries.
Proceedings of the Euro-Par 2004 Parallel Processing, 2004

Evaluating OpenMP Performance Analysis Tools with the APART Test Suite.
Proceedings of the Euro-Par 2004 Parallel Processing, 2004

2003
Automatic performance analysis of hybrid MPI/OpenMP applications.
J. Syst. Archit., 2003

Hardware-Counter Based Automatic Performance Analysis of Parallel Programs.
Proceedings of the Parallel Computing: Software Technology, 2003

Initial Design of a Test Suite for Automatic Performance Analysis Tools.
Proceedings of the Eighth International Workshop on High-Level Parallel Programming Models and Supportive Environments (HIPS'03), 2003

KOJAK - A Tool Set for Automatic Performance Analysis of Parallel Programs.
Proceedings of the Euro-Par 2003. Parallel Processing, 2003

2002
Design and Prototype of a Performance Tool Interface for OpenMP.
J. Supercomput., 2002

2001
Specifying Performance Properties of Parallel Applications Using Compound Events.
Parallel Distributed Comput. Pract., 2001

Topic 02: Performance Evaluation and Prediction.
Proceedings of the Euro-Par 2001: Parallel Processing, 2001

2000
A Tool Framework for Static and Dynamic Analysis of Object-Oriented Software with Templates.
Proceedings of the Proceedings Supercomputing 2000, 2000

Automatic Performance Analysis of MPI Applications Based on Event Traces.
Proceedings of the Euro-Par 2000, Parallel Processing, 6th International Euro-Par Conference, Munich, Germany, August 29, 2000

1999
Performance analysis on CRAY T3E.
Proceedings of the Seventh Euromicro Workshop on Parallel and Distributed Processing. PDP'99, 1999

EARL - A Programmable and Extensible Toolkit for Analyzing Event Traces of Message Passing Programs.
Proceedings of the High-Performance Computing and Networking, 7th International Conference, 1999

Parallel / High-Performance Object-Oriented Scientific Computing.
Proceedings of the Object-Oriented Technology, ECOOP'99 Workshop Reader, 1999

Requirements/ Issues for (Performance) Tools for C++ and Java.
Proceedings of the Object-Oriented Technology, ECOOP'99 Workshop Reader, 1999

1995
Speedy: An Integrated Performance Extrapolation Tool for pC++ Programs.
Proceedings of the Quantitative Evaluation of Computing and Communication Systems, 1995

Messung und Modellierung paralleler und verteilter Rechensysteme
Vieweg+Teubner Verlag, ISBN: 978-3-519-02144-5, 1995

1994
Distributed Performance Monitoring: Methods, Tools, and Applications.
IEEE Trans. Parallel Distributed Syst., 1994

Performance Analysis of pC++: A Portable Data-Parallel Programming System for Scalable Parallel Computers.
Proceedings of the 8th International Symposium on Parallel Processing, 1994

TAU: A Portable Parallel Program Analysis Environment for pC++.
Proceedings of the Parallel Processing: CONPAR 94, 1994

1993
Implementing a parallel C++ runtime system for scalable parallel systems.
Proceedings of the Proceedings Supercomputing '93, 1993

1992
Ereignisbasierte Rechneranalysesysteme zur Bewertung paralleler und verteilter Systeme.
PhD thesis, 1992

1991
SIMPLE: A Performance Evaluation Tool Environment for Parallel and Distributed Systems.
Proceedings of the Distributed Memory Computing, 2nd European Conference, 1991

1990
Integrating Monitoring and Modeling to a Performance Evaluation Methodology.
Proceedings of the Entwurf und Betrieb verteilter Systeme, 1990

Performance Evaluation of Parallel Programs in Parallel and Distributed Systems.
Proceedings of the CONPAR 90, 1990

1987
ZÄHLMONITOR 4: Ein Monitorsystem für das Hardware- und Hybrid-Monitoring von Multiprozessor- und Multicomputer-Systemen.
Proceedings of the Messung, Modellierung und Bewertung von Rechensystemen, 4. GI/ITG-Fachtagung, Erlangen, 29. September, 1987


  Loading...