Hartmut Kaiser

Orcid: 0000-0002-8712-2806

According to our database1, Hartmut Kaiser authored at least 92 papers between 2005 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Simulating stellar merger using HPX/Kokkos on A64FX on Supercomputer Fugaku.
J. Supercomput., August, 2024

Shared Memory Parallelism in Modern C++ and HPX.
SN Comput. Sci., June, 2024

Distributed astrophysics simulations using Octo-Tiger with RISC-V CPUs using HPX and Kokkos.
CoRR, 2024

Evaluating AI-generated code for C++, Fortran, Go, Java, Julia, Matlab, Python, R, and Rust.
CoRR, 2024

Storm Surge Modeling in the AI ERA: Using LSTM-based Machine Learning for Enhancing Forecasting Accuracy.
CoRR, 2024

HPX - An open source C++ Standard Library for Parallelism and Concurrency.
CoRR, 2024

Accelerating LULESH using HPX - the C++ Standard Library for Parallelism and Concurrency.
Proceedings of the Practice and Experience in Advanced Research Computing 2024: Human Powered Computing, 2024

HPX with Spack and Singularity Containers: Evaluating Overheads for HPX/Kokkos Using an Astrophysics Application.
Proceedings of the Asynchronous Many-Task Systems and Applications, 2024

Parallel C++ - Efficient and Scalable High-Performance Parallel Programming Using HPX
Springer, ISBN: 978-3-031-54368-5, 2024

2023
HPX+LCI PAW-ATM23 Artifact Archive.
Dataset, September, 2023

Traveler: Navigating Task Parallel Traces for Performance Analysis.
IEEE Trans. Vis. Comput. Graph., 2023

Design and Analysis of the Network Software Stack of an Asynchronous Many-task System - The LCI parcelport of HPX.
Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023

Evaluating HPX and Kokkos on RISC-V using an astrophysics application Octo-Tiger.
Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023

Stellar Mergers with HPX-Kokkos and SYCL: Methods of using an Asynchronous Many-Task Runtime System with SYCL.
Proceedings of the 2023 International Workshop on OpenCL, 2023

Benchmarking the Parallel 1D Heat Equation Solver in Chapel, Charm++, C++, HPX, Go, Julia, Python, Rust, Swift, and Java.
Proceedings of the Euro-Par 2023: Parallel Processing Workshops - Euro-Par 2023 International Workshops, Limassol, Cyprus, August 28, 2023

2022
Distributed, combined CPU and GPU profiling within HPX using APEX.
CoRR, 2022

From Task-Based GPU Work Aggregation to Stellar Mergers: Turning Fine-Grained CPU Tasks into Portable GPU Kernels.
Proceedings of the IEEE/ACM International Workshop on Performance, 2022

Quantifying Overheads in Charm++ and HPX Using Task Bench.
Proceedings of the Euro-Par 2022: Parallel Processing Workshops, 2022

Halide Code Generation Framework in Phylanx.
Proceedings of the Euro-Par 2022: Parallel Processing Workshops, 2022

From Merging Frameworks to Merging Stars: Experiences using HPX, Kokkos and SIMD Types.
Proceedings of the 7th IEEE/ACM International Workshop on Extreme Scale Programming Models and Middleware, 2022

Towards superior software portability with SHAD and HPX C++ libraries.
Proceedings of the CF '22: 19th ACM International Conference on Computing Frontiers, Turin, Italy, May 17, 2022

2021
Performance Measurements Within Asynchronous Task-Based Runtime Systems: A Double White Dwarf Merger as an Application.
Comput. Sci. Eng., 2021

Octo-Tiger's New Hydro Module and Performance Using HPX+CUDA on ORNL's Summit.
CoRR, 2021

Memory reduction using a ring abstraction over GPU RDMA for distributed quantum Monte Carlo solver.
Proceedings of the PASC '21: Platform for Advanced Scientific Computing Conference, 2021

Beyond Fork-Join: Integration of Performance Portable Kokkos Kernels with HPX.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium Workshops, 2021

Understanding the Effect of Task Granularity on Execution Time in Asynchronous Many-Task Runtime Systems.
Proceedings of the Euro-Par 2021: Parallel Processing Workshops, 2021

Parallel SIMD - A Policy Based Solution for Free Speed-Up using C++ Data-Parallel Types.
Proceedings of the 6th IEEE/ACM International Workshop on Extreme Scale Programming Models and Middleware, 2021

Octo-Tiger's New Hydro Module and Performance Using HPX+CUDA on ORNL's Summit.
Proceedings of the IEEE International Conference on Cluster Computing, 2021

2020
HPX - The C++ Standard Library for Parallelism and Concurrency.
J. Open Source Softw., 2020

Implementing Software Resiliency in HPX for Extreme Scale Computing.
CoRR, 2020

Supporting OpenMP 5.0 Tasks in hpxMP - A study of an OpenMP implementation within Task Based Runtime Systems.
CoRR, 2020

JetLag: An Interactive, Asynchronous Array Computing Environment.
Proceedings of the PEARC '20: Practice and Experience in Advanced Research Computing, 2020

Performance Analysis of a Quantum Monte Carlo Application on Multiple Hardware Architectures Using the HPX Runtime.
Proceedings of the 11th IEEE/ACM Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems, 2020

Towards a Scalable and Distributed Infrastructure for Deep Learning Applications.
Proceedings of the Fourth IEEE/ACM Workshop on Deep Learning on Supercomputers, 2020

Towards Distributed Software Resilience in Asynchronous Many- Task Programming Models.
Proceedings of the 10th IEEE/ACM Workshop on Fault Tolerance for HPC at eXtreme Scale, 2020

Deploying a Task-based Runtime System on Raspberry Pi Clusters.
Proceedings of the 5th IEEE/ACM International Workshop on Extreme Scale Programming Models and Middleware, 2020

Distributed Asynchronous Array Computing with the JetLag Environment.
Proceedings of the 9th IEEE/ACM Workshop on Python for High-Performance and Scientific Computing, 2020

2019
Performance Comparison of HPX Versus Traditional Parallelization Strategies for the Discontinuous Galerkin Method.
J. Sci. Comput., 2019

Harnessing billions of tasks for a scalable portable hydrodynamic simulation of the merger of two stars.
Int. J. High Perform. Comput. Appl., 2019

An Introduction to hpxMP - A Modern OpenMP Implementation Leveraging Asynchronous Many-Tasking System.
CoRR, 2019

Scheduling Optimization of Parallel Linear Algebra Algorithms Using Supervised Learning.
Proceedings of the 2019 IEEE/ACM Workshop on Machine Learning in High Performance Computing Environments, 2019

From piz daint to the stars: simulation of stellar mergers using high-level abstractions.
Proceedings of the International Conference for High Performance Computing, 2019

Assessing the Performance Impact of using an Active Global Address Space in HPX: A Case for AGAS.
Proceedings of the IEEE/ACM Third Annual Workshop on Emerging Parallel and Distributed Runtime Systems and Middleware, 2019

An Introduction to hpxMP: A Modern OpenMP Implementation Leveraging HPX, An Asynchronous Many-Task System.
Proceedings of the International Workshop on OpenCL, 2019

Runtime Adaptive Task Inlining on Asynchronous Multitasking Runtime Systems.
Proceedings of the 48th International Conference on Parallel Processing, 2019

2018
Implementation of Peridynamics utilizing HPX - the C++ standard library for parallelism and concurrency.
CoRR, 2018

Asynchronous Execution of Python Code on Task-Based Runtime Systems.
Proceedings of the 4th International Workshop on Extreme Scale Programming Models and Middleware, 2018

Integration of CUDA Processing within the C++ Library for Parallelism and Concurrency (HPX).
Proceedings of the 4th International Workshop on Extreme Scale Programming Models and Middleware, 2018

Accelerating Octo-Tiger: Stellar Mergers on Intel Knights Landing with HPX.
Proceedings of the International Workshop on OpenCL, 2018

Methodology for Adaptive Active Message Coalescing in Task Based Runtime Systems.
Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium Workshops, 2018

2017
HPX Smart Executors.
Proceedings of the Third International Workshop on Extreme Scale Programming Models and Middleware, 2017

Using SYCL as an Implementation Framework for HPX.Compute.
Proceedings of the 5th International Workshop on OpenCL, 2017

Redesigning OP2 Compiler to Use HPX Runtime Asynchronous Techniques.
Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium Workshops, 2017

A Load-Balanced Parallel and Distributed Sorting Algorithm Implemented with PGX.D.
Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium Workshops, 2017

Improving the Parallel Performance of an NBody Application Using Adaptive Techniques in HPX.
Proceedings of the 19th IEEE International Conference on High Performance Computing and Communications; 15th IEEE International Conference on Smart City; 3rd IEEE International Conference on Data Science and Systems, 2017

2016
Automatic Task-Based Code Generation for High Performance Domain Specific Embedded Language.
Int. J. Parallel Program., 2016

Closing the Performance Gap with Modern C++.
Proceedings of the High Performance Computing, 2016

A Massively Parallel Distributed N-body Application Implemented with HPX.
Proceedings of the 7th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems, 2016

Extending C++ with co-array semantics.
Proceedings of the 3rd ACM SIGPLAN International Workshop on Libraries, 2016

Using Intrinsic Performance Counters to Assess Efficiency in Task-Based Parallel Applications.
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium Workshops, 2016

Using HPX and OP2 for Improving Parallel Scaling Performance of Unstructured Grid Applications.
Proceedings of the 45th International Conference on Parallel Processing Workshops, 2016

2015
An Autonomic Performance Environment for Exascale.
Supercomput. Front. Innov., 2015

Higher-level parallelization for local and distributed asynchronous task-based programming.
Proceedings of the First International Workshop on Extreme Scale Programming Models and Middleware, 2015

A Non-intrusive Technique for Interfacing Legacy Fortran Codes with Modern C++ Runtime Systems.
Proceedings of the Third International Symposium on Computing and Networking, 2015

The Performance Implication of Task Size for Applications on the HPX Runtime System.
Proceedings of the 2015 IEEE International Conference on Cluster Computing, 2015

2014
HPX: A Task Based Programming Model in a Global Address Space.
Proceedings of the 8th International Conference on Partitioned Global Address Space Programming Models, 2014

PXFS: A persistent storage model for extreme Scale.
Proceedings of the International Conference on Computing, Networking and Communications, 2014

2013
Application of the ParalleX execution model to stencil-based problems.
Comput. Sci. Res. Dev., 2013

Using HPX and LibGeoDecomp for scaling HPC applications on heterogeneous supercomputers.
Proceedings of the Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems, 2013

Tabulated Equations of State with a Many-tasking Execution Model.
Proceedings of the 2013 IEEE International Symposium on Parallel & Distributed Processing, 2013

An early prototype of an autonomic performance environment for exascale.
Proceedings of the 3rd International Workshop on Runtime and Operating Systems for Supercomputers, 2013

2012
Urgent Computing of Storm Surge for North Carolina's Coast.
Proceedings of the International Conference on Computational Science, 2012

Improving the scalability of parallel <i>N</i>-body applications with an event-driven constraint-based execution model.
Int. J. High Perform. Comput. Appl., 2012

Neutron Star Evolutions using Tabulated Equations of State with a New Execution Model
CoRR, 2012

2011
Preliminary design examination of the ParalleX system from a software and hardware perspective.
SIGMETRICS Perform. Evaluation Rev., 2011

Adaptive Mesh Refinement for Astrophysics Applications with ParalleX
CoRR, 2011

An Application Driven Analysis of the ParalleX Execution Model
CoRR, 2011

Improving the scalability of parallel N-body applications with an event driven constraint based execution model
CoRR, 2011

2010
What Is the Price of Simplicity? - A Cross-Platform Evaluation of the SAGA API.
Proceedings of the Euro-Par 2010 - Parallel Processing, 16th International Euro-Par Conference, Ischia, Italy, August 31, 2010

2009
ParalleX.
Proceedings of the ICPPW 2009, 2009

Programming Abstractions for Data Intensive Computing on Clouds and Grids.
Proceedings of the 9th IEEE/ACM International Symposium on Cluster Computing and the Grid, 2009

2008
Towards an integrated GIS-based coastal forecast workflow.
Concurr. Comput. Pract. Exp., 2008

Near realtime visualization of coastal modelling results with WMS and Google Maps.
Proceedings of the 15th ACM Mardi Gras conference: From lightweight mash-ups to lambda grids: Understanding the spectrum of distributed computing requirements, applications, tools, infrastructures, interoperability, and the incremental adoption of key capabilities, Baton Rouge, Louisiana, USA, January 29, 2008

2007
Grid Interoperability at the Application Level Using SAGA.
Proceedings of the Third International Conference on e-Science and Grid Computing, 2007

Design and Implementation of Network Performance Aware Applications Using SAGA and Cactus.
Proceedings of the Third International Conference on e-Science and Grid Computing, 2007

2006
Distributed and collaborative visualization of large data sets using high-speed networks.
Future Gener. Comput. Syst., 2006

Poster reception - Utilizing grid computing technologies for advanced reservoir studies.
Proceedings of the ACM/IEEE SC2006 Conference on High Performance Networking and Computing, 2006

Poster reception - The SAGA C++ reference implementation: a milestone toward new high-level grid applications.
Proceedings of the ACM/IEEE SC2006 Conference on High Performance Networking and Computing, 2006

Seamless Integration of Generic Bulk Operations in Grid Applications.
Proceedings of the On the Move to Meaningful Internet Systems 2006: OTM 2006 Workshops, 2006

Generic support for bulk operations in grid applications.
Proceedings of the 4th international workshop on Middleware for grid computing, 2006

Versioning and Consistency in Replica Systems.
Proceedings of the Frontiers of High Performance Computing and Networking, 2006

2005
The Grid Application Toolkit: Toward Generic and Easy Application Programming Interfaces for the Grid.
Proc. IEEE, 2005


  Loading...