Thomas Rauber

Orcid: 0000-0002-3102-6858

Affiliations:
  • Universität Bayreuth, Institut für Informatik, Fakultät für Mathematik, Physik und Informatik, Germany


According to our database, Thomas Rauber authored at least 216 papers between 1990 and 2024.

Bibliography

2024
Pareto-Optimal Execution of Parallel Applications with Respect to Time and Energy.
Proceedings of the 13th International Conference on Smart Cities and Green ICT Systems, 2024

Message from the PDSEC-24 Workshop Chairs.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2024

2023
Message from the PDSEC-22 Workshop Chairs.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2023

Performance and Energy Evaluation for Solving a Schrödinger-Poisson System on Multicore Processors.
Proceedings of the Computer Performance Engineering and Stochastic Modelling, 2023

Parallel Programming - for Multicore and Cluster Systems, Third Edition
Springer, ISBN: 978-3-031-28923-1, 2023

2021
Autotuning based on frequency scaling toward energy efficiency of blockchain algorithms on graphics processing units.
J. Supercomput., 2021

A performance- and energy-oriented extended tuning process for time-step-based scientific applications.
J. Supercomput., 2021

Modeling the effect of application-specific program transformations on energy and performance improvements of parallel ODE solvers.
J. Comput. Sci., 2021

Data-driven Full-waveform Inversion Surrogate using Conditional Generative Adversarial Networks.
Proceedings of the International Joint Conference on Neural Networks, 2021

YaskSite: Stencil Optimization Techniques Applied to Explicit ODE Methods on Modern Architectures.
Proceedings of the IEEE/ACM International Symposium on Code Generation and Optimization, 2021

2020
Offsite Autotuning Approach - Performance Model Driven Autotuning Applied to Parallel Explicit ODE Methods.
Proceedings of the High Performance Computing - 35th International Conference, 2020

A Parameter Selection Process by Data Analysis for Tuning Multi-threaded Time-Stepping Algorithms.
Proceedings of the 2020 Seventh International Conference on Software Defined Systems, 2020

Workshop 13: PDSEC Parallel and Distributed Scientific and Engineering Computing.
Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium Workshops, 2020

2019
Model-based optimization of the energy efficiency of multi-threaded applications.
Sustain. Comput. Informatics Syst., 2019

A scheduling selection process for energy-efficient task execution on DVFS processors.
Concurr. Comput. Pract. Exp., 2019

Performance Prediction of Explicit ODE Methods on Multi-Core Cluster Systems.
Proceedings of the 2019 ACM/SPEC International Conference on Performance Engineering, 2019

Multiprocessor Task Programming and Flexible Load Balancing for Time-Stepping Methods on Heterogeneous Cloud Infrastructures.
Proceedings of the 2019 IEEE SmartWorld, 2019

Enabling Scalability, Adaptivity, and Resilience in Cloud Applications by Software-defined M-Task-based Programming.
Proceedings of the 6th International Conference on Software Defined Systems, 2019

DVFS RK: Performance and Energy Modeling of Frequency-Scaled Multithreaded Runge-Kutta Methods.
Proceedings of the 27th Euromicro International Conference on Parallel, 2019

Introduction to PDSEC-19.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium Workshops, 2019

On the Energy Consumption and Accuracy of Multithreaded Embedded Runge-Kutta Methods.
Proceedings of the 17th International Conference on High Performance Computing & Simulation, 2019


2018
Performance and energy metrics for multi-threaded applications on DVFS processors.
Sustain. Comput. Informatics Syst., 2018

Applicability of the ECM Performance Model to Explicit ODE Methods on Current Multi-core Processors.
Proceedings of the High Performance Computing - 33rd International Conference, 2018

Exploring Self-Adaptivity Towards Performance and Energy for Time-Stepping Methods.
Proceedings of the 30th International Symposium on Computer Architecture and High Performance Computing, 2018

How do Loop Transformations Affect the Energy Consumption of Multi-Threaded Runge-Kutta Methods?
Proceedings of the 26th Euromicro International Conference on Parallel, 2018

Execution Behavior Analysis of Parallel Schemes for Implicit Solution Methods for ODEs.
Proceedings of the 17th International Symposium on Parallel and Distributed Computing, 2018

Introduction to PDSEC 2018 and Keynotes.
Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium Workshops, 2018

Energy and Performance Improvement of Parallel ODE Solvers by Application-Specific Program Transformations.
Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium Workshops, 2018

On the Autotuning Potential of Time-stepping methods from Scientific Computing.
Proceedings of the 2018 Federated Conference on Computer Science and Information Systems, 2018

2017
Comparison of Time and Energy Oriented Scheduling for Task-Based Programs.
Proceedings of the Parallel Processing and Applied Mathematics, 2017

Towards New Metrics for Appraising Performance and Energy Efficiency of Parallel Scientific Programs.
Proceedings of the 2017 IEEE International Conference on Internet of Things (iThings) and IEEE Green Computing and Communications (GreenCom) and IEEE Cyber, 2017

Tuning Energy Effort and Execution Time of Application Software.
Proceedings of the 38th International Conference on Information Systems Architecture and Technology - ISAT 2017, 2017

Introduction to PDSEC Workshop.
Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium Workshops, 2017

2016
PDSEC Introduction and Committees.
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium Workshops, 2016

Influence of Locality on the Scalability of Method- and System-Parallel Explicit Peer Methods.
Proceedings of the 2016 Federated Conference on Computer Science and Information Systems, 2016

2015
Applications for ultrascale computing.
Supercomput. Front. Innov., 2015

Energy-efficient Algorithms for Ultrascale Systems.
Supercomput. Front. Innov., 2015

MAP: a cloud-based middleware for the provision of fine-grained compute services for mobile streaming image processing applications.
Int. J. Cloud Comput., 2015

Modeling and analyzing the energy consumption of fork-join-based task parallel programs.
Concurr. Comput. Pract. Exp., 2015

PDSEC Introduction and Committees.
Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium Workshop, 2015

2014
Energy measurement, modeling, and prediction for processors with frequency scaling.
J. Supercomput., 2014

Online auto-tuning for the time-step-based parallel solution of ODEs on shared-memory systems.
J. Parallel Distributed Comput., 2014

Energy measurement and prediction for multi-threaded programs.
Proceedings of the 2014 Spring Simulation Multiconference, 2014

PDSEC Introduction and Committees.
Proceedings of the 2014 IEEE International Parallel & Distributed Processing Symposium Workshops, 2014

2013
Programming support and scheduling for communicating parallel tasks.
J. Parallel Distributed Comput., 2013

PDSEC Introduction.
Proceedings of the 2013 IEEE International Symposium on Parallel & Distributed Processing, 2013

MAP: Mobile Assistance Platform with a VM Type Selection Ability.
Proceedings of the 19th IEEE International Conference on Parallel and Distributed Systems, 2013

Parallel Programming - for Multicore and Cluster Systems; 2nd Edition.
Springer, ISBN: 978-3-642-37800-3, 2013

2012
Combined scheduling and mapping for scalable computing with parallel tasks.
Sci. Program., 2012

An execution environment for flexible task-oriented software on multicore systems.
Concurr. Eng. Res. Appl., 2012

Analytical modeling and simulation of the energy consumption of independent tasks.
Proceedings of the Winter Simulation Conference, 2012

Energy-Aware Execution of Fork-Join-Based Task Parallelism.
Proceedings of the 20th IEEE International Symposium on Modeling, 2012

Go's Concurrency Constructs on the SCC.
Proceedings of the 6th Many-core Applications Research Community (MARC) Symposium, 2012

PDSEC Introduction.
Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium Workshops & PhD Forum, 2012

Towards an Energy Model for Modular Parallel Scientific Applications.
Proceedings of the 2012 IEEE International Conference on Green Computing and Communications, 2012

Mobile Cloud Computing in 3G Cellular Networks Using Pipelined Tasks.
Proceedings of the Service-Oriented and Cloud Computing - First European Conference, 2012

Resource Allocation for Cloud-Assisted Mobile Applications.
Proceedings of the 2012 IEEE Fifth International Conference on Cloud Computing, 2012

Parallele Programmierung, 3. Auflage.
eXamen.press, Springer, ISBN: 978-3-642-13603-0, 2012

2011
Dynamic selection of implementation variants of sequential iterated Runge-Kutta methods with tile size sampling (abstracts only).
SIGMETRICS Perform. Evaluation Rev., 2011

An efficient time-step-based self-adaptive algorithm for predictor-corrector methods of Runge-Kutta type.
J. Comput. Appl. Math., 2011

Multicore.
it Inf. Technol., 2011

Adaptive Task Pools: Efficiently Balancing Large Number of Tasks on Shared-address Spaces.
Int. J. Parallel Program., 2011

Parallel Low-Storage Runge-Kutta Solvers for ODE Systems with Limited Access Distance.
Int. J. High Perform. Comput. Appl., 2011

Scalability and locality of extrapolation methods on large parallel systems.
Concurr. Comput. Pract. Exp., 2011

Memory-optimal evaluation of expression trees involving large objects.
Comput. Lang. Syst. Struct., 2011

Dynamic selection of implementation variants of sequential iterated Runge-Kutta methods with tile size sampling.
Proceedings of the ICPE'11, 2011

Modeling the energy consumption for concurrent executions of parallel tasks.
Proceedings of the 2011 Spring Simulation Multi-conference, 2011

Component-based programming techniques for coarse-grained parallelism.
Proceedings of the 2011 Spring Simulation Multi-conference, 2011

Fast approximation algorithms for scheduling independent multiprocessor tasks.
Proceedings of the 2011 Spring Simulation Multi-conference, 2011

Task Parallelism on the SCC.
Proceedings of the 3rd Many-core Applications Research Community (MARC) Symposium, 2011

Scheduling Support for Communicating Parallel Tasks.
Proceedings of the Languages and Compilers for Parallel Computing, 2011

Semi-dynamic Scheduling of Parallel Tasks for Heterogeneous Clusters.
Proceedings of the 10th International Symposium on Parallel and Distributed Computing, 2011

PDSEC Introduction.
Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011

Memory-Intensive Applications on a Many-Core Processor.
Proceedings of the 13th IEEE International Conference on High Performance Computing & Communication, 2011

2010
Dynamic Task Scheduling and Load Balancing on Cell Processors.
Proceedings of the 18th Euromicro Conference on Parallel, 2010

Storage space reduction for the solution of systems of ordinary differential equations by pipelining and overlapping of vectors.
Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

Message from the PDSEC-10 workshop chairs.
Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

Mixed-Parallel Implementations of Extrapolation Methods with Reduced Synchronization Overhead for Large Shared-Memory Computers.
Proceedings of the 16th IEEE International Conference on Parallel and Distributed Systems, 2010

Adaptive Execution of Software Systems on Parallel Multicore Architectures.
Proceedings of the ICEIS 2010 - Proceedings of the 12th International Conference on Enterprise Information Systems, Volume 3, ISAS, Funchal, Madeira, Portugal, June 8, 2010

Scalability and Locality of Extrapolation Methods for Distributed-Memory Architectures.
Proceedings of the Euro-Par 2010 - Parallel Processing, 16th International Euro-Par Conference, Ischia, Italy, August 31, 2010

Theory and Algorithms for Parallel Computation.
Proceedings of the Euro-Par 2010 - Parallel Processing, 16th International Euro-Par Conference, Ischia, Italy, August 31, 2010

Exploiting Fine-Grained Parallelism on Cell Processors.
Proceedings of the Euro-Par 2010 - Parallel Processing, 16th International Euro-Par Conference, Ischia, Italy, August 31, 2010

Combining Object-Oriented Design and SOA with Remote Objects over Web Services.
Proceedings of the 8th IEEE European Conference on Web Services (ECOWS 2010), 2010

Software Architectures for Flexible Task-Oriented Program Execution on Multicore Systems.
Proceedings of the Complex Systems Design & Management, 2010

BPEL Remote Objects: Integrating BPEL Processes into Object-Oriented Applications.
Proceedings of the 2010 IEEE International Conference on Services Computing, 2010

Parallel Programming - for Multicore and Cluster Systems.
Springer, ISBN: 978-3-642-04817-3, 2010

2009
Softwaremodernisierung durch werkzeugunterstütztes Verschieben von Codeblöcken.
Proceedings of the Software Engineering 2009, 2009

Scalable computing with parallel tasks.
Proceedings of the 2nd Workshop on Many-Task Computing on Grids and Supercomputers, 2009

Parallelization Strategies for ODE Solvers on Multicore Cluster Systems.
Proceedings of the Parallel Computing: From Multicores and GPU's to Petascale, 2009

Message from the PDSEC-09 workshop chairs.
Proceedings of the 23rd IEEE International Symposium on Parallel and Distributed Processing, 2009

Reducing the Class Coupling of Legacy Code by a Metrics-Based Relocation of Class Members.
Proceedings of the Advances in Software Engineering Techniques, 2009

Load Balancing Concurrent BPEL Processes by Dynamic Selection of Web Service Endpoints.
Proceedings of the ICPPW 2009, 2009

Scalability of Time- and Space-Efficient Embedded Runge-Kutta Solvers for Distributed Address Space.
Proceedings of the ICPP 2009, 2009

Pattern-Based Refactoring of Legacy Software Systems.
Proceedings of the Enterprise Information Systems, 11th International Conference, 2009

Parallel Implementation of Runge-Kutta Integrators with Low Storage Requirements.
Proceedings of the Euro-Par 2009 Parallel Processing, 2009

2008
Combining building blocks for parallel multi-level matrix multiplication.
Parallel Comput., 2008

An adaptive extension library for improving collective communication operations.
Concurr. Comput. Pract. Exp., 2008

Inkrementelle Transformation einer monolithischen Geschäftssoftware.
Proceedings of the Software Engineering 2008, 2008

A Transformation Framework for Communicating Multiprocessor-Tasks.
Proceedings of the 16th Euromicro International Conference on Parallel, 2008

Mapping Algorithms for Multiprocessor Tasks on Multi-Core Clusters.
Proceedings of the 2008 International Conference on Parallel Processing, 2008

Models for Parallel Workflow Processing on Multi-Core Architectures.
Proceedings of the ICEIS 2008, 2008

Fine-Grained Task Scheduling Using Adaptive Data Structures.
Proceedings of the Euro-Par 2008, 2008

Transformation of Legacy Software into Client/Server Applications through Pattern-Based Rearchitecturing.
Proceedings of the 32nd Annual IEEE International Computer Software and Applications Conference, 2008

Redistribution aware two-step scheduling for mixed-parallel applications.
Proceedings of the 2008 IEEE International Conference on Cluster Computing, 29 September, 2008

Scheduling Dynamic Workflows onto Clusters of Clusters using Postponing.
Proceedings of the 8th IEEE International Symposium on Cluster Computing and the Grid (CCGrid 2008), 2008

2007
Mixed task and data parallel executions in general linear methods.
Sci. Program., 2007

Communicating Multiprocessor-Tasks.
Proceedings of the Languages and Compilers for Parallel Computing, 2007

Incremental Transformation of Business Software.
Proceedings of the Enterprise Information Systems, 9th International Conference, 2007

Transformation of legacy business software into client-server architectures.
Proceedings of the ICEIS 2007, 2007

Locality Optimized Shared-Memory Implementations of Iterated Runge-Kutta Methods.
Proceedings of the Euro-Par 2007, 2007

Profiling of Task-Based Applications on Shared Memory Machines: Scalability and Bottlenecks.
Proceedings of the Euro-Par 2007, 2007

Sequential and parallel implementation of a constraint-based algorithm for searching protein structures.
Proceedings of the 2007 IEEE International Conference on Cluster Computing, 2007

Dynamic scheduling of multi-processor tasks on clusters of clusters.
Proceedings of the 2007 IEEE International Conference on Cluster Computing, 2007

Trace-based Automatic Padding for Locality Improvement with Correlative Data Visualization Interface.
Proceedings of the 16th International Conference on Parallel Architectures and Compilation Techniques (PACT 2007), 2007

Parallele Programmierung, 2. Auflage.
eXamen.press, Springer, ISBN: 978-3-540-46549-2, 2007

2006
Optimizing locality and scalability of embedded Runge-Kutta solvers using block-based pipelining.
J. Parallel Distributed Comput., 2006

A Data re-distribution Library for Multi-processor Task Programming.
Int. J. Found. Comput. Sci., 2006

Optimizing MPI collective communication by orthogonal structures.
Clust. Comput., 2006

RCM - A Multi-Layered Reconfigurable Cluster Middleware.
Proceedings of the 14th Euromicro International Conference on Parallel, 2006

Design and Evaluation of a Parallel Data Redistribution Component for TGrid.
Proceedings of the Parallel and Distributed Processing and Applications, 2006

Combining Measures for Temporal and Spatial Locality.
Proceedings of the Frontiers of High Performance Computing and Networking, 2006

Anticipated distributed task scheduling for grid environments.
Proceedings of the 20th International Parallel and Distributed Processing Symposium (IPDPS 2006), 2006

A decomposition approach for optimizing the performance of MPI libraries.
Proceedings of the 20th International Parallel and Distributed Processing Symposium (IPDPS 2006), 2006

Applicability of Load Balancing Strategies to Data-Parallel Embedded Runge-Kutta Integrators.
Proceedings of the Euro-Par 2006, Parallel Processing, 12th International Euro-Par Conference, Dresden, Germany, August 28, 2006

TGrid - Grid runtime support for hierarchically structured task-parallel programs.
Proceedings of the 2006 IEEE International Conference on Cluster Computing, 2006

2005
Tlib - a library to support programming with hierarchical multi-processor tasks.
J. Parallel Distributed Comput., 2005

Adaptive Selection of Communication Methods to Optimize Collective MPI Operations.
Proceedings of the Parallel Computing: Current & Future Issues of High-End Computing, 2005

Reducing the Overhead of Intra-Node Communication in Clusters of SMPs.
Proceedings of the Parallel and Distributed Processing and Applications, 2005

M-Task-Programming for Heterogeneous Systems and Grid Environments.
Proceedings of the 19th International Parallel and Distributed Processing Symposium (IPDPS 2005), 2005

Automatic Tuning of PDGEMM Towards Optimal Performance.
Proceedings of the Euro-Par 2005, Parallel Processing, 11th International Euro-Par Conference, Lisbon, Portugal, August 30, 2005

2004
Improving locality for ODE solvers by program transformations.
Sci. Program., 2004

Program-Based Locality Measures For Scientific Computing.
Int. J. Found. Comput. Sci., 2004

Group-SPMD programming with orthogonal processor groups.
Concurr. Comput. Pract. Exp., 2004

A comparison of task pools for dynamic load balancing of irregular algorithms.
Concurr. Comput. Pract. Exp., 2004

Performance Evaluation of Task Pools Based on Hardware Synchronization.
Proceedings of the ACM/IEEE SC2004 Conference on High Performance Networking and Computing, 2004

Simulation-Based Analysis of Parallel Runge-Kutta Solvers.
Proceedings of the Applied Parallel Computing, 2004

Functional Realization of Coordination Environments for Mixed Parallelism.
Proceedings of the 18th International Parallel and Distributed Processing Symposium (IPDPS 2004), 2004

A Source Code Analyzer for Performance Prediction.
Proceedings of the 18th International Parallel and Distributed Processing Symposium (IPDPS 2004), 2004

Multilevel hierarchical matrix multiplication on clusters.
Proceedings of the 18th Annual International Conference on Supercomputing, 2004

Using Hardware Operations to Reduce the Synchronization Overhead of Task Pools.
Proceedings of the 33rd International Conference on Parallel Processing (ICPP 2004), 2004

Hierarchical Matrix-Matrix Multiplication Based on Multiprocessor Tasks.
Proceedings of the Computational Science, 2004

Execution Schemes for Parallel Adams Methods.
Proceedings of the Euro-Par 2004 Parallel Processing, 2004

Improving the execution time of global communication operations.
Proceedings of the First Conference on Computing Frontiers, 2004

Dynamic Loop Scheduling with Processor Groups.
Proceedings of the ISCA 17th International Conference on Parallel and Distributed Computing Systems, 2004

2003
On Compiler Support for Mixed Task and Data Parallelism.
Proceedings of the Parallel Computing: Software Technology, 2003

Comparison of Parallel Implementations of Runge-Kutta Solvers: Message Passing vs. Threads.
Proceedings of the Parallel Computing: Software Technology, 2003

A Distributed Hierarchical Programming Model for Heterogeneous Cluster of SMPs.
Proceedings of the 17th International Parallel and Distributed Processing Symposium (IPDPS 2003), 2003

Scalable Parallel RK Solvers for ODEs Derived by the Method of Lines.
Proceedings of the Euro-Par 2003. Parallel Processing, 2003

2002
Library support for hierarchical multi-processor tasks.
Proceedings of the 2002 ACM/IEEE conference on Supercomputing, 2002

Workshop Introduction.
Proceedings of the 16th International Parallel and Distributed Processing Symposium (IPDPS 2002), 2002

Selecting Data Distributions for Unbounded Loops.
Proceedings of the 16th International Parallel and Distributed Processing Symposium (IPDPS 2002), 2002

Efficiency of Thread-Parallel Java Programs from Scientific Computing.
Proceedings of the 16th International Parallel and Distributed Processing Symposium (IPDPS 2002), 2002

Evaluation of Task Pools for the Implementation of Parallel Irregular Algorithms.
Proceedings of the 31st International Conference on Parallel Processing Workshops (ICPP 2002 Workshops), 2002

Pipelining for Locality Improvement in RK Methods.
Proceedings of the Euro-Par 2002, 2002

2001
Library support for orthogonal processor groups.
Proceedings of the Thirteenth Annual ACM Symposium on Parallel Algorithms and Architectures, 2001

ORT: a communication library for orthogonal processor groups.
Proceedings of the 2001 ACM/IEEE conference on Supercomputing, 2001

A Hierarchical Computation Model for Distributed Shared-Memory Machines.
Proceedings of the Ninth Euromicro Workshop on Parallel and Distributed Processing, 2001

Cyclic Reduction on Distributed Shared Memory Machines.
Proceedings of the Ninth Euromicro Workshop on Parallel and Distributed Processing, 2001

Optimizing locality for ODE solvers.
Proceedings of the 15th international conference on Supercomputing, 2001

Orthogonal Processor Groups for Message-Passing Programs.
Proceedings of the High-Performance Computing and Networking, 9th International Conference, 2001

2000
A Transformation Approach to Derive Efficient Parallel Implementations.
IEEE Trans. Software Eng., 2000

Deriving Array Distributions by Optimization Techniques.
J. Supercomput., 2000

A Side-Effect-Free Hierarchical Radiosity Algorithm.
Proceedings of the Applied Computing 2000, 2000

Modelling the Cache Performance of Sparse Matrix Computations.
Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications, 2000

Set Operations for Orthogonal Processor Groups.
Proceedings of the Languages and Compilers for Parallel Computing, 2000

Cost Hierarchies for Abstract Parallel Machines.
Proceedings of the Languages and Compilers for Parallel Computing, 2000

Modelling the Runtime of Scientific Programs on Parallel Computers.
Proceedings of the 2000 International Workshop on Parallel Processing, 2000

Parallele und verteilte Programmierung.
Springer, ISBN: 978-3-540-66009-5, 2000

1999
Scalability Analysis for Conservative Simulation of Logical Circuits.
VLSI Design, 1999

Compiler support for task scheduling in hierarchical execution models.
J. Syst. Archit., 1999

Scalability of Sparse Cholesky Factorization.
Int. J. High Speed Comput., 1999

Diagonal-Implicitly Iterated Runge-Kutta Methods on Distributed Memory Machines.
Int. J. High Speed Comput., 1999

Parallel execution of embedded and iterated Runge-Kutta methods.
Concurr. Pract. Exp., 1999

Matrix Computations Behind the Hierarchical Radiosity Method.
Proceedings of the 1999 ACM Symposium on Applied Computing, 1999

A Coordination Language for Mixed Task and Data Parallel Programs.
Proceedings of the 1999 ACM Symposium on Applied Computing, 1999

Scheduling of Data Parallel Modules for Scientific Computing.
Proceedings of the Ninth SIAM Conference on Parallel Processing for Scientific Computing, 1999

Parallel simulation of incompressible fluids.
Proceedings of the Seventh Euromicro Workshop on Parallel and Distributed Processing. PDP'99, 1999

1998
A Shared-Memory Implementation of the Hierarchical Radiosity Method.
Theor. Comput. Sci., 1998

Execution behavior analysis and performance prediction for a shared-memory implementation of an irregular particle simulation method.
Simul. Pract. Theory, 1998

Support for Efficient Programming on the SB-PRAM.
Int. J. Parallel Program., 1998

Modeling the Communication Behavior of Distributed Memory Machines by Genetic Programming.
Proceedings of the Euro-Par '98 Parallel Processing, 1998

1997
Load balancing schemes for extrapolation methods.
Concurr. Pract. Exp., 1997

Integrating library modules into special purpose parallel algorithms.
Proceedings of the International Symposium on Software Engineering for Parallel and Distributed Systems, 1997

Parallel Execution of Embedded Runge-Kutta Methods.
Proceedings of the Parallel Computing: Fundamentals, 1997

Modeling the Communication Behavior of the Intel Paragon.
Proceedings of the MASCOTS 1997, 1997

Scalability of Parallel Sparse Cholesky Factorization.
Proceedings of the Euro-Par '97 Parallel Processing, 1997

1996
Deriving structured parallel implementations for numerical methods.
Microprocess. Microprogramming, 1996

The ADDAP System on the iPSC/860: Automatic Data Distribution and Parallelization.
J. Parallel Distributed Comput., 1996

Parallel Implementations of Iterated Runge-Kutta Methods.
Int. J. High Perform. Comput. Appl., 1996

Scheduling of multiprocessor tasks for numerical applications.
Proceedings of the Eighth IEEE Symposium on Parallel and Distributed Processing, 1996

Conservative Circuit Simulation on Shared-Memory Multiprocessors.
Proceedings of the Tenth Workshop on Parallel and Distributed Simulation, 1996

Shared-Memory Implementation of an Irregular Particle Simulation Method.
Proceedings of the Euro-Par '96 Parallel Processing, 1996

Comparing Task and Data Parallel Execution Schemes for the DIIRK Method.
Proceedings of the Euro-Par '96 Parallel Processing, 1996

Scalability and Granularity Issues of the Hierarchical Radiosity Method.
Proceedings of the Euro-Par '96 Parallel Processing, 1996

The compiler TwoL for the design of parallel implementations.
Proceedings of the Fifth International Conference on Parallel Architectures and Compilation Techniques, 1996

1995
Generating Optimal Contiguous Evaluations for Expression DAGs.
Comput. Lang., 1995

The Programming Environment of the SB-PRAM.
Proceedings of the Seventh IASTED/ISMM International Conference on Parallel and Distributed Computing and Systems, 1995

Iterated Runge-Kutta methods on distributed memory multiprocessors.
Proceedings of the 3rd Euromicro Workshop on Parallel and Distributed Processing (PDP '95), 1995

An Object Oriented Implementation of Distributed Graph-Based Computations.
Proceedings of the Parallel Computing: State-of-the-Art and Perspectives, 1995

Performance predictions for parallel diagonal-implicitly iterated Runge-Kutta methods.
Proceedings of the Ninth Workshop on Parallel and Distributed Simulation, 1995

An application specific parallel programming paradigm.
Proceedings of the High-Performance Computing and Networking, 1995

Parallel solution of a Schrödinger-Poisson system.
Proceedings of the High-Performance Computing and Networking, 1995

Optimal Contiguous Expression DAG Evaluations.
Proceedings of the Fundamentals of Computation Theory, 10th International Symposium, 1995

Optimal Data Distributions for LU Decomposition.
Proceedings of the Euro-Par '95 Parallel Processing, 1995

1994
A Branch-and-bound Algorithm For Array Distributions.
Proceedings of the Second Euromicro Workshop on Parallel and Distributed Processing, 1994

Load Balancing for Extrapolation Methods on Distributed Memory Multiprocessors.
Proceedings of the PARLE '94: Parallel Architectures and Languages Europe, 1994

Hypercube Implementation and Performance Analysis for Extrapolation Models.
Proceedings of the Parallel Processing: CONPAR 94, 1994

Automatic Parallelization for Distributed Memory Multiprocessors.
Proceedings of the Automatic Parallelization: New Approaches to Code Generation, 1994

1993
Efficient Register Allocation for Large Basic Blocks.
Proceedings of the Programming Language Implementation and Logic Programming, 1993

Algorithmen in der Computergraphik.
Leitfäden und Monographien der Informatik, Teubner, ISBN: 978-3-519-02127-8, 1993

1992
The Shared-Memory Language pSather on a Distributed-Memory Multiprocessor.
Proceedings of the 2nd SIGPLAN Workshop on Languages, Compilers, and Run-Time Environments for Distributed Memory Multiprocessors, Boulder, Colorado, September 30, 1992

1991
A Randomized Heuristic Approach to Register Allocation.
Proceedings of the Programming Language Implementation and Logic Programming, 1991

Scheduling Vector Straight Line Code on Vector Processors.
Proceedings of the Code Generation, 1991

1990
Ein Compiler für Vektorrechner mit optimaler Auswertung von vektoriellen Ausdrucksbäumen.
PhD thesis, 1990

Optimal evaluation of vector expression trees.
Proceedings of the Next Decade in Information Technology: Proceedings of the 5th Jerusalem Conference on Information Technology 1990, 1990

