Rudolf Eigenmann

Orcid: 0000-0003-1651-827X

Affiliations:
  • Purdue University, West Lafayette, USA


According to our database1, Rudolf Eigenmann authored at least 160 papers between 1988 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Should AI Optimize Your Code? A Comparative Study of Current Large Language Models Versus Classical Optimizing Compilers.
CoRR, 2024

Best Practices for Developing Computational and Data-Intensive (CDI) Applications.
CoRR, 2024

Recurrence Analysis for Automatic Parallelization of Subscripted Subscripts.
Proceedings of the 29th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, 2024

2023
CaRV - Accelerating Program Optimization through Capture, Replay, Validate.
Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023

2022
A Portal for High-Precision Atomic Data and Computation: Design and Best Practices.
CoRR, 2022

A comparison between Automatically versus Manually Parallelized NAS Benchmarks.
CoRR, 2022

Exchanging Best Practices for Supporting Computational and Data-Intensive Research, The Xpert Network.
Proceedings of the PEARC '22: Practice and Experience in Advanced Research Computing, Boston, MA, USA, July 10, 2022

Learning from Automatically Versus Manually Parallelized NAS Benchmarks.
Proceedings of the Languages and Compilers for Parallel Computing, 2022

2021
Exchanging Best Practices and Tools for Supporting Computational and Data-Intensive Research, The Xpert Network.
CoRR, 2021

iCetus: A Semi-automatic Parallel Programming Assistant.
Proceedings of the Languages and Compilers for Parallel Computing, 2021

On the automatic parallelization of subscripted subscript patterns using array property analysis.
Proceedings of the ICS '21: 2021 International Conference on Supercomputing, 2021

2020
Compile-time Parallelization of Subscripted Subscript Patterns.
Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium Workshops, 2020

2019
Pagoda: A GPU Runtime System for Narrow Tasks.
ACM Trans. Parallel Comput., 2019

RegDem: Increasing GPU Performance via Shared Memory Register Spilling.
CoRR, 2019

Comparative analysis of coprocessors.
Concurr. Comput. Pract. Exp., 2019

Optimizing GPU programs by register demotion: poster.
Proceedings of the 24th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2019

2018
National Strategic Computing Initiative.
Comput. Sci. Eng., 2018

2017
Pagoda: Fine-Grained GPU Resource Virtualization for Narrow Tasks.
Proceedings of the 22nd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2017

HiPA: history-based piecewise approximation for functions.
Proceedings of the International Conference on Supercomputing, 2017

2016
Formalizing Structured Control Flow Graphs.
Proceedings of the Languages and Compilers for Parallel Computing, 2016

Harnessing Parallelism in Multicore Systems to Expedite and Improve Function Approximation.
Proceedings of the Languages and Compilers for Parallel Computing, 2016

POSTER: Pagoda: A Runtime System to Maximize GPU Utilization in Data Parallel Tasks with Limited Parallelism.
Proceedings of the 2016 International Conference on Parallel Architectures and Compilation, 2016

2015
PETRA: Performance Evaluation Tool for Modern Parallelizing Compilers.
Int. J. Parallel Program., 2015

HYDRA : Extending Shared Address Programming for Accelerator Clusters.
Proceedings of the Languages and Compilers for Parallel Computing, 2015

HeteroDoop: A MapReduce Programming System for Accelerator Clusters.
Proceedings of the 24th International Symposium on High-Performance Parallel and Distributed Computing, 2015

2014
Reliable and Efficient Distributed Checkpointing System for Grid Environments.
J. Grid Comput., 2014

\pi Abstraction: Parallelism-Aware Array Data Flow Analysis for OpenMP.
Proceedings of the Languages and Compilers for Parallel Computing, 2014

2013
McrEngine: A scalable checkpointing system using data-aware aggregation and compression.
Sci. Program., 2013

Compiler Infrastructure.
Int. J. Parallel Program., 2013

The Cetus Source-to-Source Compiler Infrastructure: Overview and Evaluation.
Int. J. Parallel Program., 2013

OpenMPC: extended OpenMP for efficient programming and tuning on GPUs.
Int. J. Comput. Sci. Eng., 2013

Scaling large-data computations on multi-GPU accelerators.
Proceedings of the International Conference on Supercomputing, 2013

2012
Portable section-level tuning of compiler parallelized applications.
Proceedings of the SC Conference on High Performance Computing Networking, 2012

A hybrid approach of OpenMP for clusters.
Proceedings of the 17th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2012

Effects of Compiler Optimizations in OpenMP to CUDA Translation.
Proceedings of the OpenMP in a Heterogeneous World - 8th International Workshop on OpenMP, 2012

Topic 11: Multicore and Manycore Programming.
Proceedings of the Euro-Par 2012 Parallel Processing - 18th International Conference, 2012

2011
Polaris.
Proceedings of the Encyclopedia of Parallel Computing, 2011

The NEEShub Cyberinfrastructure for Earthquake Engineering.
Comput. Sci. Eng., 2011

Computational Earthquake and Tsunami Research.
Comput. Sci. Eng., 2011

A Study of the Usefulness of Producer/Consumer Synchronization.
Proceedings of the Languages and Compilers for Parallel Computing, 2011

Automatic Scaling of OpenMP Beyond Shared Memory.
Proceedings of the Languages and Compilers for Parallel Computing, 2011

Performance Analysis and Tuning of Automatically Parallelized OpenMP Applications.
Proceedings of the OpenMP in the Petascale Era - 7th International Workshop on OpenMP, 2011

2010
OpenMPC: Extended OpenMP Programming and Tuning for GPUs.
Proceedings of the Conference on High Performance Computing Networking, 2010

2009
Guest Editors' Introduction.
Int. J. Parallel Program., 2009

Cetus: A Source-to-Source Compiler Infrastructure for Multicores.
Computer, 2009

FALCON: a system for reliable checkpoint recovery in shared grid environments.
Proceedings of the ACM/IEEE Conference on High Performance Computing, 2009

OpenMP to GPGPU: a compiler framework for automatic translation and optimization.
Proceedings of the 14th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2009

Automatically Tuning Parallel and Parallelized Programs.
Proceedings of the Languages and Compilers for Parallel Computing, 2009

2008
Dynamic Resource Management in Energy Constrained Heterogeneous Computing Systems Using Voltage Scaling.
IEEE Trans. Parallel Distributed Syst., 2008

PEAK - a fast and effective performance tuning system via compiler optimization orchestration.
ACM Trans. Program. Lang. Syst., 2008

Measuring High-Performance Computing with Real Applications.
Comput. Sci. Eng., 2008

Incorporation of OpenMP Memory Consistency into Conventional Dataflow Analysis.
Proceedings of the OpenMP in a New Era of Parallelism, 4th International Workshop, 2008

Efficient content search in iShare, a P2P based Internet-sharing system.
Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008

Adaptive tuning in a dynamically changing resource environment.
Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008

Optimizing irregular shared-memory applications for clusters.
Proceedings of the 22nd Annual International Conference on Supercomputing, 2008

Adaptive runtime tuning of parallel sparse matrix-vector multiplication on distributed memory systems.
Proceedings of the 22nd Annual International Conference on Supercomputing, 2008

Application of Automatic Parallelization to Modern Challenges of Scientific Computing Industries.
Proceedings of the 2008 International Conference on Parallel Processing, 2008

2007
Prediction of Resource Availability in Fine-Grained Cycle Sharing Systems Empirical Evaluation.
J. Grid Comput., 2007

Speculative thread decomposition through empirical optimization.
Proceedings of the 12th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2007

Open Internet-based Sharing for Desktop Grids in iShare.
Proceedings of the 21th International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007

Programming Distributed Memory Systems Using OpenMP.
Proceedings of the 21th International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007

Failure-aware checkpointing in fine-grained cycle sharing systems.
Proceedings of the 16th International Symposium on High-Performance Distributed Computing (HPDC-16 2007), 2007

2006
Exploiting reference idempotency to reduce speculative storage overflow.
ACM Trans. Program. Lang. Syst., 2006

Optimizing irregular shared-memory applications for distributed-memory systems.
Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2006

Context-sensitive domain-independent algorithm composition and selection.
Proceedings of the ACM SIGPLAN 2006 Conference on Programming Language Design and Implementation, 2006

Can Transactions Enhance Parallel Programs?
Proceedings of the Languages and Compilers for Parallel Computing, 2006

Implementing Tomorrow's Programming Languages.
Proceedings of the Parallel and Distributed Processing and Applications, 2006

Executing MPI programs on virtual machines in an Internet sharing system.
Proceedings of the 20th International Parallel and Distributed Processing Symposium (IPDPS 2006), 2006

Empirical Studies on the Behavior of Resource Availability in Fine-Grained Cycle Sharing Systems.
Proceedings of the 2006 International Conference on Parallel Processing (ICPP 2006), 2006

Resource Availability Prediction in Fine-Grained Cycle Sharing Systems.
Proceedings of the 15th IEEE International Symposium on High Performance Distributed Computing, 2006

Fast and Effective Orchestration of Compiler Optimizations for Automatic Performance Tuning.
Proceedings of the Fourth IEEE/ACM International Symposium on Code Generation and Optimization (CGO 2006), 2006

Fast, automatic, procedure-level performance tuning.
Proceedings of the 15th International Conference on Parallel Architectures and Compilation Techniques (PACT 2006), 2006

2005
Interprocedural Symbolic Range Propagation for Optimizing Compilers.
Proceedings of the Languages and Compilers for Parallel Computing, 2005

On the Interaction of Tiling and Automatic Parallelization.
Proceedings of the OpenMP Shared Memory Parallel Programming - International Workshops, 2005

Dynamic Mapping in Energy Constrained Heterogeneous Computing Systems.
Proceedings of the 19th International Parallel and Distributed Processing Symposium (IPDPS 2005), 2005

Towards automatic translation of OpenMP to MPI.
Proceedings of the 19th Annual International Conference on Supercomputing, 2005

iShare - Open Internet Sharing Built on Peer-to-Peer and Web.
Proceedings of the Advances in Grid Computing, 2005

2004
SPEC HPG benchmarks for high-performance systems.
Int. J. High Perform. Comput. Netw., 2004

Rating Compiler Optimizations for Automatic Performance Tuning.
Proceedings of the ACM/IEEE SC2004 Conference on High Performance Networking and Computing, 2004

Min-cut program decomposition for thread-level speculation.
Proceedings of the ACM SIGPLAN 2004 Conference on Programming Language Design and Implementation 2004, 2004

Combined compile-time and runtime-driven, pro-active data movement in software DSM systems.
Proceedings of the 7th Workshop on languages, 2004

Experiences in Using Cetus for Source-to-Source Transformations.
Proceedings of the Languages and Compilers for High Performance Computing, 2004

Data forwarding through in-memory precomputation threads.
Proceedings of the 18th Annual International Conference on Supercomputing, 2004

Protection mechanisms for application service hosting platforms.
Proceedings of the 4th IEEE/ACM International Symposium on Cluster Computing and the Grid (CCGrid 2004), 2004

Decentralized and Hierarchical Discovery of Software Applications in the iShare Internet Sharing System.
Proceedings of the ISCA 17th International Conference on Parallel and Distributed Computing Systems, 2004

2003
Quantitative performance analysis of the SPEC OMPM2001 benchmarks.
Sci. Program., 2003

Large System Performance of SPEC OMP Benchmark Suites.
Int. J. Parallel Program., 2003

Optimizing OpenMP Programs on Software Distributed Shared Memory Systems.
Int. J. Parallel Program., 2003

Supporting Realistic OpenMP Applications on a Commodity Cluster of Workstations.
Proceedings of the OpenMP Shared Memory Parallel Programming, 2003

Cetus - An Extensible Compiler Infrastructure for Source-to-Source Transformation.
Proceedings of the Languages and Compilers for Parallel Computing, 2003

1D performance analysis and tracing of technical and Java applications on the Itanium2 processor.
Proceedings of the 2003 IEEE International Symposium on Performance Analysis of Systems and Software, 2003

SPEC HPG Benchmarks for Large Systems.
Proceedings of the High Performance Computing, 5th International Symposium, 2003

Topic Introduction.
Proceedings of the Euro-Par 2003. Parallel Processing, 2003

2002
Performance Analysis of Symbolic Analysis Techniques for Parallelizing Compilers.
Proceedings of the Languages and Compilers for Parallel Computing, 15th Workshop, 2002

Large System Performance of SPEC OMP2001 Benchmarks.
Proceedings of the High Performance Computing, 4th International Symposium, 2002

SPEC HPC2002: The Next High-Performance Computer Benchmark.
Proceedings of the High Performance Computing, 4th International Symposium, 2002

Towards OpenMP Execution on Software Distributed Shared Memory Systems.
Proceedings of the High Performance Computing, 4th International Symposium, 2002

Is OpenMP for Grids?.
Proceedings of the 16th International Parallel and Distributed Processing Symposium (IPDPS 2002), 2002

Workshop Introduction.
Proceedings of the 16th International Parallel and Distributed Processing Symposium (IPDPS 2002), 2002

2001
Parallel programming environment for OpenMP.
Sci. Program., 2001

Performance characteristics of the SPEC OMP2001 benchmarks.
SIGARCH Comput. Archit. News, 2001

Parallel programming with message passing and directives.
Comput. Sci. Eng., 2001

Portable Compilers for OpenMP.
Proceedings of the OpenMP Shared Memory Parallel Programming, 2001

SPEComp: A New Benchmark Suite for Measuring Parallel Computer Performance.
Proceedings of the OpenMP Shared Memory Parallel Programming, 2001

High-level adaptive program optimization with ADAPT.
Proceedings of the 2001 ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPOPP'01), 2001

Reference idempotency analysis: a framework for optimizing speculative execution.
Proceedings of the 2001 ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPOPP'01), 2001

The Structure of a Compiler for Explicit and Implicit Parallelism.
Proceedings of the Languages and Compilers for Parallel Computing, 2001

Multiplex: unifying conventional and speculative thread-level parallelism on a chip multiprocessor.
Proceedings of the 15th international conference on Supercomputing, 2001

2000
Where Does the Speedup Go: Quantitative Modeling of Performance Losses in Shared-Memory Programs.
Parallel Process. Lett., 2000

A network-computing infrastructure for tool experimentation applied to computer architecture education.
Proceedings of the 2000 workshop on Computer architecture education, 2000

Towards an Integrated, Web-executable Parallel Programming Tool Environment.
Proceedings of the Proceedings Supercomputing 2000, 2000

Interactive Instrumentation and Tuning of OpenMP Programs.
Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications, 2000

Interactive and Modular Optimization with InterPol.
Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications, 2000

A Performance Advisor Tool for Shared-Memory Parallel Programming.
Proceedings of the Languages and Compilers for Parallel Computing, 2000

Quantifying Differences between OpenMP and MPI Using a Large-Scale Application Suite.
Proceedings of the High Performance Computing, Third International Symposium, 2000

ADAPT: Automated De-Coupled Adaptive Program Transformation.
Proceedings of the 2000 International Conference on Parallel Processing, 2000

Compiler Techniques for Energy Saving in Instruction Caches of Speculative Parallel Microarchitectures.
Proceedings of the 2000 International Conference on Parallel Processing, 2000

A framework for remote dynamic program optimization.
Proceedings of ACM SIGPLAN Workshop on Dynamic and Adaptive Compilation and Optimization (Dynamo 2000), 2000

1999
Issues and Approaches in Parallel Multi-Component and Multi-Physics Simulations.
Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications, 1999

Compiling for Speculative Architectures.
Proceedings of the Languages and Compilers for Parallel Computing, 1999

Dynamically Adaptive Parallel Programs.
Proceedings of the High Performance Computing, Second International Symposium, 1999

Reducing Parallel Overheads Through Dynamic Serialization.
Proceedings of the 13th International Parallel Processing Symposium / 10th Symposium on Parallel and Distributed Processing (IPPS / SPDP '99), 1999

1998
On the Automatic Parallelization of the Perfect Benchmarks.
IEEE Trans. Parallel Distributed Syst., 1998

Nonlinear and Symbolic Data Dependence Testing.
IEEE Trans. Parallel Distributed Syst., 1998

Compiler-Based Tools for Analyzing Parallel Programs.
Parallel Comput., 1998

Parallel Programming and Performance Evaluation with the URSA Tool Family.
Int. J. Parallel Program., 1998

Automatic Parallelization of C by Means of Language Transcription.
Proceedings of the Languages and Compilers for Parallel Computing, 1998

Performance Forecasting: Towards a Methodology for Characterizing Large Computational Applications.
Proceedings of the 1998 International Conference on Parallel Processing (ICPP '98), 1998

URSA MAJOR: Exploring Web Technology for Design and Evaluation of High-Performance Systems.
Proceedings of the High-Performance Computing and Networking, 1998

On the Implementation of Broadcast, Scatter and Gather in a Heterogeneous Architecture.
Proceedings of the Thirty-First Annual Hawaii International Conference on System Sciences, 1998

1997
Changing Interaction of Compiler and Architecture.
Computer, 1997

Implicit Finite Element Applications: A Case for Matching the Number of Processors to the Dynamics of the Program Execution.
Proceedings of the Eighth SIAM Conference on Parallel Processing for Scientific Computing, 1997

Interactive Compilation and Performance Analysis with URSA MINOR.
Proceedings of the Languages and Compilers for Parallel Computing, 1997

Towards the Design of a Heterogeneous Hierarchical Machine: A Simulation Approach.
Proceedings of the Proceedings 30st Annual Simulation Symposium (SS '97), April 7-9, 1997, 1997

1996
Parallel Programming with Polaris.
Computer, 1996

Are Parallel Workstations the Right Target for Parallelizing Compilers?
Proceedings of the Languages and Compilers for Parallel Computing, 1996

Portable Parallel Programming Languages.
Proceedings of the 1996 International Conference on Parallel Processing Workshop, 1996

Restructuring Programs for High-Speed Computers with Polaris.
Proceedings of the 1996 International Conference on Parallel Processing Workshop, 1996

1995
Parallel Performance of a Combustion Chemistry Simulation.
Sci. Program., 1995

Automatic Program Restructuring for Parallel Computing and the Polaris Fortran Translator.
Proceedings of the Seventh SIAM Conference on Parallel Processing for Scientific Computing, 1995

Demand-Driven, Symbolic Range Propagation.
Proceedings of the Languages and Compilers for Parallel Computing, 1995

Symbolic range propagation.
Proceedings of IPPS '95, 1995

Idiom Recognition in the Polaris Parallelizing Compiler.
Proceedings of the 9th international conference on Supercomputing, 1995

1994
Automatic Detection of Parallelism: A grand challenge for high performance computing.
IEEE Parallel Distributed Technol. Syst. Appl., 1994

The range test: a dependence test for symbolic, non-linear expressions.
Proceedings of the Proceedings Supercomputing '94, 1994

Polaris: Improving the Effectiveness of Parallelizing Compilers.
Proceedings of the Languages and Compilers for Parallel Computing, 1994

An Overview of Symbolic Analysis Techniques Needed for the Effective Parallelization of the Perfect Benchmarks.
Proceedings of the 1994 International Conference on Parallel Processing, 1994

1993
The M<sup>3</sup> Multiprocessor Laboratory.
IEEE Trans. Parallel Distributed Syst., 1993

Automatic program parallelization.
Proc. IEEE, 1993

Restructuring Fortran programs for Cedar.
Concurr. Pract. Exp., 1993



Toward a Methodology of Optimizing Programs for High-Performance Computers.
Proceedings of the 7th international conference on Supercomputing, 1993

1992
Performance Analysis of Parallelizing Compilers on the Perfect Benchmarks Programs.
IEEE Trans. Parallel Distributed Syst., 1992

1991
Parallelization and Performance of Conjugate Gradient Algorithms on the Cedar Hierarchical-Memory Multiprocessor.
Proceedings of the Third ACM SIGPLAN Symposium on Principles & Practice of Parallel Programming (PPOPP), 1991

Experience in the Automatic Parallelization of Four Perfect-Benchmark Programs.
Proceedings of the Languages and Compilers for Parallel Computing, 1991

An Effectiveness Study of Parallelizing Compiler Techniques.
Proceedings of the International Conference on Parallel Processing, 1991

1990
Cedar Fortran and Its Compiler.
Proceedings of the CONPAR 90, 1990

1988
Programmieren, konfigurieren, Mensch-Maschine-Dialog: e. Beitr. zu e. integrierten Programmentwicklungsumgebung für Multiprozessoren.
PhD thesis, 1988

Making a Compiler Easily Portable.
IEEE Softw., 1988


  Loading...