Barbara M. Chapman

ORCID: 0000-0001-8449-8579

Affiliations:
  • Stony Brook University, NY, USA
  • University of Houston, USA (former)


According to our database, Barbara M. Chapman authored at least 234 papers between 1990 and 2024.

Bibliography

2024
Quantum optimization algorithms: Energetic implications.
Concurr. Comput. Pract. Exp., July, 2024

Evaluation of Directive-Based Programming Models for Stencil Computation on Current GPGPU Architectures.
Proceedings of the Advancing OpenMP for Future Accelerators, 2024

Towards a Scalable and Efficient PGAS-Based Distributed OpenMP.
Proceedings of the Advancing OpenMP for Future Accelerators, 2024

ParaGraph: Weighted Graph Representation for Performance Optimization of HPC Kernels.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2024

2023
GPU First - Execution of Legacy CPU Codes on GPUs.
CoRR, 2023

OpenMP Advisor.
CoRR, 2023

OpenMP Kernel Language Extensions for Performance Portable GPU Codes.
Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023

Transfer Learning Across Heterogeneous Features For Efficient Tensor Program Generation.
Proceedings of the 2nd International Workshop on Extreme Heterogeneity Solutions, 2023

MPI-based Remote OpenMP Offloading: A More Efficient and Easy-to-use Implementation.
Proceedings of the 14th International Workshop on Programming Models and Applications for Multicores and Manycores, 2023

Exploring the Limits of Generic Code Execution on GPUs via Direct (OpenMP) Offload.
Proceedings of the OpenMP: Advanced Task-Based, Device and Compiler Programming, 2023

OpenMP Advisor: A Compiler Tool for Heterogeneous Architectures.
Proceedings of the OpenMP: Advanced Task-Based, Device and Compiler Programming, 2023

Maximizing Parallelism and GPU Utilization For Direct GPU Compilation Through Ensemble Execution.
Proceedings of the 52nd International Conference on Parallel Processing Workshops, 2023

Implementing OpenMP's SIMD Directive in LLVM's GPU Runtime.
Proceedings of the 52nd International Conference on Parallel Processing, 2023

Performance Study on CPU-based Machine Learning with PyTorch.
Proceedings of the HPC Asia 2023 Workshops, 2023

2022
OpenMP application experiences: Porting to accelerated nodes.
Parallel Comput., 2022

Extending OpenMP and OpenSHMEM for Efficient Heterogeneous Computing.
Proceedings of the IEEE/ACM Parallel Applications Workshop: Alternatives To MPI+X, 2022

Direct GPU Compilation and Execution for Host Applications with OpenMP Parallelism.
Proceedings of the Eighth IEEE/ACM Workshop on the LLVM Compiler Infrastructure in HPC, 2022

Just-in-Time Compilation and Link-Time Optimization for OpenMP Target Offloading.
Proceedings of the OpenMP in a Modern World: From Multi-device Support to Meta Programming, 2022

Towards Efficient Remote OpenMP Offloading.
Proceedings of the OpenMP in a Modern World: From Multi-device Support to Meta Programming, 2022

COMPOFF: A Compiler Cost model using Machine Learning to predict the Cost of OpenMP Offloading.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2022

Co-Designing an OpenMP GPU Runtime and Optimizations for Near-Zero Overhead Execution.
Proceedings of the 2022 IEEE International Parallel and Distributed Processing Symposium, 2022

Bring the BitCODE-Moving Compute and Data in Distributed Heterogeneous Systems.
Proceedings of the IEEE International Conference on Cluster Computing, 2022

Efficient Execution of OpenMP on GPUs.
Proceedings of the IEEE/ACM International Symposium on Code Generation and Optimization, 2022

Towards neural architecture-aware exploration of compiler optimizations in a deep learning graph compiler.
Proceedings of the CF '22: 19th ACM International Conference on Computing Frontiers, Turin, Italy, May 17, 2022

2021
Comparing the behavior of OpenMP Implementations with various Applications on two different Fujitsu A64FX platforms.
Proceedings of the PEARC '21: Practice and Experience in Advanced Research Computing, 2021

OpenSHMEM Active Message Extension for Task-Based Programming.
Proceedings of the OpenSHMEM and Related Technologies. OpenSHMEM in the Era of Exascale and Smart Networks, 2021

HPCFAIR: Enabling FAIR AI for HPC Applications.
Proceedings of the IEEE/ACM Workshop on Machine Learning in High Performance Computing Environments, 2021

Experience Report: Writing a Portable GPU Runtime with OpenMP 5.1.
Proceedings of the OpenMP: Enabling Massive Node-Level Parallelism, 2021

Comparing OpenMP Implementations with Applications Across A64FX Platforms.
Proceedings of the OpenMP: Enabling Massive Node-Level Parallelism, 2021

OpenSHMEM Checker - A Clang Based Static Checker for OpenSHMEM.
Proceedings of the 20th International Symposium on Parallel and Distributed Computing, 2021

Combining Static and Dynamic Analysis to Query Characteristics of HPC Applications.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium Workshops, 2021

Performance Evaluation of Deep Learning Compilers for Edge Inference.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium Workshops, 2021

A Virtual GPU as Developer-Friendly OpenMP Offload Target.
Proceedings of the ICPP Workshops 2021: 50th International Conference on Parallel Processing, 2021

A64FX performance: experience on Ookami.
Proceedings of the IEEE International Conference on Cluster Computing, 2021

2020
Enhancing DataRaceBench for Evaluating Data Race Detection Tools.
Proceedings of the 4th IEEE/ACM International Workshop on Software Correctness for HPC Applications, 2020

Checkpointing OpenSHMEM Programs Using Compiler Analysis.
Proceedings of the 10th IEEE/ACM Workshop on Fault Tolerance for HPC at eXtreme Scale, 2020

Concurrent Execution of Deferred OpenMP Target Tasks with Hidden Helper Threads.
Proceedings of the Languages and Compilers for Parallel Computing, 2020

Evaluating Performance of OpenMP Tasks in a Seismic Stencil Application.
Proceedings of the OpenMP: Portable Multi-Level Parallelism on Modern Systems, 2020

Data Transfer and Reuse Analysis Tool for GPU-Offloading Using OpenMP.
Proceedings of the OpenMP: Portable Multi-Level Parallelism on Modern Systems, 2020

Toward Supporting Multi-GPU Targets via Taskloop and User-Defined Schedules.
Proceedings of the OpenMP: Portable Multi-Level Parallelism on Modern Systems, 2020

2019
Performance and energy impact of OpenMP runtime configurations on power constrained systems.
Sustain. Comput. Informatics Syst., 2019

Enabling Low-Overhead Communication in Multi-threaded OpenSHMEM Applications using Contexts.
Proceedings of the 2019 IEEE/ACM Parallel Applications Workshop, Alternatives To MPI, 2019

Compiler assisted hybrid implicit and explicit GPU memory management under unified address space.
Proceedings of the International Conference for High Performance Computing, 2019

Kernel Fusion/Decomposition for Automatic GPU-Offloading.
Proceedings of the IEEE/ACM International Symposium on Code Generation and Optimization, 2019

2018
The OpenACC data model: Preliminary study on its major challenges and implementations.
Parallel Comput., 2018

Is Data Placement Optimization Still Relevant on Newer GPUs?
Proceedings of the 2018 IEEE/ACM Performance Modeling, 2018

SHCOLL - A Standalone Implementation of OpenSHMEM-Style Collectives API.
Proceedings of the OpenSHMEM and Related Technologies. OpenSHMEM in the Era of Extreme Heterogeneity, 2018

Manage OpenMP GPU Data Environment Under Unified Address Space.
Proceedings of the Evolving OpenMP for Evolving Architectures, 2018

Mapping OpenMP to a Distributed Tasking Runtime.
Proceedings of the Evolving OpenMP for Evolving Architectures, 2018

2017
Performance portability in reverse time migration and seismic modelling via OpenACC.
Int. J. High Perform. Comput. Appl., 2017

An Efficient Data Layout Transformation Algorithm for Locality-Aware Parallel Sparse FFT.
Proceedings of the Seventh Workshop on Irregular Applications: Architectures and Algorithms, 2017

Benchmarking and Evaluating Unified Memory for OpenMP GPU Offloading.
Proceedings of the Fourth Workshop on the LLVM Compiler Infrastructure in HPC, 2017

Assessing One-to-One Parallelism Levels Mapping for OpenMP Offloading to GPUs.
Proceedings of the 8th International Workshop on Programming Models and Applications for Multicores and Manycores, 2017

Implementing the OpenACC Data Model.
Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium Workshops, 2017

A detailed analysis of OpenMP runtime configurations for power constrained systems.
Proceedings of the Eighth International Green and Sustainable Computing Conference, 2017

2016
Compiler transformation of nested loops for general purpose GPUs.
Concurr. Comput. Pract. Exp., 2016

An Analytical Model-Based Auto-tuning Framework for Locality-Aware Loop Scheduling.
Proceedings of the High Performance Computing - 31st International Conference, 2016

A Scalable Task Parallelism Approach for LU Decomposition with Multicore CPUs.
Proceedings of the Second International Workshop on Extreme Scale Programming Models and Middleware, 2016

Towards Automatic HBM Allocation Using LLVM: A Case Study with Knights Landing.
Proceedings of the Third Workshop on the LLVM Compiler Infrastructure in HPC, 2016

Increasing Computational Asynchrony in OpenSHMEM with Active Messages.
Proceedings of the OpenSHMEM and Related Technologies. Enhancing OpenSHMEM for Hybrid Environments, 2016

cusFFT: A High-Performance Sparse Fast Fourier Transform Algorithm on GPUs.
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium, 2016

Optimizing GPU Register Usage: Extensions to OpenACC and Compiler Optimizations.
Proceedings of the 45th International Conference on Parallel Processing, 2016

One-Sided Interface for Matrix Operations Using MPI-3 RMA: A Case Study with Elemental.
Proceedings of the 45th International Conference on Parallel Processing, 2016

Exploring Task Parallelism for Heterogeneous Systems Using Multicore Task Management API.
Proceedings of the Euro-Par 2016: Parallel Processing Workshops, 2016

ARCS: Adaptive Runtime Configuration Selection for Power-Constrained OpenMP Applications.
Proceedings of the 2016 IEEE International Conference on Cluster Computing, 2016

A Comparative Survey of the HPC and Big Data Paradigms: Analysis and Experiments.
Proceedings of the 2016 IEEE International Conference on Cluster Computing, 2016

2015
Multi-GPU Support on Single Node Using Directive-Based Programming Model.
Sci. Program., 2015

Programming Models, Languages, and Compilers for Manycore and Heterogeneous Architectures.
Sci. Program., 2015

LLVM parallel intermediate representation: design and evaluation using OpenSHMEM communications.
Proceedings of the Second Workshop on the LLVM Compiler Infrastructure in HPC, 2015

GPU technology applied to reverse time migration and seismic modeling via OpenACC.
Proceedings of the Sixth International Workshop on Programming Models and Applications for Multicores and Manycores, 2015

Extending the Strided Communication Interface in OpenSHMEM.
Proceedings of the OpenSHMEM and Related Technologies. Experiences, Implementations, and Technologies, 2015

Check-Pointing Approach for Fault Tolerance in OpenSHMEM.
Proceedings of the OpenSHMEM and Related Technologies. Experiences, Implementations, and Technologies, 2015

False Sharing Detection in OpenMP Applications Using OMPT API.
Proceedings of the OpenMP: Heterogenous Execution and Data Movements, 2015

OpenMP-MCA: Leveraging Multiprocessor Embedded Systems Using Industry Standards.
Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium Workshop, 2015

Adaptive OpenMP Task Scheduling Using Runtime APIs and Machine Learning.
Proceedings of the 14th IEEE International Conference on Machine Learning and Applications, 2015

Deploying OpenMP Task Parallelism on Multicore Embedded Systems with MCA Task APIs.
Proceedings of the 17th IEEE International Conference on High Performance Computing and Communications, 2015

OpenSHMEM as a Portable Communication Layer for PGAS Models: A Case Study with Coarray Fortran.
Proceedings of the 2015 IEEE International Conference on Cluster Computing, 2015

A Team-Based Methodology of Memory Hierarchy-Aware Runtime Support in Coarray Fortran.
Proceedings of the 2015 IEEE International Conference on Cluster Computing, 2015

2014
Performance of CPU/GPU compiler directives on ISO/TTI kernels.
Computing, 2014

A Coarray Fortran implementation to support data-intensive application development.
Clust. Comput., 2014

Accelerating Kirchhoff migration on GPU using directives.
Proceedings of the First Workshop on Accelerator Programming using Directives, 2014

Power and energy footprint of OpenMP programs using OpenMP runtime API.
Proceedings of the 2nd International Workshop on Energy Efficient Supercomputing, 2014

SPEC ACCEL: A Standard Application Suite for Measuring Hardware Accelerator Performance.
Proceedings of the High Performance Computing Systems. Performance Modeling, Benchmarking, and Simulation, 2014

Reduction Operations in Parallel Loops for GPGPUs.
Proceedings of the 2014 PPOPP International Workshop on Programming Models and Applications for Multicores and Manycores, 2014

Extending the OpenSHMEM Memory Model to Support User-Defined Spaces.
Proceedings of the 8th International Conference on Partitioned Global Address Space Programming Models, 2014

Native Mode-Based Optimizations of Remote Memory Accesses in OpenSHMEM for Intel Xeon Phi.
Proceedings of the 8th International Conference on Partitioned Global Address Space Programming Models, 2014

Analysis of Energy and Performance of PGAS-based Data Access Patterns.
Proceedings of the 8th International Conference on Partitioned Global Address Space Programming Models, 2014

Fault Tolerance for OpenSHMEM.
Proceedings of the 8th International Conference on Partitioned Global Address Space Programming Models, 2014

Extending the OpenSHMEM Analyzer to Perform Synchronization and Multi-valued Analysis.
Proceedings of the OpenSHMEM and Related Technologies. Experiences, Implementations, and Tools, 2014

Analyzing the Energy and Power Consumption of Remote Memory Accesses in the OpenSHMEM Model.
Proceedings of the OpenSHMEM and Related Technologies. Experiences, Implementations, and Tools, 2014

Implementing OpenSHMEM Using MPI-3 One-Sided Communication.
Proceedings of the OpenSHMEM and Related Technologies. Experiences, Implementations, and Tools, 2014

NAS Parallel Benchmarks for GPGPUs Using a Directive-Based Programming Model.
Proceedings of the Languages and Compilers for Parallel Computing, 2014

A Validation Testsuite for OpenACC 1.0.
Proceedings of the 2014 IEEE International Parallel & Distributed Processing Symposium Workshops, 2014

Predicting Cache Contention for Multithread Applications at Compile Time.
Proceedings of the 2014 IEEE International Parallel & Distributed Processing Symposium Workshops, 2014

OpenMP Task Scheduling Analysis via OpenMP Runtime API and Tool Visualization.
Proceedings of the 2014 IEEE International Parallel & Distributed Processing Symposium Workshops, 2014

Using GPU Shared Memory with a Directive-Based Approach.
Proceedings of the 2014 IEEE International Parallel & Distributed Processing Symposium Workshops, 2014

PLC Introduction and Committees.
Proceedings of the 2014 IEEE International Parallel & Distributed Processing Symposium Workshops, 2014

Power Consumption Due to Data Movement in Distributed Programming Models.
Proceedings of the Euro-Par 2014 Parallel Processing, 2014

2013
Experiences Developing the OpenUH Compiler and Runtime Infrastructure.
Int. J. Parallel Program., 2013

KLONOS: Similarity-based planning tool support for porting scientific applications.
Concurr. Comput. Pract. Exp., 2013

Parallel sparse FFT.
Proceedings of the 3rd Workshop on Irregular Applications - Architectures and Algorithms, 2013

Performance Analysis of the NWChem TCE for Different Communication Patterns.
Proceedings of the High Performance Computing Systems. Performance Modeling, Benchmarking and Simulation, 2013

libEOMP: a portable OpenMP runtime library based on MCA APIs for embedded systems.
Proceedings of the 2013 PPOPP International Workshop on Programming Models and Applications for Multicores and Manycores, 2013

Portable mapping of openMP to multicore embedded systems using MCA APIs.
Proceedings of the SIGPLAN/SIGBED Conference on Languages, 2013

Compile Time Modeling of Off-Chip Memory Bandwidth for Parallel Loops.
Proceedings of the Languages and Compilers for Parallel Computing, 2013

Compiling a High-Level Directive-Based Programming Model for GPGPUs.
Proceedings of the Languages and Compilers for Parallel Computing, 2013

Porting Applications with OpenMP Using Similarity Analysis.
Proceedings of the Languages and Compilers for Parallel Computing, 2013

Open Source Task Profiling by Extending the OpenMP Runtime API.
Proceedings of the OpenMP in the Era of Low Power Devices and Accelerators, 2013

Early Experiences with the OpenMP Accelerator Model.
Proceedings of the OpenMP in the Era of Low Power Devices and Accelerators, 2013

A Prototype Implementation of OpenMP Task Dependency Support.
Proceedings of the OpenMP in the Era of Low Power Devices and Accelerators, 2013

Exploring Programming Multi-GPUs Using OpenMP and OpenACC-Based Hybrid Model.
Proceedings of the 2013 IEEE International Symposium on Parallel & Distributed Processing, 2013

Filesystem Aware Scalable I/O Framework for Data-Intensive Parallel Applications.
Proceedings of the 2013 IEEE International Symposium on Parallel & Distributed Processing, 2013

PLC Introduction.
Proceedings of the 2013 IEEE International Symposium on Parallel & Distributed Processing, 2013

Improving performance of openSHMEM reference library by portable PE mapping technique.
Proceedings of the International Conference on Supercomputing, 2013

Statistical modeling of power/energy of scientific kernels on a multi-GPU system.
Proceedings of the International Green Computing Conference, 2013

2012
OpenMP parallelism for fluid and fluid-particulate systems.
Parallel Comput., 2012

Performance and Power Characteristics of Matrix Multiplication Algorithms on Multicore and Shared Memory Machines.
Proceedings of the 2012 SC Companion: High Performance Computing, 2012

Experiences with OpenMP, PGI, HMPP and OpenACC Directives on ISO/TTI Kernels.
Proceedings of the 2012 SC Companion: High Performance Computing, 2012

Poster: Statistical Power and Energy Modeling of Multi-GPU Kernels.
Proceedings of the 2012 SC Companion: High Performance Computing, 2012

A Coarray Fortran Implementation to Support Data-Intensive Application Development.
Proceedings of the 2012 SC Companion: High Performance Computing, 2012

An OpenMP 3.1 Validation Testsuite.
Proceedings of the OpenMP in a Heterogeneous World - 8th International Workshop on OpenMP, 2012

Specification and Performance Evaluation of Parallel I/O Interfaces for OpenMP.
Proceedings of the OpenMP in a Heterogeneous World - 8th International Workshop on OpenMP, 2012

Compile-Time Detection of False Sharing via Loop Cost Modeling.
Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium Workshops & PhD Forum, 2012

A Compiler-Based Tool for Array Analysis in HPC Applications.
Proceedings of the 41st International Conference on Parallel Processing Workshops, 2012

A Similarity-Based Analysis Tool for Porting OpenMP Applications.
Proceedings of the Facing the Multicore-Challenge, 2012

OpenMP.
Fundamentals of Multicore Software Development, 2012

2011
OpenMP.
Proceedings of the Encyclopedia of Parallel Computing, 2011

High performance computing using MPI and OpenMP on multi-core parallel systems.
Parallel Comput., 2011

The International Exascale Software Project roadmap.
Int. J. High Perform. Comput. Appl., 2011

Detecting False Sharing in OpenMP Applications Using the DARWIN Framework.
Proceedings of the Languages and Compilers for Parallel Computing, 2011

A Dynamic Optimization Framework for OpenMP.
Proceedings of the OpenMP in the Petascale Era - 7th International Workshop on OpenMP, 2011

A Runtime Implementation of OpenMP Tasks.
Proceedings of the OpenMP in the Petascale Era - 7th International Workshop on OpenMP, 2011

Experiences with High-Level Programming Directives for Porting Applications to GPUs.
Proceedings of the Facing the Multicore - Challenge II, 2011

Programming Strategies for GPUs and their Power Consumption.
Proceedings of the 2011 International Conference on Parallel Architectures and Compilation Techniques, 2011

2010
Enabling locality-aware computations in OpenMP.
Sci. Program., 2010

An open-source compiler and runtime implementation for Coarray Fortran.
Proceedings of the Fourth Conference on Partitioned Global Address Space Programming Model, 2010

Introducing OpenSHMEM: SHMEM for the PGAS community.
Proceedings of the Fourth Conference on Partitioned Global Address Space Programming Model, 2010

2009
Programmability Issues.
Int. J. High Perform. Comput. Appl., 2009

Exploiting global optimizations for OpenMP programs in the OpenUH compiler.
Proceedings of the 14th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2009

Simplifying Heterogeneous Embedded Systems Programming Based on OpenMP.
Proceedings of the Parallel Computing: From Multicores and GPU's to Petascale, 2009

Scalability Evaluation of Barrier Algorithms for OpenMP.
Proceedings of the Evolving OpenMP in an Age of Extreme Parallelism, 2009

Implementing OpenMP on a high performance embedded multicore MPSoC.
Proceedings of the 23rd IEEE International Symposium on Parallel and Distributed Processing, 2009

Open Source Software Support for the OpenMP Runtime API for Profiling.
Proceedings of the ICPPW 2009, 2009

Introduction.
Proceedings of the Euro-Par 2009 Parallel Processing, 2009

2008
Optimizing Compilers.
Proceedings of the Wiley Encyclopedia of Computer Science and Engineering, 2008

A performance tuning methodology with compiler support.
Sci. Program., 2008

Foreword.
Sci. Program., 2008

Performance Evaluation of a Multi-Zone Application in Different OpenMP Approaches.
Int. J. Parallel Program., 2008

Capturing performance knowledge for automated analysis.
Proceedings of the ACM/IEEE Conference on High Performance Computing, 2008

Managing Multicore with OpenMP (Extended Abstract).
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2008

Open64 compiler infrastructure for emerging multicore/manycore architecture All Symposium Tutorial.
Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008

Using OpenMP - portable shared memory parallel programming.
Scientific and engineering computation, MIT Press, ISBN: 978-0-262-53302-7, 2008

2007
Performance modeling of communication and computation in hybrid MPI and OpenMP applications.
Simul. Model. Pract. Theory, 2007

OpenMP Implementation of SPICE3 Circuit Simulator.
Int. J. Parallel Program., 2007

OpenUH: an optimizing, portable OpenMP compiler.
Concurr. Comput. Pract. Exp., 2007

Compiler Support for Efficient Instrumentation.
Proceedings of the Parallel Computing: Architectures, 2007

The Future of OpenMP in the Multi-Core Era.
Proceedings of the Parallel Computing: Architectures, 2007

Enhancing OpenMP and Its Implementation for Programming Multicore Systems.
Proceedings of the Parallel Computing: Architectures, 2007

Towards an Implementation of the OpenMP Collector API.
Proceedings of the Parallel Computing: Architectures, 2007

Performance Evaluation of a Multi-zone Application in Different OpenMP Approaches.
Proceedings of the A Practical Programming Model for the Multi-Core Era, 2007

Parallel Data Flow Analysis for OpenMP Programs.
Proceedings of the A Practical Programming Model for the Multi-Core Era, 2007

Invited Paper: A Compile-time Cost Model for OpenMP.
Proceedings of the 21st International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007

Strategies and Implementation for Translating OpenMP Code for Clusters.
Proceedings of the High Performance Computing and Communications, 2007

Scientific workflow scheduling in computational grids - Planning, reservation, and data/network-awareness.
Proceedings of the 8th IEEE/ACM International Conference on Grid Computing (GRID 2007), 2007

The Multicore Programming Challenge.
Proceedings of the Advanced Parallel Processing Technologies, 7th International Symposium, 2007

2006
Campus Grids Meet Applications: Modeling, Metascheduling and Integration.
J. Grid Comput., 2006

Scalable Shared Memory Parallel Programming: Will One Size Fit All?
Proceedings of the 14th Euromicro International Conference on Parallel, 2006

Performance Instrumentation and Compiler Optimizations for MPI/OpenMP Applications.
Proceedings of the OpenMP Shared Memory Parallel Programming - International Workshops, 2006

Sun Grid Engine Package for OSCAR - A Google Summer Of Code 2005 Project.
Proceedings of the 20th Annual International Symposium on High Performance Computing Systems and Applications (HPCS 2006), 2006

A Feature-Rich Workflow Description Language that Supports Resource Co-allocations.
Proceedings of the High Performance Computing and Grids in Action, 2006

Topic 4: Compilers for High Performance.
Proceedings of the Euro-Par 2006, Parallel Processing, 12th International Euro-Par Conference, Dresden, Germany, August 28, 2006

Toward Enhancing OpenMP's Work-Sharing Directives.
Proceedings of the Euro-Par 2006, Parallel Processing, 12th International Euro-Par Conference, Dresden, Germany, August 28, 2006

2005
Towards a more efficient implementation of OpenMP for clusters via translation to global arrays.
Parallel Comput., 2005

OpenMP.
Parallel Comput., 2005

Evaluating OpenMP on Chip MultiThreading Platforms.
Proceedings of the OpenMP Shared Memory Parallel Programming - International Workshops, 2005

A Grid Authentication System with Revocation Guarantees.
Proceedings of the High Performance Computing, 2005

Addressing credential revocation in grid environments.
Proceedings of the 6th IEEE/ACM International Conference on Grid Computing (GRID 2005), 2005

2004
Towards optimisation of openMP codes for synchronisation and data reuse.
Int. J. High Perform. Comput. Netw., 2004

OpenMP-oriented applications for distributed shared memory architectures.
Concurr. Comput. Pract. Exp., 2004

Efficient Implementation of OpenMP for Clusters with Implicit Data Distribution.
Proceedings of the Shared Memory Parallel Programming with OpenMP, 2004

Dragon: A Static and Dynamic Tool for OpenMP.
Proceedings of the Shared Memory Parallel Programming with OpenMP, 2004

A Tool to Display Array Access Patterns in OpenMP Programs.
Proceedings of the Applied Parallel Computing, 2004

Efficient Translation of OpenMP to Distributed Memory.
Proceedings of the Computational Science, 2004

2003
Introduction.
Sci. Program., 2003

Analyses for the Translation of OpenMP Codes into SPMD Style with Array Privatization.
Proceedings of the OpenMP Shared Memory Parallel Programming, 2003

OpenMP on Distributed Memory via Global Arrays.
Proceedings of the Parallel Computing: Software Technology, 2003

An OGSI-compliant portal for campus grids.
Proceedings of the Enhanced Interoperable Systems. Proceedings of the 10th ISPE International Conference on Concurrent Engineering (ISPE CE 2003), 2003

Asynchronous Execution of OpenMP Code.
Proceedings of the Computational Science - ICCS 2003, 2003

2002
Achieving performance under OpenMP on ccNUMA and software distributed shared memory systems.
Concurr. Comput. Pract. Exp., 2002

Improving the Performance of OpenMP by Array Privatization.
Proceedings of the OpenMP Shared Memory Parallel Programming, 2002

Parallel Application Development with the Hybrid MPI+OpenMP Programming Model.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 9th European PVM/MPI Users' Group Meeting, Linz, Austria, September 29, 2002

Performance Comparisons of Basic OpenMP Constructs.
Proceedings of the High Performance Computing, 4th International Symposium, 2002

Implementing OpenMP Using Dataflow Execution Model for Data Locality and Efficient Parallel Execution.
Proceedings of the 16th International Parallel and Distributed Processing Symposium (IPDPS 2002), 2002

XML-Based Policy Engine Framework for Usage Policy Management in Grids.
Proceedings of the Grid Computing, 2002

2001
Performance Oriented Programming for NUMA Architectures.
Proceedings of the OpenMP Shared Memory Parallel Programming, 2001

Program Development Environment for OpenMP Programs on ccNUMA Architectures.
Proceedings of the Large-Scale Scientific Computing, Third International Conference, 2001

Policy Engine: A Framework for Authorization, Accounting Policy Specification and Evaluation in Grids.
Proceedings of the Grid Computing, 2001

2000
Program Development Tools for Clusters of Shared Memory Multiprocessors.
J. Supercomput., 2000

Special Issue: EWOMP'99 - First European Workshop on OpenMP.
Concurr. Pract. Exp., 2000

Tools for OpenMP application development: the POST project.
Concurr. Pract. Exp., 2000

Compilers for High Performance.
Proceedings of the Euro-Par 2000, Parallel Processing, 6th International Euro-Par Conference, Munich, Germany, August 29, 2000

1999
Multiple data parallelism with HPF and KeLP.
Future Gener. Comput. Syst., 1999

Tools for Development of Programs for a Cluster of Shared Memory Multiprocessors.
Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications, 1999

FITS - A Light-Weight Integrated Programming Environment.
Proceedings of the Euro-Par '99 Parallel Processing, 5th International Euro-Par Conference, Toulouse, France, August 31, 1999

Compilers for High Performance Systems - Introduction.
Proceedings of the Euro-Par '99 Parallel Processing, 5th International Euro-Par Conference, Toulouse, France, August 31, 1999

1998
Software support for advanced applications on distributed memory multiprocessor systems.
PhD thesis, 1998

Interprocedural Array Alignment Analysis.
Proceedings of the High-Performance Computing and Networking, 1998

OpenMP and HPF: Integrating Two Paradigms.
Proceedings of the Euro-Par '98 Parallel Processing, 1998

1997
Vienna-Fortran/HPF Extensions for Sparse and Irregular Problems and Their Compilation.
IEEE Trans. Parallel Distributed Syst., 1997

A Coordination Language for Multidisciplinary Applications.
Sci. Program., 1997

ANALYST: Tool Support for the Migration of Fortran Applications to Parallel Systems.
Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications, 1997

Combining Inter- and Intradimensional Alignment Analysis to Support Data Distribution.
Proceedings of the High-Performance Computing and Networking, 1997

1995
High Performance Fortran Languages: Advanced applications and their implementation.
Future Gener. Comput. Syst., 1995

High-Level Languages for Parallel Scientific Computing.
Proceedings of the SOFSEM '95, 22nd Seminar on Current Trends in Theory and Practice of Informatics, Milovy, Czech Republic, November 23, 1995

Automatic Parallelization of the AVL FIRE Benchmark for a Distributed-Memory System.
Proceedings of the Applied Parallel Computing, 1995

Data-parallel Language Features for Sparse Codes.
Proceedings of the Languages, 1995

New data-parallel language features for sparse matrix computations.
Proceedings of IPPS '95, 1995

1994
SUPERB and Vienna Fortran.
Parallel Comput., 1994

Extending HPF for Advanced Data-Parallel Applications.
IEEE Parallel Distributed Technol. Syst. Appl., 1994

Extending Vienna Fortran with Task Parallelism.
Proceedings of the 1994 International Conference on Parallel and Distributed Systems, 1994

A Software Architecture for Multidisciplinary Applications: Integrating Task and Data Parallelism.
Proceedings of the Parallel Processing: CONPAR 94, 1994

1993
Compiling for distributed-memory systems.
Proc. IEEE, 1993

Dynamic data distributions in Vienna Fortran.
Proceedings of Supercomputing '93, 1993

High Performance Fortran Without Templates: An Alternative Model for Distribution and Alignment.
Proceedings of the Fourth ACM SIGPLAN Symposium on Principles & Practice of Parallel Programming (PPOPP), 1993

Automatic Support for Data Distribution on Distributed Memory Multiprocessor Systems.
Proceedings of the Languages and Compilers for Parallel Computing, 1993

1992
Programming in Vienna Fortran.
Sci. Program., 1992

User Defined Mappings in Vienna Fortran.
Proceedings of the 2nd SIGPLAN Workshop on Languages, Compilers, and Run-Time Environments for Distributed Memory Multiprocessors, Boulder, Colorado, September 30, 1992

Handling Distributed Data in Vienna Fortran Procedures.
Proceedings of the Languages and Compilers for Parallel Computing, 1992

1991
Software Tools for Parallel Program Development.
Proceedings of the Programming Environments for High-Level Scientific Problem Solving, 1991

Knowledge-Based Parallelization for Distributed Memory Systems.
Proceedings of the Parallel Computation, First International ACPC Conference, Salzburg, Austria, September 30, 1991

1990
Supercompilers for parallel and vector computers.
ACM Press frontier series, Addison-Wesley, ISBN: 978-0-201-17560-8, 1990

