Allen D. Malony

Orcid: 0000-0002-9598-7201

Affiliations:
  • University of Oregon, Eugene, USA


According to our database1, Allen D. Malony authored at least 255 papers between 1986 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
SOMA: Observability, monitoring, and in situ analytics for exascale applications.
Concurr. Comput. Pract. Exp., August, 2024

Understanding the Impact of openPMD on BIT1, a Particle-in-Cell Monte Carlo Code, through Instrumentation, Monitoring, and In-Situ Analysis.
CoRR, 2024

Integration of Modern HPC Performance Tools in Vlasiator for Exascale Analysis and Optimization.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2024

Enabling Performance Observability for Heterogeneous HPC Workflows with SOMA.
Proceedings of the 53rd International Conference on Parallel Processing, 2024

2023
STaKTAU: profiling HPC applications' operating system usage.
CoRR, 2023

ZeroSum: User Space Monitoring of Resource Utilization and Contention on Heterogeneous HPC Systems.
Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023

2022
MARTINI: The Little Match and Replace Tool for Automatic Code Rewriting.
J. Open Source Softw., 2022

Translating High-Performance Computing Tools From Research to Practice: Experiences With the TAU Performance System.
Comput. Sci. Eng., 2022

MAPredict: Static Analysis Driven Memory Access Prediction Framework for Modern CPUs.
Proceedings of the High Performance Computing - 37th International Conference, 2022

SERVIZ: A Shared In Situ Visualization Service.
Proceedings of the SC22: International Conference for High Performance Computing, 2022

Enabling Global MPI Process Addressing in MPI Applications.
Proceedings of the EuroMPI/USA'22: 29th European MPI Users' Group Meeting, Chattanooga, TN, USA, September 26, 2022

Leveraging Compiler-Based Translation to Evaluate a Diversity of Exascale Platforms.
Proceedings of the IEEE/ACM International Workshop on Performance, 2022

MARTINI: The Little Match and Replace Tool for Automatic Application Rewriting with Code Examples.
Proceedings of the Euro-Par 2022: Parallel Processing, 2022

The Ghost of Performance Reproducibility Past.
Proceedings of the 18th IEEE International Conference on e-Science, 2022

HPC Storage Service Autotuning Using Variational- Autoencoder -Guided Asynchronous Bayesian Optimization.
Proceedings of the IEEE International Conference on Cluster Computing, 2022

2021
Optimization with the OpenACC-to-FPGA framework on the Arria 10 and Stratix 10 FPGAs.
Parallel Comput., 2021

Measuring OpenSHMEM Communication Routines with SKaMPI-OpenSHMEM User's manual.
CoRR, 2021

DiPOSH: A portable OpenSHMEM implementation for short API-to-network path.
Concurr. Comput. Pract. Exp., 2021

Artemis: Automatic Runtime Tuning of Parallel Execution Parameters Using Machine Learning.
Proceedings of the High Performance Computing - 36th International Conference, 2021

Comparing LLC-Memory Traffic between CPU and GPU Architectures.
Proceedings of the IEEE/ACM Redefining Scalability for Diversely Heterogeneous Architectures Workshop, 2021

SKaMPI-OpenSHMEM: Measuring OpenSHMEM Communication Routines.
Proceedings of the OpenSHMEM and Related Technologies. OpenSHMEM in the Era of Exascale and Smart Networks, 2021

SYMBIOSYS: A Methodology for Performance Analysis of Composable HPC Data Services.
Proceedings of the 35th IEEE International Parallel and Distributed Processing Symposium, 2021

SYMBIOMON: A High-Performance, Composable Monitoring Service.
Proceedings of the 28th IEEE International Conference on High Performance Computing, 2021

Dynamic and Adaptive Monitoring and Analysis for Many-task Ensemble Computing.
Proceedings of the IEEE International Conference on Cluster Computing, 2021

2020
Understanding the Impact of Memory Access Patterns in Intel Processors.
Proceedings of the IEEE/ACM Workshop on Memory Centric High Performance Computing, 2020

CCAMP: an integrated translation and optimization framework for OpenACC and OpenMP.
Proceedings of the International Conference for High Performance Computing, 2020

OpenACC Profiling Support for Clang and LLVM using Clacc and TAU.
Proceedings of the IEEE/ACM International Workshop on HPC User Support Tools and Workshop on Programming and Performance Visualization Tools, 2020

Multi-Platform SYCL Profiling with TAU.
Proceedings of the IWOCL '20: International Workshop on OpenCL, 2020

On-the-fly Optimization of Parallel Computation of Symbolic Symplectic Invariants.
Proceedings of the 19th International Symposium on Parallel and Distributed Computing, 2020

In-Depth Optimization with the OpenACC-to-FPGA Framework on an Arria 10 FPGA.
Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium Workshops, 2020

Identifying Optimization Opportunities Using Memory Access Tracing in OpenSHMEM Runtimes with the TAU Performance System.
Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium Workshops, 2020

MEPHESTO: Modeling Energy-Performance in Heterogeneous SoCs and Their Trade-Offs.
Proceedings of the PACT '20: International Conference on Parallel Architectures and Compilation Techniques, 2020

2019
Checkpoint/restart approaches for a thread-based MPI runtime.
Parallel Comput., 2019

Multi-Level Performance Instrumentation for Kokkos Applications Using TAU.
Proceedings of the IEEE/ACM International Workshop on Programming and Performance Visualization Tools, 2019

Peachy Parallel Assignments (EduHPC 2019).
Proceedings of the 2019 IEEE/ACM Workshop on Education for High-Performance Computing, 2019

Mixing ranks, tasks, progress and nonblocking collectives.
Proceedings of the 26th European MPI Users' Group Meeting, 2019

On the Road to DiPOSH: Adventures in High-Performance OpenSHMEM.
Proceedings of the Parallel Processing and Applied Mathematics, 2019

When Parallel Performance Measurement and Analysis Meets In Situ Analytics and Visualization.
Proceedings of the Parallel Computing: Technology Trends, 2019

Understanding the Impact of Dynamic Power Capping on Application Progress.
Proceedings of the 2019 IEEE International Parallel and Distributed Processing Symposium, 2019

Towards Runtime Analytics in a Parallel Performance System.
Proceedings of the 17th International Conference on High Performance Computing & Simulation, 2019

Runtime Adaptive Task Inlining on Asynchronous Multitasking Runtime Systems.
Proceedings of the 48th International Conference on Parallel Processing, 2019

A Plugin Architecture for the TAU Performance System.
Proceedings of the 48th International Conference on Parallel Processing, 2019

CCAMP: OpenMP and OpenACC Interoperable Framework.
Proceedings of the Euro-Par 2019: Parallel Processing Workshops, 2019

Scalable Performance Awareness for In Situ Scientific Applications.
Proceedings of the 15th International Conference on eScience, 2019

2018
The Long and Winding Road Toward Efficient High-Performance Computing.
Proc. IEEE, 2018

MPI performance engineering with the MPI tool interface: The integration of MVAPICH and TAU.
Parallel Comput., 2018

Exploring Hyper-Parameter Optimization for Neural Machine Translation on GPU Architectures.
CoRR, 2018

Transparent High-Speed Network Checkpoint/Restart in MPI.
Proceedings of the 25th European MPI Users' Group Meeting, 2018

Stingray-HPC: A Scalable Parallel Seismic Raytracing System.
Proceedings of the 26th Euromicro International Conference on Parallel, 2018

Tracking Memory Usage in OpenSHMEM Runtimes with the TAU Performance System.
Proceedings of the OpenSHMEM and Related Technologies. OpenSHMEM in the Era of Extreme Heterogeneity, 2018

A Similarity Measure for GPU Kernel Subgraph Matching.
Proceedings of the Languages and Compilers for Parallel Computing, 2018

Directive-Based, High-Level Programming and Optimizations for High-Performance Computing with FPGAs.
Proceedings of the 32nd International Conference on Supercomputing, 2018


2017
Performance Analysis of Applications in the Context of Architectural Rooflines.
Proceedings of the 8th ACM/SPEC on International Conference on Performance Engineering, 2017

Projecting Performance Data over Simulation Geometry Using SOSflow and ALPINE.
Proceedings of the Programming and Performance Visualization Tools, 2017

Simulating Power Scheduling at Scale.
Proceedings of the 5th International Workshop on Energy Efficient Supercomputing, 2017

MPI performance engineering with the MPI tool interface: the integration of MVAPICH and TAU.
Proceedings of the 24th European MPI Users' Group Meeting, 2017

Performance Analysis of OpenSHMEM Applications with TAU Commander.
Proceedings of the OpenSHMEM and Related Technologies. Big Compute and Big Data Convergence, 2017

User Co-scheduling for MPI+OpenMP Applications Using OpenMP Semantics.
Proceedings of the Scaling OpenMP for Exascale Performance and Portability, 2017

Towards a Better Expressiveness of the Speedup Metric in MPI Context.
Proceedings of the 46th International Conference on Parallel Processing Workshops, 2017

Autotuning GPU Kernels via Static and Predictive Analysis.
Proceedings of the 46th International Conference on Parallel Processing, 2017

QoS-Aware Virtual Machine Consolidation in Cloud Datacenter.
Proceedings of the 2017 IEEE International Conference on Cloud Engineering, 2017

2016
Concurrency in electrical neuroinformatics: parallel computation for studying the volume conduction of brain electrical fields in human head tissues.
Concurr. Comput. Pract. Exp., 2016

A Data Parallel Algorithm for Seismic Raytracing.
Proceedings of the High Performance Computing for Computational Science - VECPAR 2016, 2016

Scaling Spark on Lustre.
Proceedings of the High Performance Computing, 2016

A Scalable Observation System for Introspection and In Situ Analytics.
Proceedings of the 5th Workshop on Extreme-Scale Programming Tools, 2016

A Unified Platform for Exploring Power Management Strategies.
Proceedings of the 4th International Workshop on Energy Efficient Supercomputing, 2016

The UA?CG Workflow: High Performance Molecular Dynamics of Coarse-Grained Polymers.
Proceedings of the 24th Euromicro International Conference on Parallel, 2016

Profiling Production OpenSHMEM Applications.
Proceedings of the OpenSHMEM and Related Technologies. Enhancing OpenSHMEM for Hybrid Environments, 2016

A Hartree-Fock Application Using UPC++ and the New DArray Library.
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium, 2016

Systemwide Power Management with Argo.
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium Workshops, 2016

WOWMON: A Machine Learning-based Profiler for Self-adaptive Instrumentation of Scientific Workflows.
Proceedings of the International Conference on Computational Science 2016, 2016

Scaling Spark on HPC Systems.
Proceedings of the 25th ACM International Symposium on High-Performance Parallel and Distributed Computing, 2016

Towards Scaling Parallel Seismic Raytracing.
Proceedings of the 2016 IEEE Intl Conference on Computational Science and Engineering, 2016

ARCS: Adaptive Runtime Configuration Selection for Power-Constrained OpenMP Applications.
Proceedings of the 2016 IEEE International Conference on Cluster Computing, 2016

2015
An Autonomic Performance Environment for Exascale.
Supercomput. Front. Innov., 2015

Fast equilibration of coarse-grained polymeric liquids.
J. Comput. Sci., 2015

Autoperf: Workflow Support for Performance Experiments.
Proceedings of the 2015 Workshop on Challenges in Performance Methods for Software Development, 2015

Dynamic power sharing for higher job throughput.
Proceedings of the International Conference for High Performance Computing, 2015

An MPI Halo-Cell Implementation for Zero-Copy Abstraction.
Proceedings of the 22nd European MPI Users' Group Meeting, 2015

A Performance Analysis of SIMD Algorithms for Monte Carlo Simulations of Nuclear Reactor Cores.
Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium, 2015

Parallel Performance Optimizations on Unstructured Mesh-based Simulations.
Proceedings of the International Conference on Computational Science, 2015

Through the Looking-Glass: From Performance Observation to Dynamic Adaptation.
Proceedings of the 24th International Symposium on High-Performance Parallel and Distributed Computing, 2015

POW: System-wide Dynamic Reallocation of Limited Power in HPC.
Proceedings of the 24th International Symposium on High-Performance Parallel and Distributed Computing, 2015

Identifying Optimization Opportunities Within Kernel Execution in GPU Codes.
Proceedings of the Euro-Par 2015: Parallel Processing Workshops, 2015

2014
Collective mind: Towards practical and collaborative auto-tuning.
Sci. Program., 2014

A 3D Finite-Difference BiCG Iterative Solver with the Fourier-Jacobi Preconditioner for the Anisotropic EIT/EEG Forward Problem.
Comput. Math. Methods Medicine, 2014

Linking performance data into scientific visualization tools.
Proceedings of the First Workshop on Visual Performance Analysis, 2014

General Hybrid Parallel Profiling.
Proceedings of the 22nd Euromicro International Conference on Parallel, 2014

Profiling Non-numeric OpenSHMEM Applications with the TAU Performance System.
Proceedings of the OpenSHMEM and Related Technologies. Experiences, Implementations, and Tools, 2014

Integrated Measurement for Cross-Platform OpenMP Performance Analysis.
Proceedings of the Using and Improving OpenMP for Devices, Tasks, and More, 2014

From MultiTask to MultiCore: Design and Implementation Using an RTOS.
Proceedings of the IEEE 13th International Symposium on Parallel and Distributed Computing, 2014

WorkQ: A many-core producer/consumer execution model applied to PGAS computations.
Proceedings of the 20th IEEE International Conference on Parallel and Distributed Systems, 2014

Toward multi-target autotuning for accelerators.
Proceedings of the 20th IEEE International Conference on Parallel and Distributed Systems, 2014

Particle advection performance over varied architectures and workloads.
Proceedings of the 21st International Conference on High Performance Computing, 2014

2013
Tools for machine-learning-based empirical autotuning and specialization.
Int. J. High Perform. Comput. Appl., 2013

Open Source Task Profiling by Extending the OpenMP Runtime API.
Proceedings of the OpenMP in the Era of Low Power Devices and Accelerators, 2013

Next-generation human brain neuroimaging and the role of high-performance computing.
Proceedings of the International Conference on High Performance Computing & Simulation, 2013

An early prototype of an autonomic performance environment for exascale.
Proceedings of the 3rd International Workshop on Runtime and Operating Systems for Supercomputers, 2013

Inspector-Executor Load Balancing Algorithms for Block-Sparse Tensor Contractions.
Proceedings of the 42nd International Conference on Parallel Processing, 2013

MIL: A language to build program analysis tools through static binary instrumentation.
Proceedings of the 20th Annual International Conference on High Performance Computing, 2013

2012
Performance characterization of global address space applications: a case study with NWChem.
Concurr. Comput. Pract. Exp., 2012

Incorporating anatomical connectivity into EEG source estimation via sparse approximation with cortical graph wavelets.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Composing typemaps in Twig.
Proceedings of the Generative Programming and Component Engineering, 2012

Topic 2: Performance Prediction and Evaluation.
Proceedings of the Euro-Par 2012 Parallel Processing - 18th International Conference, 2012

A Type-Based Approach to Separating Protocol from Application Logic - A Case Study in Hybrid Computer Programming.
Proceedings of the Euro-Par 2012 Parallel Processing - 18th International Conference, 2012

2011
Proceedings of the Encyclopedia of Parallel Computing, 2011

Metrics.
Proceedings of the Encyclopedia of Parallel Computing, 2011

Advances in the TAU Performance System.
Proceedings of the Tools for High Performance Computing 2011, 2011

Score-P: A Joint Performance Measurement Run-Time Infrastructure for Periscope, Scalasca, TAU, and Vampir.
Proceedings of the Tools for High Performance Computing 2011, 2011

Characterizing I/O Performance Using the TAU Performance System.
Proceedings of the Applications, Tools and Techniques on the Road to Exascale Computing, Proceedings of the conference ParCo 2011, 31 August, 2011

Computational Modeling of Human Head Electromagnetics for Source Localization of Milliscale Brain Dynamics.
Proceedings of the Medicine Meets Virtual Reality 18 - NextMed, 2011

Parallel Performance Measurement of Heterogeneous Parallel Systems with GPUs.
Proceedings of the International Conference on Parallel Processing, 2011

An Approach to Creating Performance Visualizations in a Parallel Profile Analysis Tool.
Proceedings of the Euro-Par 2011: Parallel Processing Workshops - CCPI, CGWS, HeteroPar, HiBB, HPCVirt, HPPC, HPSS, MDGS, ProPer, Resilience, UCHPC, VHPC, Bordeaux, France, August 29, 2011

Development of embedded multicore systems.
Proceedings of the IEEE 16th Conference on Emerging Technologies & Factory Automation, 2011

2010
A framework for scalable, parallel performance monitoring.
Concurr. Comput. Pract. Exp., 2010

Improving the Scalability of Performance Evaluation Tools.
Proceedings of the Applied Parallel and Scientific Computing, 2010

An experimental approach to performance measurement of heterogeneous parallel applications using CUDA.
Proceedings of the 24th International Conference on Supercomputing, 2010

Design and Implementation of a Hybrid Parallel Performance Measurement System.
Proceedings of the 39th International Conference on Parallel Processing, 2010

TAUmon: Scalable Online Performance Data Analysis in TAU.
Proceedings of the Euro-Par 2010 Parallel Processing Workshops, 2010

Score-P: A Unified Performance Measurement System for Petascale Applications.
Proceedings of the Competence in High Performance Computing 2010, 2010

2009
Workflow representation and runtime based on lazy functional streams.
Proceedings of the 4th Workshop on Workflows in Support of Large-Scale Science, 2009

Performance Measurement of Applications with GPU Acceleration using CUDA.
Proceedings of the Parallel Computing: From Multicores and GPU's to Petascale, 2009

Performance Tool Integration in a GPU Programming Environment: Experiences with TAU and HMPP.
Proceedings of the Parallel Computing: From Multicores and GPU's to Petascale, 2009

Integrated Performance Views in Charm++: Projections Meets TAU.
Proceedings of the ICPP 2009, 2009

A 3D Vector-Additive Iterative Solver for the Anisotropic Inhomogeneous Poisson Equation in the Forward EEG problem.
Proceedings of the Computational Science, 2009

An Open Domain-Extensible Environment for Simulation-Based Scientific Investigation (ODESSI).
Proceedings of the Computational Science, 2009

A Holistic Approach for Performance Measurement and Analysis for Petascale Applications.
Proceedings of the Computational Science, 2009

A Generic and Configurable Source-Code Instrumentation Component.
Proceedings of the Computational Science, 2009

2008
Knowledge support and automation for performance analysis with PerfExplorer 2.0.
Sci. Program., 2008

Integrated parallel performance views.
Clust. Comput., 2008

Capturing performance knowledge for automated analysis.
Proceedings of the ACM/IEEE Conference on High Performance Computing, 2008

Evolution of a Parallel Performance System.
Proceedings of the Tools for High Performance Computing, 2008

Performance Tool Workflows.
Proceedings of the Computational Science, 2008

Observing Performance Dynamics Using Parallel Profile Snapshots.
Proceedings of the Euro-Par 2008, 2008

Parametric Studies in Eclipse with TAU and PerfExplorer.
Proceedings of the Euro-Par 2008 Workshops, 2008

WOOL: A Workflow Programming Language.
Proceedings of the Fourth International Conference on e-Science, 2008

In search of sweet-spots in parallel performance monitoring.
Proceedings of the 2008 IEEE International Conference on Cluster Computing, 29 September, 2008

Conductivity Analysis for High-Resolution EEG.
Proceedings of the 2008 International Conference on BioMedical Engineering and Informatics, 2008

2007
Supporting Nested OpenMP Parallelism in the TAU Performance System.
Int. J. Parallel Program., 2007

Compensation of Measurement Overhead in Parallel Performance Profiling.
Int. J. High Perform. Comput. Appl., 2007

Performance modeling of component assemblies.
Concurr. Comput. Pract. Exp., 2007

Knowledge engineering for automatic parallel performance diagnosis.
Concurr. Comput. Pract. Exp., 2007

The ghost in the machine: observing the effects of kernel operation on parallel application performance.
Proceedings of the ACM/IEEE Conference on High Performance Networking and Computing, 2007

Scalable, Automated Performance Analysis with TAU and PerfExplorer.
Proceedings of the Parallel Computing: Architectures, 2007

Development of NeuroElectroMagnetic ontologies(NEMO): a framework for mining brainwave ontologies.
Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2007

Automatic Performance Diagnosis of Parallel Computations with Compositional Models.
Proceedings of the 21th International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007

Use of Parallel Simulated Annealing for Computational Modeling of Human Head Conductivity.
Proceedings of the Computational Science, 2007

<i>TAUoverSupermon</i> : Low-Overhead Online Parallel Performance Monitoring.
Proceedings of the Euro-Par 2007, 2007

A Semi-Automatic Framework for Mining ERP Patterns.
Proceedings of the 21st International Conference on Advanced Information Networking and Applications (AINA 2007), 2007

2006
The Tau Parallel Performance System.
Int. J. High Perform. Comput. Appl., 2006

A Component Architecture for High-Performance Scientific Computing.
Int. J. High Perform. Comput. Appl., 2006

Bridging the language gap in scientific computing: the Chasm approach.
Concurr. Comput. Pract. Exp., 2006

Automatic brain mr image segmentation by relative thresholding and morphological image analysis.
Proceedings of the VISAPP 2006: Proceedings of the First International Conference on Computer Vision Theory and Applications, 2006

Open trace - The open trace format (OTF) and open tracing for HPC.
Proceedings of the ACM/IEEE SC2006 Conference on High Performance Networking and Computing, 2006

TAUg: Runtime Global Performance Data Access Using MPI.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2006

Optimization of Instrumentation in Parallel Performance Evaluation Tools.
Proceedings of the Applied Parallel Computing. State of the Art in Scientific Computing, 2006

Workload Characterization Using the TAU Performance System.
Proceedings of the Applied Parallel Computing. State of the Art in Scientific Computing, 2006

A Multiscale Morphological Approach to Topology Correction of Cortical Surfaces.
Proceedings of the Medical Imaging and Augmented Reality, 2006

Parallel ICA methods for EEG neuroimaging.
Proceedings of the 20th International Parallel and Distributed Processing Symposium (IPDPS 2006), 2006

Integrating TAU with Eclipse: A Performance Analysis System in an Integrated Development Environment.
Proceedings of the High Performance Computing and Communications, 2006

Model-Based Relative Performance Diagnosis of Wavefront Parallel Computations.
Proceedings of the High Performance Computing and Communications, 2006

Early Experiences with KTAU on the IBM BG/L.
Proceedings of the Euro-Par 2006, Parallel Processing, 12th International Euro-Par Conference, Dresden, Germany, August 28, 2006

Model-Based Performance Diagnosis of Master-Worker Parallel Computations.
Proceedings of the Euro-Par 2006, Parallel Processing, 12th International Euro-Par Conference, Dresden, Germany, August 28, 2006

Kernel-Level Measurement for Integrated Parallel Performance Views: the KTAU Project.
Proceedings of the 2006 IEEE International Conference on Cluster Computing, 2006

2005
Overhead Compensation in Performance Profiling.
Parallel Process. Lett., 2005

Performance technology for parallel and distributed component software.
Concurr. Pract. Exp., 2005

PerfExplorer: A Performance Data Mining Framework For Large-Scale Parallel Computing.
Proceedings of the ACM/IEEE SC2005 Conference on High Performance Networking and Computing, 2005

Performance Profiling Overhead Compensation for MPI Programs.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2005

A Scalable Approach to MPI Application Performance Analysis.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2005

Towards the Performance Visualization of Web-Service Based Applications.
Proceedings of the Parallel Processing and Applied Mathematics, 2005

Phase-Based Parallel Performance Profiling.
Proceedings of the Parallel Computing: Current & Future Issues of High-End Computing, 2005

Multi-cluster, Mixed-Mode Computational Modeling of Human Head Conductivity.
Proceedings of the OpenMP Shared Memory Parallel Programming - International Workshops, 2005

Design and Implementation of a Parallel Performance Data Management Framework.
Proceedings of the 34th International Conference on Parallel Processing (ICPP 2005), 2005

Computational Modeling of Human Head Conductivity.
Proceedings of the Computational Science, 2005

Trace-Based Parallel Performance Overhead Compensation.
Proceedings of the High Performance Computing and Communications, 2005

Models for On-the-Fly Compensation of Measurement Overhead in Parallel Performance Profiling.
Proceedings of the Euro-Par 2005, Parallel Processing, 11th International Euro-Par Conference, Lisbon, Portugal, August 30, 2005

Topic 2 - Performance Prediction and Evaluation.
Proceedings of the Euro-Par 2005, Parallel Processing, 11th International Euro-Par Conference, Lisbon, Portugal, August 30, 2005

05501 Abstracts Collection - Automatic Performance Analysis.
Proceedings of the Automatic Performance Analysis, 12.-16. December 2005, 2005

05501 Summary - Automatic Performance Analysis.
Proceedings of the Automatic Performance Analysis, 12.-16. December 2005, 2005

2004
Performance Measurement and Modeling of Component Applications in a High Performance Computing Environment: A Case Study.
Proceedings of the 18th International Parallel and Distributed Processing Symposium (IPDPS 2004), 2004

A Framework for Evaluating ICA Methods of Artifact Removal from Multichannel EEG.
Proceedings of the Independent Component Analysis and Blind Signal Separation, 2004

Topic 1: Support Tools and Environments.
Proceedings of the Euro-Par 2004 Parallel Processing, 2004

Computational Quality of Service for Scientific Components.
Proceedings of the Component-Based Software Engineering, 7th International Symposium, 2004

2003
Performance Analysis Integration in the Uintah Software Development Cycle.
Int. J. Parallel Program., 2003

Integration and application of TAU in parallel Java environments.
Concurr. Comput. Pract. Exp., 2003

Online Performance Observation of Large-Scale Parallel Applications.
Proceedings of the Parallel Computing: Software Technology, 2003

Online Remote Trace Analysis of Parallel Applications on High-Performance Clusters.
Proceedings of the High Performance Computing, 5th International Symposium, 2003

A Performance Interface for Component-Based Applications.
Proceedings of the 17th International Parallel and Distributed Processing Symposium (IPDPS 2003), 2003

Performance Modeling for Dynamic Algorithm Selection.
Proceedings of the Computational Science - ICCS 2003, 2003

Performance Instrumentation and Measurement for Terascale Systems.
Proceedings of the Computational Science - ICCS 2003, 2003

Performance Evaluation and Prediction.
Proceedings of the Euro-Par 2003. Parallel Processing, 2003

ParaProf: A Portable, Extensible, and Scalable Tool for Parallel Performance Profile Analysis.
Proceedings of the Euro-Par 2003. Parallel Processing, 2003

A Distributed Performance Analysis Architecture for Clusters.
Proceedings of the 2003 IEEE International Conference on Cluster Computing (CLUSTER 2003), 2003

2002
Design and Prototype of a Performance Tool Interface for OpenMP.
J. Supercomput., 2002

Integrating Performance Analysis in the Uintah Software Development Cycle.
Proceedings of the High Performance Computing, 4th International Symposium, 2002

2001
Performance Technology for Complex Parallel and Distributed Systems.
Parallel Distributed Comput. Pract., 2001

A theory and architecture for automating performance diagnosis.
Future Gener. Comput. Syst., 2001

Performance data mining: Automated diagnosis, adaption, and optimization.
Future Gener. Comput. Syst., 2001

On using SCALEA for performance analysis of distributed and parallel programs.
Proceedings of the 2001 ACM/IEEE conference on Supercomputing, 2001

Integration and applications of the TAU performance system in parallel Java environments.
Proceedings of the ACM 2001 Java Grande Conference, Stanford University, California, USA, 2001

Topic 02: Performance Evaluation and Prediction.
Proceedings of the Euro-Par 2001: Parallel Processing, 2001

2000
Computational experiments using distributed tools in a web-based electronic notebook environment.
Future Gener. Comput. Syst., 2000

A Tool Framework for Static and Dynamic Analysis of Object-Oriented Software with Templates.
Proceedings of the Proceedings Supercomputing 2000, 2000

Tools for Parallel Computing: A Performance Evaluation Perspective.
Proceedings of the Handbook on Parallel and Distributed Processing, 2000

1999
A Runtime Monitoring Framework for the TAU Profiling System.
Proceedings of the Computing in Object-Oriented Parallel Environments, 1999

SMARTS: exploiting temporal locality and parallelism through vertical execution.
Proceedings of the 13th international conference on Supercomputing, 1999

Computational Experiments using Distributed Tools in a Web-Based Electronic Notebook Environment.
Proceedings of the High-Performance Computing and Networking, 7th International Conference, 1999

INTERLACE: An Interoperation and Linking Architecture for Computational Engines.
Proceedings of the Euro-Par '99 Parallel Processing, 5th International Euro-Par Conference, Toulouse, France, August 31, 1999

1998
DAQV: Distributed Array Query and Visualization Framework.
Theor. Comput. Sci., 1998

Portable profiling and tracing for parallel, scientific applications using C++.
Proceedings of the SIGMETRICS Symposium on Parallel and Distributed Tools, 1998

An IL converter and program database for analysis tools.
Proceedings of the SIGMETRICS Symposium on Parallel and Distributed Tools, 1998

A Prototype Notebook-Based Environment for Computational Tools Computational Tools.
Proceedings of the ACM/IEEE Conference on Supercomputing, 1998

Supporting Runtime Tool Interaction for Parallel Simulations.
Proceedings of the ACM/IEEE Conference on Supercomputing, 1998

Dynamic Performance Callstack Sampling: Merging TAU and DAQV.
Proceedings of the Applied Parallel Computing, 1998

A Framework for Interacting with Distributed Programs and Data.
Proceedings of the Seventh IEEE International Symposium on High Performance Distributed Computing, 1998

Performance Evaluation and Prediction.
Proceedings of the Euro-Par '98 Parallel Processing, 1998

1997
Building Domain-Specific Environments for Computational Science: a Case Study in Seismic Tomography.
Int. J. High Perform. Comput. Appl., 1997

1996
Distributed Arry Query and Visualization for High Performance Fortran.
Proceedings of the Euro-Par '96 Parallel Processing, 1996

1995
Parallel performance visualization: from practice to theory.
IEEE Parallel Distributed Technol. Syst. Appl., 1995

The Visual Display of Parallel Performance Data.
Computer, 1995

Visualizing parallel programs and performance.
IEEE Computer Graphics and Applications, 1995

Data Interpretation and Experiment Planning in Performance Tools (Panel).
Proceedings of the 1995 ACM SIGMETRICS joint international conference on Measurement and modeling of computer systems, 1995

Speedy: An Integrated Performance Extrapolation Tool for pC++ Programs.
Proceedings of the Quantitative Evaluation of Computing and Communication Systems, 1995

Performance-Oriented Development of Irregular, Unstructured and Unbalanced Parallel Applications in the N-MAP Environment.
Proceedings of the Quantitative Evaluation of Computing and Communication Systems, 1995

Capturing and automating performance diagnosis: the Poirot approach.
Proceedings of IPPS '95, 1995

Performance Extrapolation of Parallel Programs.
Proceedings of the 1995 International Conference on Parallel Processing, 1995

1994
Next-Generation Parallel Performance Visualization: A Prototyping Environment for Visualization Development.
Proceedings of the PARLE '94: Parallel Architectures and Languages Europe, 1994

Performance Analysis of pC++: A Portable Data-Parallel Programming System for Scalable Parallel Computers.
Proceedings of the 8th International Symposium on Parallel Processing, 1994

Stochastic Modeling of Scaled Parallel Programs.
Proceedings of the Proceedings 1994 International Conference on Parallel and Distributed Systems, 1994

Automatic Scalability Analysis of Parallel Programs Based on Modeling Techniques.
Proceedings of the Computer Performance Evaluation, 1994

TAU: A Portable Parallel Program Analysis Environment for pC++.
Proceedings of the Parallel Processing: CONPAR 94, 1994

1993
Supercomputing around the world (Mini symposium).
Proceedings of the Proceedings Supercomputing '93, 1993


Implementing a parallel C++ runtime system for scalable parallel systems.
Proceedings of the Proceedings Supercomputing '93, 1993

Perturbation Analysis of High Level Instrumentation for SPMD Programs.
Proceedings of the Fourth ACM SIGPLAN Symposium on Principles & Practice of Parallel Programming (PPOPP), 1993

1992
Performance Measurement Intrusion and Perturbation Analysis.
IEEE Trans. Parallel Distributed Syst., 1992

Supercomputing Around the World.
Proceedings of the Proceedings Supercomputing '92, 1992

1991
Tracing application program execution on the CRAY X-MP and CRAY-2.
J. Supercomput., 1991

Traceview: A Trace Visualization Tool.
IEEE Softw., 1991

Performance Prediction for Parallel Numerical Algorithms.
Int. J. High Speed Comput., 1991

Event-Based Performance Perturbation: A Case Study.
Proceedings of the Third ACM SIGPLAN Symposium on Principles & Practice of Parallel Programming (PPOPP), 1991

Models for Performance Perturbation Analysis.
Proceedings of the ACM/ONR Workshop on Parallel and Distributed Debugging, 1991

Trace View: A Trace Visualization Tool.
Proceedings of the Parallel Computation, First International ACPC Conference, Salzburg, Austria, September 30, 1991

1990
Performance observability
PhD thesis, 1990

Experimentally Characterizing the Behavior of Multiprocessor Memory Systems. A Case Study.
IEEE Trans. Software Eng., 1990

Run-time monitoring of concurrent programs on the Cedar multiprocessor.
Proceedings of the Proceedings Supercomputing '90, New York, NY, USA, November 12-16, 1990, 1990

A hardware-based performance monitor for the Intel iPSC/2 hypercube.
Proceedings of the 4th international conference on Supercomputing, 1990

1989
Faust: An Integrated Environment for Parallel Programming.
IEEE Softw., 1989

Behavioral Characterization of Multiprocessor Memory Systems: A Case Study.
Proceedings of the 1989 ACM SIGMETRICS international conference on Measurement and modeling of computer systems, 1989

Performance prediction of loop constructs on multiprocessor hierarchical-memory systems.
Proceedings of the 3rd international conference on Supercomputing, 1989

1988
Parallel Discrete Event Simulation Using Shared Memory.
IEEE Trans. Software Eng., 1988

1987
Parallel Discrete Event Simulation: A Shared Memory Approach.
Proceedings of the 1987 ACM SIGMETRICS conference on Measurement and modeling of computer systems, 1987

MPF: A Portable Message Passing Facility for Shared Memory Multiprocessors.
Proceedings of the International Conference on Parallel Processing, 1987

1986
Vector Processing on the Alliant FX/8 Multiprocessor.
Proceedings of the International Conference on Parallel Processing, 1986


  Loading...