Allen D. Malony
Orcid: 0000-0002-9598-7201Affiliations:
- University of Oregon, Eugene, USA
According to our database1,
Allen D. Malony
authored at least 255 papers
between 1986 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
On csauthors.net:
Bibliography
2024
Concurr. Comput. Pract. Exp., August, 2024
Understanding the Impact of openPMD on BIT1, a Particle-in-Cell Monte Carlo Code, through Instrumentation, Monitoring, and In-Situ Analysis.
CoRR, 2024
Integration of Modern HPC Performance Tools in Vlasiator for Exascale Analysis and Optimization.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2024
Proceedings of the 53rd International Conference on Parallel Processing, 2024
2023
ZeroSum: User Space Monitoring of Resource Utilization and Contention on Heterogeneous HPC Systems.
Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023
2022
J. Open Source Softw., 2022
Translating High-Performance Computing Tools From Research to Practice: Experiences With the TAU Performance System.
Comput. Sci. Eng., 2022
MAPredict: Static Analysis Driven Memory Access Prediction Framework for Modern CPUs.
Proceedings of the High Performance Computing - 37th International Conference, 2022
Proceedings of the SC22: International Conference for High Performance Computing, 2022
Proceedings of the EuroMPI/USA'22: 29th European MPI Users' Group Meeting, Chattanooga, TN, USA, September 26, 2022
Proceedings of the IEEE/ACM International Workshop on Performance, 2022
MARTINI: The Little Match and Replace Tool for Automatic Application Rewriting with Code Examples.
Proceedings of the Euro-Par 2022: Parallel Processing, 2022
Proceedings of the 18th IEEE International Conference on e-Science, 2022
HPC Storage Service Autotuning Using Variational- Autoencoder -Guided Asynchronous Bayesian Optimization.
Proceedings of the IEEE International Conference on Cluster Computing, 2022
2021
Optimization with the OpenACC-to-FPGA framework on the Arria 10 and Stratix 10 FPGAs.
Parallel Comput., 2021
CoRR, 2021
Concurr. Comput. Pract. Exp., 2021
Artemis: Automatic Runtime Tuning of Parallel Execution Parameters Using Machine Learning.
Proceedings of the High Performance Computing - 36th International Conference, 2021
Proceedings of the IEEE/ACM Redefining Scalability for Diversely Heterogeneous Architectures Workshop, 2021
Proceedings of the OpenSHMEM and Related Technologies. OpenSHMEM in the Era of Exascale and Smart Networks, 2021
Proceedings of the 35th IEEE International Parallel and Distributed Processing Symposium, 2021
Proceedings of the 28th IEEE International Conference on High Performance Computing, 2021
Proceedings of the IEEE International Conference on Cluster Computing, 2021
2020
Proceedings of the IEEE/ACM Workshop on Memory Centric High Performance Computing, 2020
Proceedings of the International Conference for High Performance Computing, 2020
Proceedings of the IEEE/ACM International Workshop on HPC User Support Tools and Workshop on Programming and Performance Visualization Tools, 2020
Proceedings of the IWOCL '20: International Workshop on OpenCL, 2020
Proceedings of the 19th International Symposium on Parallel and Distributed Computing, 2020
Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium Workshops, 2020
Identifying Optimization Opportunities Using Memory Access Tracing in OpenSHMEM Runtimes with the TAU Performance System.
Proceedings of the 2020 IEEE International Parallel and Distributed Processing Symposium Workshops, 2020
Proceedings of the PACT '20: International Conference on Parallel Architectures and Compilation Techniques, 2020
2019
Proceedings of the IEEE/ACM International Workshop on Programming and Performance Visualization Tools, 2019
Proceedings of the 2019 IEEE/ACM Workshop on Education for High-Performance Computing, 2019
Proceedings of the 26th European MPI Users' Group Meeting, 2019
Proceedings of the Parallel Processing and Applied Mathematics, 2019
When Parallel Performance Measurement and Analysis Meets In Situ Analytics and Visualization.
Proceedings of the Parallel Computing: Technology Trends, 2019
Proceedings of the 2019 IEEE International Parallel and Distributed Processing Symposium, 2019
Proceedings of the 17th International Conference on High Performance Computing & Simulation, 2019
Proceedings of the 48th International Conference on Parallel Processing, 2019
Proceedings of the 48th International Conference on Parallel Processing, 2019
Proceedings of the Euro-Par 2019: Parallel Processing Workshops, 2019
Proceedings of the 15th International Conference on eScience, 2019
2018
Proc. IEEE, 2018
MPI performance engineering with the MPI tool interface: The integration of MVAPICH and TAU.
Parallel Comput., 2018
Exploring Hyper-Parameter Optimization for Neural Machine Translation on GPU Architectures.
CoRR, 2018
Proceedings of the 25th European MPI Users' Group Meeting, 2018
Proceedings of the 26th Euromicro International Conference on Parallel, 2018
Proceedings of the OpenSHMEM and Related Technologies. OpenSHMEM in the Era of Extreme Heterogeneity, 2018
Proceedings of the Languages and Compilers for Parallel Computing, 2018
Directive-Based, High-Level Programming and Optimizations for High-Performance Computing with FPGAs.
Proceedings of the 32nd International Conference on Supercomputing, 2018
Proceedings of the 14th IEEE International Conference on e-Science, 2018
2017
Proceedings of the 8th ACM/SPEC on International Conference on Performance Engineering, 2017
Proceedings of the Programming and Performance Visualization Tools, 2017
Proceedings of the 5th International Workshop on Energy Efficient Supercomputing, 2017
MPI performance engineering with the MPI tool interface: the integration of MVAPICH and TAU.
Proceedings of the 24th European MPI Users' Group Meeting, 2017
Proceedings of the OpenSHMEM and Related Technologies. Big Compute and Big Data Convergence, 2017
Proceedings of the Scaling OpenMP for Exascale Performance and Portability, 2017
Proceedings of the 46th International Conference on Parallel Processing Workshops, 2017
Proceedings of the 46th International Conference on Parallel Processing, 2017
Proceedings of the 2017 IEEE International Conference on Cloud Engineering, 2017
2016
Concurrency in electrical neuroinformatics: parallel computation for studying the volume conduction of brain electrical fields in human head tissues.
Concurr. Comput. Pract. Exp., 2016
Proceedings of the High Performance Computing for Computational Science - VECPAR 2016, 2016
Proceedings of the 5th Workshop on Extreme-Scale Programming Tools, 2016
Proceedings of the 4th International Workshop on Energy Efficient Supercomputing, 2016
Proceedings of the 24th Euromicro International Conference on Parallel, 2016
Proceedings of the OpenSHMEM and Related Technologies. Enhancing OpenSHMEM for Hybrid Environments, 2016
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium, 2016
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium Workshops, 2016
WOWMON: A Machine Learning-based Profiler for Self-adaptive Instrumentation of Scientific Workflows.
Proceedings of the International Conference on Computational Science 2016, 2016
Proceedings of the 25th ACM International Symposium on High-Performance Parallel and Distributed Computing, 2016
Proceedings of the 2016 IEEE Intl Conference on Computational Science and Engineering, 2016
ARCS: Adaptive Runtime Configuration Selection for Power-Constrained OpenMP Applications.
Proceedings of the 2016 IEEE International Conference on Cluster Computing, 2016
2015
Proceedings of the 2015 Workshop on Challenges in Performance Methods for Software Development, 2015
Proceedings of the International Conference for High Performance Computing, 2015
Proceedings of the 22nd European MPI Users' Group Meeting, 2015
A Performance Analysis of SIMD Algorithms for Monte Carlo Simulations of Nuclear Reactor Cores.
Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium, 2015
Proceedings of the International Conference on Computational Science, 2015
Proceedings of the 24th International Symposium on High-Performance Parallel and Distributed Computing, 2015
Proceedings of the 24th International Symposium on High-Performance Parallel and Distributed Computing, 2015
Proceedings of the Euro-Par 2015: Parallel Processing Workshops, 2015
2014
Sci. Program., 2014
A 3D Finite-Difference BiCG Iterative Solver with the Fourier-Jacobi Preconditioner for the Anisotropic EIT/EEG Forward Problem.
Comput. Math. Methods Medicine, 2014
Proceedings of the First Workshop on Visual Performance Analysis, 2014
Proceedings of the 22nd Euromicro International Conference on Parallel, 2014
Proceedings of the OpenSHMEM and Related Technologies. Experiences, Implementations, and Tools, 2014
Proceedings of the Using and Improving OpenMP for Devices, Tasks, and More, 2014
Proceedings of the IEEE 13th International Symposium on Parallel and Distributed Computing, 2014
Proceedings of the 20th IEEE International Conference on Parallel and Distributed Systems, 2014
Proceedings of the 20th IEEE International Conference on Parallel and Distributed Systems, 2014
Proceedings of the 21st International Conference on High Performance Computing, 2014
2013
Int. J. High Perform. Comput. Appl., 2013
Proceedings of the OpenMP in the Era of Low Power Devices and Accelerators, 2013
Proceedings of the International Conference on High Performance Computing & Simulation, 2013
Proceedings of the 3rd International Workshop on Runtime and Operating Systems for Supercomputers, 2013
Proceedings of the 42nd International Conference on Parallel Processing, 2013
MIL: A language to build program analysis tools through static binary instrumentation.
Proceedings of the 20th Annual International Conference on High Performance Computing, 2013
2012
Performance characterization of global address space applications: a case study with NWChem.
Concurr. Comput. Pract. Exp., 2012
Incorporating anatomical connectivity into EEG source estimation via sparse approximation with cortical graph wavelets.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
Proceedings of the Generative Programming and Component Engineering, 2012
Proceedings of the Euro-Par 2012 Parallel Processing - 18th International Conference, 2012
A Type-Based Approach to Separating Protocol from Application Logic - A Case Study in Hybrid Computer Programming.
Proceedings of the Euro-Par 2012 Parallel Processing - 18th International Conference, 2012
2011
Proceedings of the Tools for High Performance Computing 2011, 2011
Score-P: A Joint Performance Measurement Run-Time Infrastructure for Periscope, Scalasca, TAU, and Vampir.
Proceedings of the Tools for High Performance Computing 2011, 2011
Proceedings of the Applications, Tools and Techniques on the Road to Exascale Computing, Proceedings of the conference ParCo 2011, 31 August, 2011
Computational Modeling of Human Head Electromagnetics for Source Localization of Milliscale Brain Dynamics.
Proceedings of the Medicine Meets Virtual Reality 18 - NextMed, 2011
Proceedings of the International Conference on Parallel Processing, 2011
An Approach to Creating Performance Visualizations in a Parallel Profile Analysis Tool.
Proceedings of the Euro-Par 2011: Parallel Processing Workshops - CCPI, CGWS, HeteroPar, HiBB, HPCVirt, HPPC, HPSS, MDGS, ProPer, Resilience, UCHPC, VHPC, Bordeaux, France, August 29, 2011
Proceedings of the IEEE 16th Conference on Emerging Technologies & Factory Automation, 2011
2010
Concurr. Comput. Pract. Exp., 2010
Proceedings of the Applied Parallel and Scientific Computing, 2010
An experimental approach to performance measurement of heterogeneous parallel applications using CUDA.
Proceedings of the 24th International Conference on Supercomputing, 2010
Proceedings of the 39th International Conference on Parallel Processing, 2010
Proceedings of the Euro-Par 2010 Parallel Processing Workshops, 2010
Proceedings of the Competence in High Performance Computing 2010, 2010
2009
Proceedings of the 4th Workshop on Workflows in Support of Large-Scale Science, 2009
Proceedings of the Parallel Computing: From Multicores and GPU's to Petascale, 2009
Performance Tool Integration in a GPU Programming Environment: Experiences with TAU and HMPP.
Proceedings of the Parallel Computing: From Multicores and GPU's to Petascale, 2009
Proceedings of the ICPP 2009, 2009
A 3D Vector-Additive Iterative Solver for the Anisotropic Inhomogeneous Poisson Equation in the Forward EEG problem.
Proceedings of the Computational Science, 2009
An Open Domain-Extensible Environment for Simulation-Based Scientific Investigation (ODESSI).
Proceedings of the Computational Science, 2009
A Holistic Approach for Performance Measurement and Analysis for Petascale Applications.
Proceedings of the Computational Science, 2009
Proceedings of the Computational Science, 2009
2008
Sci. Program., 2008
Proceedings of the ACM/IEEE Conference on High Performance Computing, 2008
Proceedings of the Tools for High Performance Computing, 2008
Proceedings of the Euro-Par 2008, 2008
Proceedings of the Euro-Par 2008 Workshops, 2008
Proceedings of the Fourth International Conference on e-Science, 2008
Proceedings of the 2008 IEEE International Conference on Cluster Computing, 29 September, 2008
Proceedings of the 2008 International Conference on BioMedical Engineering and Informatics, 2008
2007
Int. J. Parallel Program., 2007
Int. J. High Perform. Comput. Appl., 2007
Concurr. Comput. Pract. Exp., 2007
The ghost in the machine: observing the effects of kernel operation on parallel application performance.
Proceedings of the ACM/IEEE Conference on High Performance Networking and Computing, 2007
Scalable, Automated Performance Analysis with TAU and PerfExplorer.
Proceedings of the Parallel Computing: Architectures, 2007
Development of NeuroElectroMagnetic ontologies(NEMO): a framework for mining brainwave ontologies.
Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2007
Proceedings of the 21th International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007
Use of Parallel Simulated Annealing for Computational Modeling of Human Head Conductivity.
Proceedings of the Computational Science, 2007
Proceedings of the Euro-Par 2007, 2007
Proceedings of the 21st International Conference on Advanced Information Networking and Applications (AINA 2007), 2007
2006
Int. J. High Perform. Comput. Appl., 2006
Concurr. Comput. Pract. Exp., 2006
Automatic brain mr image segmentation by relative thresholding and morphological image analysis.
Proceedings of the VISAPP 2006: Proceedings of the First International Conference on Computer Vision Theory and Applications, 2006
Proceedings of the ACM/IEEE SC2006 Conference on High Performance Networking and Computing, 2006
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2006
Proceedings of the Applied Parallel Computing. State of the Art in Scientific Computing, 2006
Proceedings of the Applied Parallel Computing. State of the Art in Scientific Computing, 2006
Proceedings of the Medical Imaging and Augmented Reality, 2006
Proceedings of the 20th International Parallel and Distributed Processing Symposium (IPDPS 2006), 2006
Integrating TAU with Eclipse: A Performance Analysis System in an Integrated Development Environment.
Proceedings of the High Performance Computing and Communications, 2006
Proceedings of the High Performance Computing and Communications, 2006
Proceedings of the Euro-Par 2006, Parallel Processing, 12th International Euro-Par Conference, Dresden, Germany, August 28, 2006
Proceedings of the Euro-Par 2006, Parallel Processing, 12th International Euro-Par Conference, Dresden, Germany, August 28, 2006
Kernel-Level Measurement for Integrated Parallel Performance Views: the KTAU Project.
Proceedings of the 2006 IEEE International Conference on Cluster Computing, 2006
2005
Concurr. Pract. Exp., 2005
PerfExplorer: A Performance Data Mining Framework For Large-Scale Parallel Computing.
Proceedings of the ACM/IEEE SC2005 Conference on High Performance Networking and Computing, 2005
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2005
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2005
Proceedings of the Parallel Processing and Applied Mathematics, 2005
Phase-Based Parallel Performance Profiling.
Proceedings of the Parallel Computing: Current & Future Issues of High-End Computing, 2005
Proceedings of the OpenMP Shared Memory Parallel Programming - International Workshops, 2005
Proceedings of the 34th International Conference on Parallel Processing (ICPP 2005), 2005
Proceedings of the Computational Science, 2005
Proceedings of the High Performance Computing and Communications, 2005
Models for On-the-Fly Compensation of Measurement Overhead in Parallel Performance Profiling.
Proceedings of the Euro-Par 2005, Parallel Processing, 11th International Euro-Par Conference, Lisbon, Portugal, August 30, 2005
Proceedings of the Euro-Par 2005, Parallel Processing, 11th International Euro-Par Conference, Lisbon, Portugal, August 30, 2005
Proceedings of the Automatic Performance Analysis, 12.-16. December 2005, 2005
Proceedings of the Automatic Performance Analysis, 12.-16. December 2005, 2005
2004
Performance Measurement and Modeling of Component Applications in a High Performance Computing Environment: A Case Study.
Proceedings of the 18th International Parallel and Distributed Processing Symposium (IPDPS 2004), 2004
Proceedings of the Independent Component Analysis and Blind Signal Separation, 2004
Proceedings of the Euro-Par 2004 Parallel Processing, 2004
Proceedings of the Component-Based Software Engineering, 7th International Symposium, 2004
2003
Int. J. Parallel Program., 2003
Concurr. Comput. Pract. Exp., 2003
Online Performance Observation of Large-Scale Parallel Applications.
Proceedings of the Parallel Computing: Software Technology, 2003
Proceedings of the High Performance Computing, 5th International Symposium, 2003
Proceedings of the 17th International Parallel and Distributed Processing Symposium (IPDPS 2003), 2003
Proceedings of the Computational Science - ICCS 2003, 2003
Proceedings of the Computational Science - ICCS 2003, 2003
Proceedings of the Euro-Par 2003. Parallel Processing, 2003
ParaProf: A Portable, Extensible, and Scalable Tool for Parallel Performance Profile Analysis.
Proceedings of the Euro-Par 2003. Parallel Processing, 2003
Proceedings of the 2003 IEEE International Conference on Cluster Computing (CLUSTER 2003), 2003
2002
J. Supercomput., 2002
Proceedings of the High Performance Computing, 4th International Symposium, 2002
2001
Parallel Distributed Comput. Pract., 2001
Future Gener. Comput. Syst., 2001
Future Gener. Comput. Syst., 2001
Proceedings of the 2001 ACM/IEEE conference on Supercomputing, 2001
Integration and applications of the TAU performance system in parallel Java environments.
Proceedings of the ACM 2001 Java Grande Conference, Stanford University, California, USA, 2001
Proceedings of the Euro-Par 2001: Parallel Processing, 2001
2000
Computational experiments using distributed tools in a web-based electronic notebook environment.
Future Gener. Comput. Syst., 2000
A Tool Framework for Static and Dynamic Analysis of Object-Oriented Software with Templates.
Proceedings of the Proceedings Supercomputing 2000, 2000
Proceedings of the Handbook on Parallel and Distributed Processing, 2000
1999
Proceedings of the Computing in Object-Oriented Parallel Environments, 1999
Proceedings of the 13th international conference on Supercomputing, 1999
Computational Experiments using Distributed Tools in a Web-Based Electronic Notebook Environment.
Proceedings of the High-Performance Computing and Networking, 7th International Conference, 1999
Proceedings of the Euro-Par '99 Parallel Processing, 5th International Euro-Par Conference, Toulouse, France, August 31, 1999
1998
Proceedings of the SIGMETRICS Symposium on Parallel and Distributed Tools, 1998
Proceedings of the SIGMETRICS Symposium on Parallel and Distributed Tools, 1998
Proceedings of the ACM/IEEE Conference on Supercomputing, 1998
Proceedings of the ACM/IEEE Conference on Supercomputing, 1998
Proceedings of the Applied Parallel Computing, 1998
Proceedings of the Seventh IEEE International Symposium on High Performance Distributed Computing, 1998
Proceedings of the Euro-Par '98 Parallel Processing, 1998
1997
Building Domain-Specific Environments for Computational Science: a Case Study in Seismic Tomography.
Int. J. High Perform. Comput. Appl., 1997
1996
Proceedings of the Euro-Par '96 Parallel Processing, 1996
1995
IEEE Parallel Distributed Technol. Syst. Appl., 1995
IEEE Computer Graphics and Applications, 1995
Proceedings of the 1995 ACM SIGMETRICS joint international conference on Measurement and modeling of computer systems, 1995
Proceedings of the Quantitative Evaluation of Computing and Communication Systems, 1995
Performance-Oriented Development of Irregular, Unstructured and Unbalanced Parallel Applications in the N-MAP Environment.
Proceedings of the Quantitative Evaluation of Computing and Communication Systems, 1995
Proceedings of IPPS '95, 1995
Performance Extrapolation of Parallel Programs.
Proceedings of the 1995 International Conference on Parallel Processing, 1995
1994
Next-Generation Parallel Performance Visualization: A Prototyping Environment for Visualization Development.
Proceedings of the PARLE '94: Parallel Architectures and Languages Europe, 1994
Performance Analysis of pC++: A Portable Data-Parallel Programming System for Scalable Parallel Computers.
Proceedings of the 8th International Symposium on Parallel Processing, 1994
Proceedings of the Proceedings 1994 International Conference on Parallel and Distributed Systems, 1994
Proceedings of the Computer Performance Evaluation, 1994
Proceedings of the Parallel Processing: CONPAR 94, 1994
1993
Proceedings of the Proceedings Supercomputing '93, 1993
Proceedings of the Proceedings Supercomputing '93, 1993
Proceedings of the Proceedings Supercomputing '93, 1993
Proceedings of the Fourth ACM SIGPLAN Symposium on Principles & Practice of Parallel Programming (PPOPP), 1993
1992
IEEE Trans. Parallel Distributed Syst., 1992
Proceedings of the Proceedings Supercomputing '92, 1992
1991
J. Supercomput., 1991
Int. J. High Speed Comput., 1991
Proceedings of the Third ACM SIGPLAN Symposium on Principles & Practice of Parallel Programming (PPOPP), 1991
Proceedings of the ACM/ONR Workshop on Parallel and Distributed Debugging, 1991
Proceedings of the Parallel Computation, First International ACPC Conference, Salzburg, Austria, September 30, 1991
1990
Experimentally Characterizing the Behavior of Multiprocessor Memory Systems. A Case Study.
IEEE Trans. Software Eng., 1990
Proceedings of the Proceedings Supercomputing '90, New York, NY, USA, November 12-16, 1990, 1990
Proceedings of the 4th international conference on Supercomputing, 1990
1989
Proceedings of the 1989 ACM SIGMETRICS international conference on Measurement and modeling of computer systems, 1989
Performance prediction of loop constructs on multiprocessor hierarchical-memory systems.
Proceedings of the 3rd international conference on Supercomputing, 1989
1988
IEEE Trans. Software Eng., 1988
1987
Proceedings of the 1987 ACM SIGMETRICS conference on Measurement and modeling of computer systems, 1987
MPF: A Portable Message Passing Facility for Shared Memory Multiprocessors.
Proceedings of the International Conference on Parallel Processing, 1987
1986
Vector Processing on the Alliant FX/8 Multiprocessor.
Proceedings of the International Conference on Parallel Processing, 1986