Paul D. Hovland

Orcid: 0000-0002-0907-2567

According to our database, Paul D. Hovland authored at least 83 papers between 1992 and 2025.

MITgcm-AD v2: Open source tangent linear and adjoint modeling framework for the oceans and atmosphere enabled by the Automatic Differentiation tool Tapenade.
Future Gener. Comput. Syst., 2025

A taxonomy of automatic differentiation pitfalls.
WIREs Data. Mining. Knowl. Discov., 2024

QuCLEAR: Clifford Extraction and Absorption for Significant Reduction in Quantum Circuit Size.
CoRR, 2024

Differentiating Through Linear Solvers.
CoRR, 2024

Integrating ytopt and libEnsemble to Autotune OpenMC.
CoRR, 2024

QuTracer: Mitigating Quantum Gate and Measurement Errors by Tracing Subsets of Qubits.
Proceedings of the 51st ACM/IEEE Annual International Symposium on Computer Architecture, 2024

Understanding Automatic Differentiation Pitfalls.
CoRR, 2023

ytopt: Autotuning Scientific Applications for Energy Efficiency at Large Scales.
CoRR, 2023

Tackling the Qubit Mapping Problem with Permutation-Aware Synthesis.
Proceedings of the IEEE International Conference on Quantum Computing and Engineering, 2023

QContext: Context-Aware Decomposition for Quantum Gates.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2023

Transfer-learning-based Autotuning using Gaussian Copula.
Proceedings of the 37th International Conference on Supercomputing, 2023

Accelerating VQE Algorithm via Parameters and Measurement Reuse.
Proceedings of the IEEE International Conference on Rebooting Computing, 2023

Enhancing Virtual Distillation with Circuit Cutting for Quantum Error Mitigation.
Proceedings of the 41st IEEE International Conference on Computer Design, 2023

Model Checking Race-Freedom When "Sequential Consistency for Data-Race-Free Programs" is Guaranteed.
Proceedings of the Computer Aided Verification - 35th International Conference, 2023

Autotuning PolyBench benchmarks with LLVM Clang/Polly loop optimization pragmas using Bayesian optimization.
Concurr. Comput. Pract. Exp., 2022

Verifying Fortran Programs with CIVL.
Proceedings of the Tools and Algorithms for the Construction and Analysis of Systems, 2022

Improving PyDDA's atmospheric wind retrievals using automatic differentiation and Augmented Lagrangian methods.
Proceedings of the 21st Python in Science Conference 2022, 2022

Scalable Automatic Differentiation of Multiple Parallel Paradigms through Compiler Augmentation.
Proceedings of the SC22: International Conference for High Performance Computing, 2022

Reducing Memory Requirements of Quantum Optimal Control.
Proceedings of the Computational Science - ICCS 2022, 2022

TROPHY: Trust Region Optimization Using a Precision Hierarchy.
Proceedings of the Computational Science - ICCS 2022, 2022

Autotuning PolyBench Benchmarks with LLVM Clang/Polly Loop Optimization Pragmas Using Bayesian Optimization (extended version).
CoRR, 2021

Customized Monte Carlo Tree Search for LLVM/Polly's Composable Loop Optimization Transformations.
Proceedings of the 2021 International Workshop on Performance Modeling, 2021

Error Estimation and Correction Using the Forward CENA Method.
Proceedings of the Computational Science - ICCS 2021, 2021

Vector Forward Mode Automatic Differentiation on SIMD/SIMT architectures.
Proceedings of the ICPP 2020: 49th International Conference on Parallel Processing, 2020

Reverse-mode algorithmic differentiation of an OpenMP-parallel compressible flow solver.
Int. J. High Perform. Comput. Appl., 2019

Computing Derivatives for PETSc Adjoint Solvers using Algorithmic Differentiation.
CoRR, 2019

Training on the Edge: The why and the how.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium Workshops, 2019

Automatic Differentiation for Adjoint Stencil Loops.
Proceedings of the 48th International Conference on Parallel Processing, 2019

Combining Checkpointing and Data Compression to Accelerate Adjoint-Based Optimization Problems.
Proceedings of the Euro-Par 2019: Parallel Processing, 2019

Parallelizable adjoint stencil computations using transposed forward-mode algorithmic differentiation.
Optim. Methods Softw., 2018

Combining checkpointing and data compression for large scale seismic inversion.
CoRR, 2018

Verifying Properties of Differentiable Programs.
Proceedings of the Static Analysis - 25th International Symposium, 2018

Vectorised Computation of Diverging Ensembles.
Proceedings of the 47th International Conference on Parallel Processing, 2018

Report of the HPC Correctness Summit, Jan 25-26, 2017, Washington, DC.
CoRR, 2017

Towards Self-Verification in Finite Difference Code Generation.
Proceedings of the First International Workshop on Software Correctness for HPC Applications, 2017

Optimal Multistage Algorithm for Adjoint Computation.
SIAM J. Sci. Comput., 2016

AutoMOMML: Automatic Multi-objective Modeling with Machine Learning.
Proceedings of the High Performance Computing - 31st International Conference, 2016

Edge Pushing is Equivalent to Vertex Elimination for Computing Hessians.
Proceedings of the Seventh SIAM Workshop on Combinatorial Scientific Computing, 2016

Exploiting Performance Portability in Search Algorithms for Autotuning.
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium Workshops, 2016

Generating Efficient Tensor Contractions for GPUs.
Proceedings of the 44th International Conference on Parallel Processing, 2015

Autotuning FPGA Design Parameters for Performance and Power.
Proceedings of the 23rd IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2015

Collective I/O Tuning Using Analytical and Machine Learning Models.
Proceedings of the 2015 IEEE International Conference on Cluster Computing, 2015

Analysis of the Tradeoffs Between Energy and Run Time for Multilevel Checkpointing.
Proceedings of the High Performance Computing Systems. Performance Modeling, Benchmarking, and Simulation, 2014

Energy-performance tradeoffs in multilevel checkpoint strategies.
Proceedings of the 2014 IEEE International Conference on Cluster Computing, 2014

Software Abstractions and Methodologies for HPC Simulation Codes on Future Architectures.
CoRR, 2013

Empirical performance modeling of GPU kernels using active learning.
Proceedings of the Parallel Computing: Accelerating Computational Science and Engineering (CSE), 2013

Exascale workload characterization and architecture implications.
Proceedings of the 2012 IEEE International Symposium on Performance Analysis of Systems & Software, 2013

An Experimental Study of Global and Local Search Algorithms in Empirical Performance Tuning.
Proceedings of the High Performance Computing for Computational Science, 2012

Poster: An Exascale Workload Study.
Proceedings of the 2012 SC Companion: High Performance Computing, 2012

Abstract: An Exascale Workload Study.
Proceedings of the 2012 SC Companion: High Performance Computing, 2012

Sparse Jacobian Computation Using ADIC2 and ColPack.
Proceedings of the International Conference on Computational Science, 2011

Can search algorithms save large-scale automatic performance tuning?
Proceedings of the International Conference on Computational Science, 2011

Speeding up Nek5000 with autotuning and specialization.
Proceedings of the 24th International Conference on Supercomputing, 2010

Generating Performance Bounds from Source Code.
Proceedings of the 39th International Conference on Parallel Processing, 2010

Autotuning and Specialization: Speeding up Matrix Multiply for Small Matrices with Compiler Technology.
Proceedings of the Software Automatic Tuning, From Concepts to State-of-the-Art Results, 2010

Automatic Differentiation: Parallel Computation.
Proceedings of the Encyclopedia of Optimization, Second Edition, 2009

Toward adjoinable MPI.
Proceedings of the 23rd IEEE International Symposium on Parallel and Distributed Processing, 2009

Evaluation of Hierarchical Mesh Reorderings.
Proceedings of the Computational Science, 2009

Improving Random Walk Performance.
Proceedings of the 2009 International Conference on Scientific Computing, 2009

On the implementation of automatic differentiation tools.
High. Order Symb. Comput., 2008

A Framework for Proving Correctness of Adjoint Message-Passing Programs.
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2008

Improving the Performance of Graph Coloring Algorithms through Backtracking.
Proceedings of the Computational Science, 2008

Comparison of two activity analyses for automatic differentiation: context-sensitive flow-insensitive vs. context-insensitive flow-sensitive.
Proceedings of the 2007 ACM Symposium on Applied Computing (SAC), 2007

Term Graphs for Computing Derivatives in Imperative Languages.
Proceedings of the Third International Workshop on Term Graph Rewriting, 2006

Data-Flow Analysis for MPI Programs.
Proceedings of the 2006 International Conference on Parallel Processing (ICPP 2006), 2006

Linearity Analysis for Automatic Differentiation.
Proceedings of the Computational Science, 2006

Hybrid Static/Dynamic Activity Analysis.
Proceedings of the Computational Science, 2006

Making automatic differentiation truly automatic: coupling PETSc with ADIC.
Future Gener. Comput. Syst., 2005

Representation-independent program analysis.
Proceedings of the 2005 ACM SIGPLAN-SIGSOFT Workshop on Program Analysis For Software Tools and Engineering, 2005

Metrics and models for reordering transformations.
Proceedings of the 2004 workshop on Memory System Performance, 2004

Parallel components for PDEs and optimization: some issues and experiences.
Parallel Comput., 2002

Parallel simulation of compressible flow using automatic differentiation and PETSc.
Parallel Comput., 2001

A Distributed Application Server for Automatic Differentiation.
Proceedings of the 15th International Parallel & Distributed Processing Symposium (IPDPS-01), 2001

On Combining Computational Differentiation and Toolkits for Parallel Scientific Computing.
Proceedings of the Euro-Par 2000, Parallel Processing, 6th International Euro-Par Conference, Munich, Germany, August 29, 2000

Solving Nonlinear PDEs Using PETSc and Automatic Differentiation.
Proceedings of the Ninth SIAM Conference on Parallel Processing for Scientific Computing, 1999

Infrastructure and Interfaces for Large-Scale Numerical Software.
Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications, 1999

Automatic Differentiation for Message-Passing Parallel Programs.
Proceedings of the 12th International Parallel Processing Symposium / 9th Symposium on Parallel and Distributed Processing (IPPS/SPDP '98), March 30, 1998

Automatic Differentiation of Parallel Programs.
PhD thesis, 1997

Efficient Derivative Codes through Automatic Differentiation and Interface Contraction: An Application in Biostatistics.
SIAM J. Sci. Comput., 1997

Automatic Differentiation of a Parallel Molecular Dynamics Application.
Proceedings of the Eighth SIAM Conference on Parallel Processing for Scientific Computing, 1997

A Model for Automatic Data Partitioning.
Proceedings of the 1993 International Conference on Parallel Processing, 1993

Further Research on Feature Selection and Classification Using Genetic Algorithms.
Proceedings of the 5th International Conference on Genetic Algorithms, 1993

ADIFOR - Generating Derivative Codes from Fortran Programs.
Sci. Program., 1992
