David E. Keyes
Orcid: 0000-0002-4052-7224Affiliations:
- King Abdullah University of Science and Technology, Thuwal, Saudia Arabia
- Columbia University, New York, NY, USA
- Old Dominion University, Norfolk, VA, USA
- Yale Unviersity, New Haven, CT, USA
According to our database1,
David E. Keyes
authored at least 201 papers
between 1985 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on zbmath.org
-
on kaust.edu.sa
-
on orcid.org
-
on id.loc.gov
-
on d-nb.info
On csauthors.net:
Bibliography
2024
J. Comput. Phys., January, 2024
High performance computing seismic redatuming by inversion with algebraic compression and multiple precisions.
Int. J. High Perform. Comput. Appl., 2024
Portability and scalability evaluation of large-scale statistical modeling and prediction software through HPC-ready containers.
Future Gener. Comput. Syst., 2024
Accelerating Mixed-Precision Out-of-Core Cholesky Factorization with Static Task Scheduling.
CoRR, 2024
Toward Capturing Genetic Epistasis From Multivariate Genome-Wide Association Studies Using Mixed-Precision Kernel Ridge Regression.
CoRR, 2024
Constructing artificial life and materials scientists with accelerated AI using Deep AndersoNN.
CoRR, 2024
GPU-Accelerated Vecchia Approximations of Gaussian Processes for Geospatial Data using Batched Matrix Computations.
Proceedings of the ISC High Performance 2024 Research Paper Proceedings (39th International Conference), 2024
Proceedings of the 2024 SIAM Conference on Parallel Processing for Scientific Computing, 2024
PETScML: Second-Order Solvers for Training Regression Problems in Scientific Machine Learning.
Proceedings of the Platform for Advanced Scientific Computing Conference, 2024
Leveraging the High Bandwidth of Last-Level Cache for HPC Seismic Imaging Applications.
Proceedings of the Platform for Advanced Scientific Computing Conference, 2024
Parallel Approximations for High-Dimensional Multivariate Normal Probability Computation in Confidence Region Detection Applications.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2024
2023
Int. J. High Perform. Comput. Appl., March, 2023
IEEE Trans. Parallel Distributed Syst., February, 2023
Author Correction: The high-dimensional space of human diseases built from diagnosis records and mapped to genetic loci.
Nat. Comput. Sci., 2023
The high-dimensional space of human diseases built from diagnosis records and mapped to genetic loci.
Nat. Comput. Sci., 2023
Tile low-rank approximations of non-Gaussian space and space-time Tukey <i>g</i>-and-<i>h</i> random field likelihoods and predictions on large-scale systems.
J. Parallel Distributed Comput., 2023
A scheduling policy to save 10% of communication time in parallel fast Fourier transform.
Concurr. Comput. Pract. Exp., 2023
Proceedings of the High Performance Computing - 38th International Conference, 2023
Proceedings of the High Performance Computing - 38th International Conference, 2023
Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023
Scaling the "Memory Wall" for Multi-Dimensional Seismic Processing with Algebraic Compression on Cerebras CS-2 Systems.
Proceedings of the International Conference for High Performance Computing, 2023
Proceedings of the International Conference for High Performance Computing, 2023
Efficient GPU-based Large MIMO Detection Algorithm for Next-Generation Communication Systems.
Proceedings of the IEEE Global Communications Conference, 2023
Reducing Data Motion and Energy Consumption of Geospatial Modeling Applications Using Automated Precision Conversion.
Proceedings of the IEEE International Conference on Cluster Computing, 2023
2022
Accelerating Geostatistical Modeling and Prediction With Mixed-Precision Computations: A High-Productivity Approach With PaRSEC.
IEEE Trans. Parallel Distributed Syst., 2022
High-performance 3D Unstructured Mesh Deformation Using Rank Structured Matrix Computations.
ACM Trans. Parallel Comput., 2022
SIAM J. Sci. Comput., 2022
tlrmvnmvt: Computing High-Dimensional Multivariate Normal and Student- t Probabilities with Low-Rank Methods in R.
J. Stat. Softw., 2022
Int. J. High Perform. Comput. Appl., 2022
Comput. Sci. Eng., 2022
Optimized explicit Runge-Kutta schemes for high-order collocated discontinuous Galerkin methods for compressible fluid dynamics.
Comput. Math. Appl., 2022
Adv. Comput. Math., 2022
Reshaping Geostatistical Modeling and Prediction for Extreme-Scale Environmental Applications.
Proceedings of the SC22: International Conference for High Performance Computing, 2022
Parallel space-time likelihood optimization for air pollution prediction on large-scale systems.
Proceedings of the PASC '22: Platform for Advanced Scientific Computing Conference, Basel, Switzerland, June 27, 2022
Parallel Approximations of the Tukey g-and-h Likelihoods and Predictions for Non-Gaussian Geostatistics.
Proceedings of the 2022 IEEE International Parallel and Distributed Processing Symposium, 2022
Proceedings of the 2022 IEEE International Parallel and Distributed Processing Symposium, 2022
Proceedings of the Euro-Par 2022: Parallel Processing, 2022
2021
IEEE Trans. Parallel Distributed Syst., 2021
Accelerating Seismic Redatuming Using Tile Low-Rank Approximations on NEC SX-Aurora TSUBASA.
Supercomput. Front. Innov., 2021
Nonlinear Preconditioning Strategies for Two-Phase Flows in Porous Media Discretized by a Fully Implicit Discontinuous Galerkin Method.
SIAM J. Sci. Comput., 2021
SIAM J. Sci. Comput., 2021
Exploiting low-rank covariance structures for computing high-dimensional normal and Student-t probabilities.
Stat. Comput., 2021
On the robustness and performance of entropy stable collocated discontinuous Galerkin methods.
J. Comput. Phys., 2021
Sum of Kronecker products representation and its Cholesky factorization for spatial covariance matrices from large grids.
Comput. Stat. Data Anal., 2021
Space-Fractional Diffusion with Variable Order and Diffusivity: Discretization and Direct Solution Strategies.
CoRR, 2021
H2OPUS-TLR: High Performance Tile Low Rank Symmetric Factorizations using Adaptive Randomized Approximation.
CoRR, 2021
High-Performance Partial Spectrum Computation for Symmetric eigenvalue problems and the SVD.
CoRR, 2021
An O(N) algorithm for computing expectation of N-dimensional truncated multi-variate normal distribution I: fundamentals.
Adv. Comput. Math., 2021
Meeting the real-time challenges of ground-based telescopes using low-rank matrix computations.
Proceedings of the International Conference for High Performance Computing, 2021
Leveraging PaRSEC Runtime Support to Tackle Challenging 3D Data-Sparse Matrix Problems.
Proceedings of the 35th IEEE International Parallel and Distributed Processing Symposium, 2021
Implications of Reduced Communication Precision in a Collocated Discontinuous Galerkin Finite Element Framework.
Proceedings of the 2021 IEEE High Performance Extreme Computing Conference, 2021
Outsmarting the Atmospheric Turbulence for Ground-Based Telescopes Using the Stochastic Levenberg-Marquardt Method.
Proceedings of the Euro-Par 2021: Parallel Processing, 2021
2020
IEEE Trans. Parallel Distributed Syst., 2020
A Multilayer Nonlinear Elimination Preconditioned Inexact Newton Method for Steady-State Incompressible Flow Problems in Three Dimensions.
SIAM J. Sci. Comput., 2020
Hierarchical Matrix Approximations of Hessians Arising in Inverse Problems Governed by PDEs.
SIAM J. Sci. Comput., 2020
Fully implicit hybrid two-level domain decomposition algorithms for two-phase flows in porous media on 3D unstructured grids.
J. Comput. Phys., 2020
Int. J. High Perform. Comput. Appl., 2020
Energy-conserving 3D elastic wave simulation with finite difference discretization on staggered grids with nonconforming interfaces.
CoRR, 2020
High Performance Multivariate Spatial Modeling for Geostatistical Data on Manycore Systems.
CoRR, 2020
Explicit coupling of acoustic and elastic wave propagation in finite difference simulations.
CoRR, 2020
Hierarchical Matrix Approximations of Hessians Arising in Inverse Problems Governed by PDEs.
CoRR, 2020
Performance / Complexity Trade-offs of the Sphere Decoder Algorithm for Massive MIMO Systems.
CoRR, 2020
Performance study of sustained petascale direct numerical simulation on Cray XC40 systems.
Concurr. Comput. Pract. Exp., 2020
Solving Acoustic Boundary Integral Equations Using High Performance Tile Low-Rank LU Factorization.
Proceedings of the High Performance Computing - 35th International Conference, 2020
Extreme-Scale Task-Based Cholesky Factorization Toward Climate and Weather Prediction Applications.
Proceedings of the PASC '20: Platform for Advanced Scientific Computing Conference, Geneva, Switzerland, June 29, 2020
Maximizing I/O Bandwidth for Reverse Time Migration on Heterogeneous Large-Scale Systems.
Proceedings of the Euro-Par 2020: Parallel Processing, 2020
2019
ACM Trans. Parallel Comput., 2019
ACM Trans. Math. Softw., 2019
ACM Trans. Math. Softw., 2019
Hierarchical Matrix Operations on GPUs: Matrix-Vector Multiplication and Compression.
ACM Trans. Math. Softw., 2019
Randomized GPU Algorithms for the Construction of Hierarchical Matrices from Matrix-Vector Operations.
SIAM J. Sci. Comput., 2019
SIAM J. Sci. Comput., 2019
Hierarchical-block conditioning approximations for high-dimensional multivariate normal probabilities.
Stat. Comput., 2019
J. Parallel Distributed Comput., 2019
J. Open Source Softw., 2019
Combining finite element and finite difference methods for isotropic elastic wave simulations in an energy-conserving manner.
J. Comput. Phys., 2019
SBP-SAT finite difference discretization of acoustic wave equations on staggered block-wise uniform grids.
J. Comput. Appl. Math., 2019
Comput. Stat. Data Anal., 2019
On the robustness and performance of entropy stable discontinuous collocation methods for the compressible Navie-Stokes equations.
CoRR, 2019
Solution of the 3D density-driven groundwater flow problem with uncertain porosity and permeability.
CoRR, 2019
Comput. Methods Appl. Math., 2019
Proceedings of the 9th IEEE/ACM Workshop on Irregular Applications: Architectures and Algorithms, 2019
Performance Analysis of Tile Low-Rank Cholesky Factorization Using PaRSEC Instrumentation Tools.
Proceedings of the IEEE/ACM International Workshop on Programming and Performance Visualization Tools, 2019
MLBS: Transparent Data Caching in Hierarchical Storage for Out-of-Core HPC Applications.
Proceedings of the 26th IEEE International Conference on High Performance Computing, 2019
Geostatistical Modeling and Prediction Using Mixed Precision Tile Cholesky Factorization.
Proceedings of the 26th IEEE International Conference on High Performance Computing, 2019
Leveraging Task-Based Polar Decomposition Using PARSEC on Massively Parallel Systems.
Proceedings of the 2019 IEEE International Conference on Cluster Computing, 2019
Asynchronous Task-Based Execution of the Reverse Time Migration for the Oil and Gas Industry.
Proceedings of the 2019 IEEE International Conference on Cluster Computing, 2019
2018
IEEE Trans. Parallel Distributed Syst., 2018
IEEE Trans. Parallel Distributed Syst., 2018
ExaGeoStat: A High Performance Unified Software for Geostatistics on Manycore Systems.
IEEE Trans. Parallel Distributed Syst., 2018
ACM Trans. Parallel Comput., 2018
Accelerated Cyclic Reduction: A distributed-memory fast solver for structured linear systems.
Parallel Comput., 2018
Batched QR and SVD algorithms on GPUs with applications in hierarchical matrix compression.
Parallel Comput., 2018
Parallel accelerated cyclic reduction preconditioner for three-dimensional elliptic PDEs with variable coefficients.
J. Comput. Appl. Math., 2018
Tile Low-Rank Approximation of Large-Scale Maximum Likelihood Estimation on Manycore Architectures.
CoRR, 2018
Comput. Vis. Sci., 2018
Extreme Computing for Extreme Adaptive Optics: The Key to Finding Life Outside our Solar System.
Proceedings of the Platform for Advanced Scientific Computing Conference, 2018
Real-Time Massively Distributed Multi-object Adaptive Optics Simulations for the European Extremely Large Telescope.
Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium, 2018
Performance Assessment of Hybrid Parallelism for Large-Scale Reservoir Simulation on Multi- and Many-core Architectures.
Proceedings of the 2018 IEEE High Performance Extreme Computing Conference, 2018
Proceedings of the Euro-Par 2018: Parallel Processing, 2018
Proceedings of the Euro-Par 2018: Parallel Processing, 2018
Parallel Approximation of the Maximum Likelihood Estimation for the Prediction of Large-Scale Geostatistics Simulations.
Proceedings of the IEEE International Conference on Cluster Computing, 2018
2017
A scalable community detection algorithm for large graphs using stochastic block models.
Intell. Data Anal., 2017
ExaGeoStat: A High Performance Unified Framework for Geostatistics on Manycore Systems.
CoRR, 2017
Concurr. Comput. Pract. Exp., 2017
Tile Low Rank Cholesky Factorization for Climate/Weather Modeling Applications on Manycore Architectures.
Proceedings of the High Performance Computing - 32nd International Conference, 2017
Communication Reducing Algorithms for Distributed Hierarchical N-Body Problems with Boundary Distributions.
Proceedings of the High Performance Computing - 32nd International Conference, 2017
Proceedings of the Platform for Advanced Scientific Computing Conference, 2017
Performance Evaluation of Computation and Communication Kernels of the Fast Multipole Method on Intel Manycore Architecture.
Proceedings of the Euro-Par 2017: Parallel Processing - 23rd International Conference on Parallel and Distributed Computing, Santiago de Compostela, Spain, August 28, 2017
2016
ACM Trans. Math. Softw., 2016
KBLAS: An Optimized Library for Dense Matrix-Vector Multiplication on GPU Accelerators.
ACM Trans. Math. Softw., 2016
Convergence Analysis for the Multiplicative Schwarz Preconditioned Inexact Newton Algorithm.
SIAM J. Numer. Anal., 2016
Parallel Comput., 2016
A performance model for the communication in fast multipole methods on high-performance computing platforms.
Int. J. High Perform. Comput. Appl., 2016
CoRR, 2016
A Matrix-free Preconditioner for the Helmholtz Equation based on the Fast Multipole Method.
CoRR, 2016
CoRR, 2016
Performance optimization of Sparse Matrix-Vector Multiplication for multi-component PDE-based applications using GPUs.
Concurr. Comput. Pract. Exp., 2016
Proceedings of the High Performance Computing - 31st International Conference, 2016
On the Robustness and Prospects of Adaptive BDDC Methods for Finite Element Discretizations of Elliptic PDEs with High-Contrast Coefficients.
Proceedings of the Platform for Advanced Scientific Computing Conference, 2016
Optimization of an Electromagnetics Code with Multicore Wavefront Diamond Blocking and Multi-dimensional Intra-Tile Parallelization.
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium, 2016
Proceedings of the International Conference on Computational Science 2016, 2016
Proceedings of the Euro-Par 2016: Parallel Processing, 2016
Proceedings of the Euro-Par 2016: Parallel Processing, 2016
2015
Supercomput. Front. Innov., 2015
SIAM J. Sci. Comput., 2015
A parallel domain decomposition-based implicit method for the Cahn-Hilliard-Cook phase-field equation in 3D.
J. Comput. Phys., 2015
Smooth and robust solutions for Dirichlet boundary control of fluid-solid conjugate heat transfer problems.
J. Comput. Phys., 2015
Multi-dimensional intra-tile parallelization for memory-starved stencil computations.
CoRR, 2015
Design and Optimization of OpenFOAM-based CFD Applications for Hybrid and Heterogeneous HPC Platforms.
CoRR, 2015
Exploring Shared-Memory Optimizations for an Unstructured Mesh CFD Application on Modern Parallel Systems.
Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium, 2015
Proceedings of the 29th ACM on International Conference on Supercomputing, 2015
Proceedings of the Euro-Par 2015: Parallel Processing, 2015
2014
Supercomput. Front. Innov., 2014
Towards energy efficiency and maximum computational intensity for stencil algorithms using wavefront diamond temporal blocking.
CoRR, 2014
A Performance Model for the Communication in Fast Multipole Methods on HPC Platforms.
CoRR, 2014
Pipelining Computational Stages of the Tomographic Reconstructor for Multi-Object Adaptive Optics on a Multi-GPU System.
Proceedings of the International Conference for High Performance Computing, 2014
High Performance Pseudo-analytical Simulation of Multi-Object Adaptive Optics over Multi-GPU Systems.
Proceedings of the Euro-Par 2014 Parallel Processing, 2014
2013
it Inf. Technol., 2013
Optimizing the performance of streaming numerical kernels on the IBM Blue Gene/P PowerPC 450 processor.
Int. J. High Perform. Comput. Appl., 2013
Int. J. High Perform. Comput. Appl., 2013
Topic 14+16: High-Performance and Scientific Applications and Extreme-Scale Computing - (Introduction).
Proceedings of the Euro-Par 2013 Parallel Processing, 2013
2012
A Quasi-algebraic Multigrid Approach to Fracture Problems Based on Extended Finite Elements.
SIAM J. Sci. Comput., 2012
Numerical simulation of four-field extended magnetohydrodynamics in dynamically adaptive curvilinear coordinates via Newton-Krylov-Schwarz.
J. Comput. Phys., 2012
Proceedings of the High Performance Computing for Computational Science, 2012
Proceedings of the 12th IEEE International Conference on Data Mining, 2012
Proceedings of the Euro-Par 2012: Parallel Processing Workshops, 2012
2011
Int. J. High Perform. Comput. Appl., 2011
Comput. Phys. Commun., 2011
Proceedings of the OpenMP in the Petascale Era - 7th International Workshop on OpenMP, 2011
2010
Proceedings of the Software Automatic Tuning, From Concepts to State-of-the-Art Results, 2010
2009
J. Comput. Chem., 2009
Int. J. High Perform. Comput. Appl., 2009
2008
2007
Additive Schwarz-based fully coupled implicit methods for resistive Hall magnetohydrodynamic problems.
J. Comput. Phys., 2007
Reconstructing parameters of the FitzHugh-Nagumo system from boundary potential measurements.
J. Comput. Neurosci., 2007
2006
Proceedings of the ACM/IEEE SC2006 Conference on High Performance Networking and Computing, 2006
M06 - Issues for the future of supercomputing: impact of Moore's law and architecture on application performance.
Proceedings of the ACM/IEEE SC2006 Conference on High Performance Networking and Computing, 2006
Proceedings of the Grid-Based Problem Solving Environments, 2006
Proceedings of the Parallel Processing for Scientific Computing, 2006
2004
Proceedings of the Euro-Par 2004 Parallel Processing, 2004
2003
SIAM J. Sci. Comput., 2003
2002
2001
2000
Int. J. High Perform. Comput. Appl., 2000
Proceedings of the Proceedings Supercomputing 2000, 2000
Proceedings of the High Performance Computing, 2000
Four Horizons for Enhancing the Performance of Parallel Simulations Based on Partial Differential Equations.
Proceedings of the Euro-Par 2000, Parallel Processing, 6th International Euro-Par Conference, Munich, Germany, August 29, 2000
1999
Parallel Distributed Comput. Pract., 1999
Parallel Distributed Comput. Pract., 1999
Proceedings of the ACM/IEEE Conference on Supercomputing, 1999
Parallelization of an Object-Oriented Unstructured Aeroacoustics Solver.
Proceedings of the Ninth SIAM Conference on Parallel Processing for Scientific Computing, 1999
1998
SIAM J. Sci. Comput., 1998
1997
Proceedings of the Conference on Parallel Computational Fluid Dynamics 1997, 1997
1996
J. Parallel Distributed Comput., 1996
Proceedings of the Euro-Par '96 Parallel Processing, 1996
1995
Modeling Communication in Cluster Computing.
Proceedings of the Seventh SIAM Conference on Parallel Processing for Scientific Computing, 1995
1994
SIAM J. Sci. Comput., 1994
A comparison of some domain decomposition and ILU preconditioned iterative methods for nonsymmetric elliptic problems.
Numer. Linear Algebra Appl., 1994
1992
Parallel Performance of Domain-Decomposed Preconditioned Krylov Methods for PDEs with Locally Uniform Refinement.
SIAM J. Sci. Comput., 1992
1989
Balanced Divide-and-Conquer Algorithms for the Fine-Grained Parallel Direct Solution of Dense and Banded Triangular Linear Systems and their Connection Machine Implementation.
Proceedings of the Fourth SIAM Conference on Parallel Processing for Scientific Computing, 1989
Parallel Domain Decomposition with Local Mesh Refinement.
Proceedings of the Fourth SIAM Conference on Parallel Processing for Scientific Computing, 1989
1987
Analysis of a Parallized Elliptic Solver for Reacting Flows-Abstract.
Proceedings of the Third SIAM Conference on Parallel Processing for Scientific Computing, 1987
1985
A comparison of domain decomposition techniques for elliptic partial differential equations and their parallel implementation.
Proceedings of the Selected Papers from the Second Conference on Parallel Processing for Scientific Computing, 1985