David E. Keyes

ORCID: 0000-0002-4052-7224

Affiliations:
  • King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
  • Columbia University, New York, NY, USA
  • Old Dominion University, Norfolk, VA, USA
  • Yale University, New Haven, CT, USA


According to our database, David E. Keyes authored at least 201 papers between 1985 and 2024.

Bibliography

2024
Overlapping multiplicative Schwarz preconditioning for linear and nonlinear systems.
J. Comput. Phys., January, 2024

High performance computing seismic redatuming by inversion with algebraic compression and multiple precisions.
Int. J. High Perform. Comput. Appl., 2024

Portability and scalability evaluation of large-scale statistical modeling and prediction software through HPC-ready containers.
Future Gener. Comput. Syst., 2024

Accelerating AI Performance using Anderson Extrapolation on GPUs.
CoRR, 2024

Accelerating Mixed-Precision Out-of-Core Cholesky Factorization with Static Task Scheduling.
CoRR, 2024

Toward Capturing Genetic Epistasis From Multivariate Genome-Wide Association Studies Using Mixed-Precision Kernel Ridge Regression.
CoRR, 2024

Constructing artificial life and materials scientists with accelerated AI using Deep AndersoNN.
CoRR, 2024

Multidimensional deconvolution with shared bases.
CoRR, 2024

GPU-Accelerated Vecchia Approximations of Gaussian Processes for Geospatial Data using Batched Matrix Computations.
Proceedings of the ISC High Performance 2024 Research Paper Proceedings (39th International Conference), 2024

Cholesky Factorization of Tile Low Rank Matrices on GPUs.
Proceedings of the 2024 SIAM Conference on Parallel Processing for Scientific Computing, 2024

PETScML: Second-Order Solvers for Training Regression Problems in Scientific Machine Learning.
Proceedings of the Platform for Advanced Scientific Computing Conference, 2024

Leveraging the High Bandwidth of Last-Level Cache for HPC Seismic Imaging Applications.
Proceedings of the Platform for Advanced Scientific Computing Conference, 2024

Parallel Approximations for High-Dimensional Multivariate Normal Probability Computation in Confidence Region Detection Applications.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2024

2023
shmem4py: OpenSHMEM for Python.
Dataset, July, 2023

Exploiting temporal data reuse and asynchrony in the reverse time migration.
Int. J. High Perform. Comput. Appl., March, 2023

mpi4py.futures: MPI-Based Asynchronous Task Execution for Python.
IEEE Trans. Parallel Distributed Syst., February, 2023

Author Correction: The high-dimensional space of human diseases built from diagnosis records and mapped to genetic loci.
Nat. Comput. Sci., 2023

The high-dimensional space of human diseases built from diagnosis records and mapped to genetic loci.
Nat. Comput. Sci., 2023

Tile low-rank approximations of non-Gaussian space and space-time Tukey g-and-h random field likelihoods and predictions on large-scale systems.
J. Parallel Distributed Comput., 2023

shmem4py: OpenSHMEM for Python.
J. Open Source Softw., 2023

A scheduling policy to save 10% of communication time in parallel fast Fourier transform.
Concurr. Comput. Pract. Exp., 2023

Steering Customized AI Architectures for HPC Scientific Applications.
Proceedings of the High Performance Computing - 38th International Conference, 2023

GPU-Based Low-Precision Detection Approach for Massive MIMO Systems.
Proceedings of the High Performance Computing - 38th International Conference, 2023

shmem4py: High-Performance One-Sided Communication for Python Applications.
Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023

Scaling the "Memory Wall" for Multi-Dimensional Seismic Processing with Algebraic Compression on Cerebras CS-2 Systems.
Proceedings of the International Conference for High Performance Computing, 2023

High-Performance SVD Partial Spectrum Computation.
Proceedings of the International Conference for High Performance Computing, 2023

Efficient GPU-based Large MIMO Detection Algorithm for Next-Generation Communication Systems.
Proceedings of the IEEE Global Communications Conference, 2023

Reducing Data Motion and Energy Consumption of Geospatial Modeling Applications Using Automated Precision Conversion.
Proceedings of the IEEE International Conference on Cluster Computing, 2023

2022
Accelerating Geostatistical Modeling and Prediction With Mixed-Precision Computations: A High-Productivity Approach With PaRSEC.
IEEE Trans. Parallel Distributed Syst., 2022

High-performance 3D Unstructured Mesh Deformation Using Rank Structured Matrix Computations.
ACM Trans. Parallel Comput., 2022

A Nonlinear Elimination Preconditioned Inexact Newton Algorithm.
SIAM J. Sci. Comput., 2022

tlrmvnmvt: Computing High-Dimensional Multivariate Normal and Student-t Probabilities with Low-Rank Methods in R.
J. Stat. Softw., 2022

Performance analysis of relaxation Runge-Kutta methods.
Int. J. High Perform. Comput. Appl., 2022

Responsibly Reckless Matrix Algorithms for HPC Scientific Applications.
Comput. Sci. Eng., 2022

Optimized explicit Runge-Kutta schemes for high-order collocated discontinuous Galerkin methods for compressible fluid dynamics.
Comput. Math. Appl., 2022

H2Opus: a distributed-memory multi-GPU software package for non-local operators.
Adv. Comput. Math., 2022

Reshaping Geostatistical Modeling and Prediction for Extreme-Scale Environmental Applications.
Proceedings of the SC22: International Conference for High Performance Computing, 2022

Parallel space-time likelihood optimization for air pollution prediction on large-scale systems.
Proceedings of the PASC '22: Platform for Advanced Scientific Computing Conference, Basel, Switzerland, June 27, 2022

Parallel Approximations of the Tukey g-and-h Likelihoods and Predictions for Non-Gaussian Geostatistics.
Proceedings of the 2022 IEEE International Parallel and Distributed Processing Symposium, 2022

A Framework to Exploit Data Sparsity in Tile Low-Rank Cholesky Factorization.
Proceedings of the 2022 IEEE International Parallel and Distributed Processing Symposium, 2022

High-Performance Spatial Data Compression for Scientific Applications.
Proceedings of the Euro-Par 2022: Parallel Processing, 2022

2021
High Performance Multivariate Geospatial Statistics on Manycore Systems.
IEEE Trans. Parallel Distributed Syst., 2021

Accelerating Seismic Redatuming Using Tile Low-Rank Approximations on NEC SX-Aurora TSUBASA.
Supercomput. Front. Innov., 2021

Nonlinear Preconditioning Strategies for Two-Phase Flows in Porous Media Discretized by a Fully Implicit Discontinuous Galerkin Method.
SIAM J. Sci. Comput., 2021

Approximate Error Bounds on Solutions of Nonlinearly Preconditioned PDEs.
SIAM J. Sci. Comput., 2021

Exploiting low-rank covariance structures for computing high-dimensional normal and Student-t probabilities.
Stat. Comput., 2021

On the robustness and performance of entropy stable collocated discontinuous Galerkin methods.
J. Comput. Phys., 2021

Sum of Kronecker products representation and its Cholesky factorization for spatial covariance matrices from large grids.
Comput. Stat. Data Anal., 2021

Space-Fractional Diffusion with Variable Order and Diffusivity: Discretization and Direct Solution Strategies.
CoRR, 2021

H2OPUS-TLR: High Performance Tile Low Rank Symmetric Factorizations using Adaptive Randomized Approximation.
CoRR, 2021

High-Performance Partial Spectrum Computation for Symmetric eigenvalue problems and the SVD.
CoRR, 2021

The Arab world prepares the exascale workforce.
Commun. ACM, 2021

An O(N) algorithm for computing expectation of N-dimensional truncated multi-variate normal distribution I: fundamentals.
Adv. Comput. Math., 2021

Meeting the real-time challenges of ground-based telescopes using low-rank matrix computations.
Proceedings of the International Conference for High Performance Computing, 2021

Leveraging PaRSEC Runtime Support to Tackle Challenging 3D Data-Sparse Matrix Problems.
Proceedings of the 35th IEEE International Parallel and Distributed Processing Symposium, 2021

Implications of Reduced Communication Precision in a Collocated Discontinuous Galerkin Finite Element Framework.
Proceedings of the 2021 IEEE High Performance Extreme Computing Conference, 2021

Outsmarting the Atmospheric Turbulence for Ground-Based Telescopes Using the Stochastic Levenberg-Marquardt Method.
Proceedings of the Euro-Par 2021: Parallel Processing, 2021

2020
Abstraction Layer For Standardizing APIs of Task-Based Engines.
IEEE Trans. Parallel Distributed Syst., 2020

A Multilayer Nonlinear Elimination Preconditioned Inexact Newton Method for Steady-State Incompressible Flow Problems in Three Dimensions.
SIAM J. Sci. Comput., 2020

Hierarchical Matrix Approximations of Hessians Arising in Inverse Problems Governed by PDEs.
SIAM J. Sci. Comput., 2020

Fully implicit hybrid two-level domain decomposition algorithms for two-phase flows in porous media on 3D unstructured grids.
J. Comput. Phys., 2020

Asynchronous computations for solving the acoustic wave propagation equation.
Int. J. High Perform. Comput. Appl., 2020

Energy-conserving 3D elastic wave simulation with finite difference discretization on staggered grids with nonconforming interfaces.
CoRR, 2020

High Performance Multivariate Spatial Modeling for Geostatistical Data on Manycore Systems.
CoRR, 2020

Explicit coupling of acoustic and elastic wave propagation in finite difference simulations.
CoRR, 2020

Hierarchical Matrix Approximations of Hessians Arising in Inverse Problems Governed by PDEs.
CoRR, 2020

Performance / Complexity Trade-offs of the Sphere Decoder Algorithm for Massive MIMO Systems.
CoRR, 2020

Performance study of sustained petascale direct numerical simulation on Cray XC40 systems.
Concurr. Comput. Pract. Exp., 2020

Solving Acoustic Boundary Integral Equations Using High Performance Tile Low-Rank LU Factorization.
Proceedings of the High Performance Computing - 35th International Conference, 2020

Extreme-Scale Task-Based Cholesky Factorization Toward Climate and Weather Prediction Applications.
Proceedings of the PASC '20: Platform for Advanced Scientific Computing Conference, Geneva, Switzerland, June 29, 2020

Maximizing I/O Bandwidth for Reverse Time Migration on Heterogeneous Large-Scale Systems.
Proceedings of the Euro-Par 2020: Parallel Processing, 2020

2019
Massively Parallel Polar Decomposition on Distributed-memory Systems.
ACM Trans. Parallel Comput., 2019

A QDWH-based SVD Software Framework on Distributed-memory Manycore Systems.
ACM Trans. Math. Softw., 2019

Batched Triangular Dense Linear Algebra Kernels for Very Small Matrix Sizes on GPUs.
ACM Trans. Math. Softw., 2019

Hierarchical Matrix Operations on GPUs: Matrix-Vector Multiplication and Compression.
ACM Trans. Math. Softw., 2019

Randomized GPU Algorithms for the Construction of Hierarchical Matrices from Matrix-Vector Operations.
SIAM J. Sci. Comput., 2019

Extreme Scale FMM-Accelerated Boundary Integral Equation Solver for Wave Scattering.
SIAM J. Sci. Comput., 2019

Hierarchical-block conditioning approximations for high-dimensional multivariate normal probabilities.
Stat. Comput., 2019

Fast parallel multidimensional FFT using advanced MPI.
J. Parallel Distributed Comput., 2019

mpi4py-fft: Parallel Fast Fourier Transforms with MPI for Python.
J. Open Source Softw., 2019

Combining finite element and finite difference methods for isotropic elastic wave simulations in an energy-conserving manner.
J. Comput. Phys., 2019

SBP-SAT finite difference discretization of acoustic wave equations on staggered block-wise uniform grids.
J. Comput. Appl. Math., 2019

Likelihood approximation with hierarchical matrices for large spatial datasets.
Comput. Stat. Data Anal., 2019

On the robustness and performance of entropy stable discontinuous collocation methods for the compressible Navier-Stokes equations.
CoRR, 2019

ExaGeoStatR: A Package for Large-Scale Geostatistics in R.
CoRR, 2019

Solution of the 3D density-driven groundwater flow problem with uncertain porosity and permeability.
CoRR, 2019

Tucker Tensor Analysis of Matérn Functions in Spatial Statistics.
Comput. Methods Appl. Math., 2019

Mixed-Precision Tomographic Reconstructor Computations on Hardware Accelerators.
Proceedings of the 9th IEEE/ACM Workshop on Irregular Applications: Architectures and Algorithms, 2019

Performance Analysis of Tile Low-Rank Cholesky Factorization Using PaRSEC Instrumentation Tools.
Proceedings of the IEEE/ACM International Workshop on Programming and Performance Visualization Tools, 2019

MLBS: Transparent Data Caching in Hierarchical Storage for Out-of-Core HPC Applications.
Proceedings of the 26th IEEE International Conference on High Performance Computing, 2019

Geostatistical Modeling and Prediction Using Mixed Precision Tile Cholesky Factorization.
Proceedings of the 26th IEEE International Conference on High Performance Computing, 2019

Leveraging Task-Based Polar Decomposition Using PARSEC on Massively Parallel Systems.
Proceedings of the 2019 IEEE International Conference on Cluster Computing, 2019

Asynchronous Task-Based Execution of the Reverse Time Migration for the Oil and Gas Industry.
Proceedings of the 2019 IEEE International Conference on Cluster Computing, 2019

2018
Asynchronous Task-Based Polar Decomposition on Single Node Manycore Architectures.
IEEE Trans. Parallel Distributed Syst., 2018

Optimizations of Unstructured Aerodynamics Computations for Many-core Architectures.
IEEE Trans. Parallel Distributed Syst., 2018

ExaGeoStat: A High Performance Unified Software for Geostatistics on Manycore Systems.
IEEE Trans. Parallel Distributed Syst., 2018

Multidimensional Intratile Parallelization for Memory-Starved Stencil Computations.
ACM Trans. Parallel Comput., 2018

A Note on Adaptive Nonlinear Preconditioning Techniques.
SIAM J. Sci. Comput., 2018

Accelerated Cyclic Reduction: A distributed-memory fast solver for structured linear systems.
Parallel Comput., 2018

Batched QR and SVD algorithms on GPUs with applications in hierarchical matrix compression.
Parallel Comput., 2018

Parallel accelerated cyclic reduction preconditioner for three-dimensional elliptic PDEs with variable coefficients.
J. Comput. Appl. Math., 2018

Big data and extreme-scale computing.
Int. J. High Perform. Comput. Appl., 2018

Tile Low-Rank Approximation of Large-Scale Maximum Likelihood Estimation on Manycore Architectures.
CoRR, 2018

Fast multipole preconditioners for sparse matrices arising from elliptic equations.
Comput. Vis. Sci., 2018

Extreme Computing for Extreme Adaptive Optics: The Key to Finding Life Outside our Solar System.
Proceedings of the Platform for Advanced Scientific Computing Conference, 2018

Real-Time Massively Distributed Multi-object Adaptive Optics Simulations for the European Extremely Large Telescope.
Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium, 2018

Performance Assessment of Hybrid Parallelism for Large-Scale Reservoir Simulation on Multi- and Many-core Architectures.
Proceedings of the 2018 IEEE High Performance Extreme Computing Conference, 2018

Tile Low-Rank GEMM Using Batched Operations on GPUs.
Proceedings of the Euro-Par 2018: Parallel Processing, 2018

Exploiting Data Sparsity for Large-Scale Matrix Computations.
Proceedings of the Euro-Par 2018: Parallel Processing, 2018

Parallel Approximation of the Maximum Likelihood Estimation for the Prediction of Large-Scale Geostatistics Simulations.
Proceedings of the IEEE International Conference on Cluster Computing, 2018

2017
A scalable community detection algorithm for large graphs using stochastic block models.
Intell. Data Anal., 2017

ExaGeoStat: A High Performance Unified Framework for Geostatistics on Manycore Systems.
CoRR, 2017

A framework for dense triangular matrix kernels on various manycore architectures.
Concurr. Comput. Pract. Exp., 2017

Tile Low Rank Cholesky Factorization for Climate/Weather Modeling Applications on Manycore Architectures.
Proceedings of the High Performance Computing - 32nd International Conference, 2017

Communication Reducing Algorithms for Distributed Hierarchical N-Body Problems with Boundary Distributions.
Proceedings of the High Performance Computing - 32nd International Conference, 2017

Asynchronous Task-Based Parallelization of Algebraic Multigrid.
Proceedings of the Platform for Advanced Scientific Computing Conference, 2017

Performance Evaluation of Computation and Communication Kernels of the Fast Multipole Method on Intel Manycore Architecture.
Proceedings of the Euro-Par 2017: Parallel Processing - 23rd International Conference on Parallel and Distributed Computing, Santiago de Compostela, Spain, August 28, 2017

2016
A High Performance QDWH-SVD Solver Using Hardware Accelerators.
ACM Trans. Math. Softw., 2016

KBLAS: An Optimized Library for Dense Matrix-Vector Multiplication on GPU Accelerators.
ACM Trans. Math. Softw., 2016

Accelerated Dimension-Independent Adaptive Metropolis.
SIAM J. Sci. Comput., 2016

Convergence Analysis for the Multiplicative Schwarz Preconditioned Inexact Newton Algorithm.
SIAM J. Numer. Anal., 2016

Unstructured computational aerodynamics on many integrated core architecture.
Parallel Comput., 2016

A performance model for the communication in fast multipole methods on high-performance computing platforms.
Int. J. High Perform. Comput. Appl., 2016

Fast Multipole Method as a Matrix-Free Hierarchical Low-Rank Approximation.
CoRR, 2016

Research and Education in Computational Science and Engineering.
CoRR, 2016

A Matrix-free Preconditioner for the Helmholtz Equation based on the Fast Multipole Method.
CoRR, 2016

A Direct Elliptic Solver Based on Hierarchically Low-rank Schur Complements.
CoRR, 2016

Performance optimization of Sparse Matrix-Vector Multiplication for multi-component PDE-based applications using GPUs.
Concurr. Comput. Pract. Exp., 2016

Efficiency of High Order Spectral Element Methods on Petascale Architectures.
Proceedings of the High Performance Computing - 31st International Conference, 2016

On the Robustness and Prospects of Adaptive BDDC Methods for Finite Element Discretizations of Elliptic PDEs with High-Contrast Coefficients.
Proceedings of the Platform for Advanced Scientific Computing Conference, 2016

Optimization of an Electromagnetics Code with Multicore Wavefront Diamond Blocking and Multi-dimensional Intra-Tile Parallelization.
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium, 2016

Efficient Sphere Detector Algorithm for Massive MIMO using GPU Hardware Accelerator.
Proceedings of the International Conference on Computational Science 2016, 2016

High Performance Polar Decomposition on Distributed Memory Systems.
Proceedings of the Euro-Par 2016: Parallel Processing, 2016

Redesigning Triangular Dense Matrix Computations on GPUs.
Proceedings of the Euro-Par 2016: Parallel Processing, 2016

2015
Dense Matrix Computations on NUMA Architectures with Distance-Aware Work Stealing.
Supercomput. Front. Innov., 2015

Multicore-Optimized Wavefront Diamond Blocking for Optimizing Stencil Updates.
SIAM J. Sci. Comput., 2015

Field-Split Preconditioned Inexact Newton Algorithms.
SIAM J. Sci. Comput., 2015

A parallel domain decomposition-based implicit method for the Cahn-Hilliard-Cook phase-field equation in 3D.
J. Comput. Phys., 2015

Smooth and robust solutions for Dirichlet boundary control of fluid-solid conjugate heat transfer problems.
J. Comput. Phys., 2015

Multi-dimensional intra-tile parallelization for memory-starved stencil computations.
CoRR, 2015

Design and Optimization of OpenFOAM-based CFD Applications for Hybrid and Heterogeneous HPC Platforms.
CoRR, 2015

Exploring Shared-Memory Optimizations for an Unstructured Mesh CFD Application on Modern Parallel Systems.
Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium, 2015

Composing Algorithmic Skeletons to Express High-Performance Scientific Applications.
Proceedings of the 29th ACM on International Conference on Supercomputing, 2015

High Performance Multi-GPU SpMV for Multi-component PDE-Based Applications.
Proceedings of the Euro-Par 2015: Parallel Processing, 2015

2014
Communication Complexity of the Fast Multipole Method and its Algebraic Variants.
Supercomput. Front. Innov., 2014

Towards energy efficiency and maximum computational intensity for stencil algorithms using wavefront diamond temporal blocking.
CoRR, 2014

A Performance Model for the Communication in Fast Multipole Methods on HPC Platforms.
CoRR, 2014

Asynchronous Execution of the Fast Multipole Method Using Charm++.
CoRR, 2014

Pipelining Computational Stages of the Tomographic Reconstructor for Multi-Object Adaptive Optics on a Multi-GPU System.
Proceedings of the International Conference for High Performance Computing, 2014

High Performance Pseudo-analytical Simulation of Multi-Object Adaptive Optics over Multi-GPU Systems.
Proceedings of the Euro-Par 2014 Parallel Processing, 2014

2013
The Miracle, Mandate and Mirage of High Performance Computing.
it Inf. Technol., 2013

Optimizing the performance of streaming numerical kernels on the IBM Blue Gene/P PowerPC 450 processor.
Int. J. High Perform. Comput. Appl., 2013

Multiphysics simulations: Challenges and opportunities.
Int. J. High Perform. Comput. Appl., 2013

Topic 14+16: High-Performance and Scientific Applications and Extreme-Scale Computing - (Introduction).
Proceedings of the Euro-Par 2013 Parallel Processing, 2013

2012
A Quasi-algebraic Multigrid Approach to Fracture Problems Based on Extended Finite Elements.
SIAM J. Sci. Comput., 2012

Numerical simulation of four-field extended magnetohydrodynamics in dynamically adaptive curvilinear coordinates via Newton-Krylov-Schwarz.
J. Comput. Phys., 2012

Optimizing Memory-Bound SYMV Kernel on GPU Hardware Accelerators.
Proceedings of the High Performance Computing for Computational Science, 2012

Multiplicative Algorithms for Constrained Non-negative Matrix Factorization.
Proceedings of the 12th IEEE International Conference on Data Mining, 2012

Systematic Approach in Optimizing Numerical Memory-Bound Kernels on GPU.
Proceedings of the Euro-Par 2012: Parallel Processing Workshops, 2012

2011
Special Section: 2010 Copper Mountain Conference.
SIAM J. Sci. Comput., 2011

The International Exascale Software Project roadmap.
Int. J. High Perform. Comput. Appl., 2011

Moving grids for magnetic reconnection via Newton-Krylov methods.
Comput. Phys. Commun., 2011

Hybrid Programming Model for Implicit PDE Simulations on Multicore Architectures.
Proceedings of the OpenMP in the Petascale Era - 7th International Workshop on OpenMP, 2011

2010
Application of Alternating Decision Trees in Selecting Sparse Linear Solvers.
Proceedings of the Software Automatic Tuning, From Concepts to State-of-the-Art Results, 2010

2009
Linear augmented Slater-type orbital method for free standing clusters.
J. Comput. Chem., 2009

Partial Differential Equation-Based Applications and Solvers At Extreme Scale.
Int. J. High Perform. Comput. Appl., 2009

Modeling wildland fire propagation with level set methods.
Comput. Math. Appl., 2009

2008
Special Issue on Computational Science and Engineering.
SIAM J. Sci. Comput., 2008

2007
Additive Schwarz-based fully coupled implicit methods for resistive Hall magnetohydrodynamic problems.
J. Comput. Phys., 2007

Reconstructing parameters of the FitzHugh-Nagumo system from boundary potential measurements.
J. Comput. Neurosci., 2007

Petaflop/s, Seriously.
Proceedings of the High Performance Computing, 2007

2006
Multi-core issues - Multi-Core for HPC: breakthrough or breakdown?
Proceedings of the ACM/IEEE SC2006 Conference on High Performance Networking and Computing, 2006

M06 - Issues for the future of supercomputing: impact of Moore's law and architecture on application performance.
Proceedings of the ACM/IEEE SC2006 Conference on High Performance Networking and Computing, 2006

Grid-based Image Registration.
Proceedings of the Grid-Based Problem Solving Environments, 2006

Parallel Algorithms for PDE-Constrained Optimization.
Proceedings of the Parallel Processing for Scientific Computing, 2006

2004
Topic 11: Numerical Algorithms.
Proceedings of the Euro-Par 2004 Parallel Processing, 2004

2003
Pseudotransient Continuation and Differential-Algebraic Equations.
SIAM J. Sci. Comput., 2003

2002
Nonlinearly Preconditioned Inexact Newton Algorithms.
SIAM J. Sci. Comput., 2002

2001
High-performance parallel implicit CFD.
Parallel Comput., 2001

2000
Globalized Newton-Krylov-Schwarz Algorithms and Software for Parallel Implicit CFD.
Int. J. High Perform. Comput. Appl., 2000

Performance Modeling and Tuning of an Unstructured Mesh CFD Application.
Proceedings of Supercomputing 2000, 2000

Analyzing the Parallel Scalability of an Implicit Unstructured Mesh CFD Code.
Proceedings of the High Performance Computing, 2000

Four Horizons for Enhancing the Performance of Parallel Simulations Based on Partial Differential Equations.
Proceedings of the Euro-Par 2000, Parallel Processing, 6th International Euro-Par Conference, Munich, Germany, August 29, 2000

1999
Adapting to Hostile Architectural Environments.
Parallel Distributed Comput. Pract., 1999

Three Parallel Programming Paradigms: Comparisons on an Archetypal PDE Computation.
Parallel Distributed Comput. Pract., 1999

Achieving High Sustained Performance in an Unstructured Mesh CFD Application.
Proceedings of the ACM/IEEE Conference on Supercomputing, 1999

Parallelization of an Object-Oriented Unstructured Aeroacoustics Solver.
Proceedings of the Ninth SIAM Conference on Parallel Processing for Scientific Computing, 1999

1998
Parallel Newton-Krylov-Schwarz Algorithms for the Transonic Full Potential Equation.
SIAM J. Sci. Comput., 1998

1997
Parallel Implicit PDE Computations.
Proceedings of the Conference on Parallel Computational Fluid Dynamics 1997, 1997

1996
A Hyperbolic Model for Communications in Layered Parallel Processing Environments.
J. Parallel Distributed Comput., 1996

Evaluating the Hyperbolic Model on a Variety of Architectures.
Proceedings of the Euro-Par '96 Parallel Processing, 1996

1995
Modeling Communication in Cluster Computing.
Proceedings of the Seventh SIAM Conference on Parallel Processing for Scientific Computing, 1995

1994
Towards Polyalgorithmic Linear System Solvers for Nonlinear Elliptic Problems.
SIAM J. Sci. Comput., 1994

A comparison of some domain decomposition and ILU preconditioned iterative methods for nonsymmetric elliptic problems.
Numer. Linear Algebra Appl., 1994

1992
Domain Decomposition with Local Mesh Refinement.
SIAM J. Sci. Comput., 1992

Parallel Performance of Domain-Decomposed Preconditioned Krylov Methods for PDEs with Locally Uniform Refinement.
SIAM J. Sci. Comput., 1992

1989
Domain decomposition on parallel computers.
IMPACT Comput. Sci. Eng., 1989

Balanced Divide-and-Conquer Algorithms for the Fine-Grained Parallel Direct Solution of Dense and Banded Triangular Linear Systems and their Connection Machine Implementation.
Proceedings of the Fourth SIAM Conference on Parallel Processing for Scientific Computing, 1989

Parallel Domain Decomposition with Local Mesh Refinement.
Proceedings of the Fourth SIAM Conference on Parallel Processing for Scientific Computing, 1989

1987
Analysis of a Parallelized Elliptic Solver for Reacting Flows-Abstract.
Proceedings of the Third SIAM Conference on Parallel Processing for Scientific Computing, 1987

1985
A comparison of domain decomposition techniques for elliptic partial differential equations and their parallel implementation.
Proceedings of the Selected Papers from the Second Conference on Parallel Processing for Scientific Computing, 1985

