On rounding error resilience, maximal attainable accuracy and parallel performance of the pipelined Conjugate Gradients method for large-scale linear systems in PETSc.

[BibT_eX]

[DOI]

Siegfried Cools

Wim Vanroose

Emrullah Fatih Yetkin

Emmanuel Agullo

Luc Giraud

Proceedings of the Exascale Applications and Software Conference 2016, 2016

2015

Bridging the Gap between Performance and Bounds of Cholesky Factorization on Heterogeneous Platforms.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium Workshop, 2015

Fast and Accurate Simulation of Multithreaded Sparse Linear Algebra Solvers.

[BibT_eX]

[DOI]

Proceedings of the 21st IEEE International Conference on Parallel and Distributed Systems, 2015

On the Resilience of Parallel Sparse Hybrid Solvers.

[BibT_eX]

[DOI]

Emmanuel Agullo

Luc Giraud

Mawussi Zounon

Proceedings of the 22nd IEEE International Conference on High Performance Computing, 2015

Task-Based Multifrontal QR Solver for GPU-Accelerated Multicore Architectures.

[BibT_eX]

[DOI]

Proceedings of the 22nd IEEE International Conference on High Performance Computing, 2015

2014

Task-Based FMM for Multicore Architectures.

[BibT_eX]

[DOI]

SIAM J. Sci. Comput., 2014

Block GMRES Method with Inexact Breakdowns and Deflated Restarting.

[BibT_eX]

[DOI]

Emmanuel Agullo

Luc Giraud

Yan-Fei Jing

SIAM J. Matrix Anal. Appl., 2014

Task-Based Programming for Seismic Imaging: Preliminary Results.

[BibT_eX]

[DOI]

Proceedings of the 2014 IEEE International Conference on High Performance Computing and Communications, 2014

2013

Parallel algebraic domain decomposition solver for the solution of augmented systems.

[BibT_eX]

[DOI]

Adv. Eng. Softw., 2013

Multifrontal QR Factorization for Multicore Architectures over Runtime Systems.

[BibT_eX]

[DOI]

Proceedings of the Euro-Par 2013 Parallel Processing, 2013

2012

Pipelining the Fast Multipole Method over a Runtime System

[BibT_eX]

[DOI]

CoRR, 2012

Poster: Matrices over Runtime Systems at Exascale.

[BibT_eX]

[DOI]

Proceedings of the 2012 SC Companion: High Performance Computing, 2012

Abstract: Matrices Over Runtime Systems at Exascale.

[BibT_eX]

[DOI]

Proceedings of the 2012 SC Companion: High Performance Computing, 2012

2011

QCG-OMPI: MPI applications on grids.

[BibT_eX]

[DOI]

Future Gener. Comput. Syst., 2011

Fully Empirical Autotuned QR Factorization For Multicore Architectures

[BibT_eX]

[DOI]

CoRR, 2011

QR Factorization on a Multicore Node Enhanced with Multiple GPU Accelerators.

[BibT_eX]

[DOI]

Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011

A Fully Empirical Autotuned Dense QR Factorization for Multicore Architectures.

[BibT_eX]

[DOI]

Proceedings of the Euro-Par 2011 Parallel Processing - 17th International Conference, 2011

LU factorization for accelerator-based systems.

[BibT_eX]

[DOI]

Proceedings of the 9th IEEE/ACS International Conference on Computer Systems and Applications, 2011

2010

Reducing the I/O Volume in Sparse Out-of-core Multifrontal Methods.

[BibT_eX]

[DOI]

Emmanuel Agullo

Abdou Guermouche

Jean-Yves L'Excellent

SIAM J. Sci. Comput., 2010

Towards an Efficient Tile Matrix Inversion of Symmetric Positive Definite Matrices on Multicore Architectures.

[BibT_eX]

[DOI]

Proceedings of the High Performance Computing for Computational Science - VECPAR 2010, 2010

Tile QR factorization with parallel panel processing for multicore architectures.

[BibT_eX]

[DOI]

Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

QR factorization of tall and skinny matrices in a grid computing environment.

[BibT_eX]

[DOI]

Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010

2009

Comparative study of one-sided factorizations with multiple software packages on multi-core hardware.

[BibT_eX]

[DOI]

Proceedings of the ACM/IEEE Conference on High Performance Computing, 2009

2008

On the Out-Of-Core Factorization of Large Sparse Matrices. (Méthodes directes hors-mémoire (out-of-core) pour la résolution de systèmes linéaires creux de grande taille).

[BibT_eX]

[DOI]

Emmanuel Agullo

PhD thesis, 2008

A parallel out-of-core multifrontal method: Storage of factors on disk and analysis of models for an out-of-core active memory.

[BibT_eX]

[DOI]

Emmanuel Agullo

Abdou Guermouche

Jean-Yves L'Excellent

Parallel Comput., 2008

On the I/O Volume in Out-of-Core Multifrontal Methods with a Flexible Allocation Scheme.