Sandra Catalán

Proceedings of the IEEE International Parallel and Distributed Processing Symposium Workshops, 2021

2020

sLASs: A fully automatic auto-tuned linear algebra library based on OpenMP extensions implemented in OmpSs (LASs Library).

J. Parallel Distributed Comput., 2020

Programming parallel dense matrix factorizations with look-ahead and OpenMP.

Adrián Castelló

Clust. Comput., 2020

Towards an Auto-Tuned and Task-Based SpMV (LASs Library).

Proceedings of the OpenMP: Portable Multi-Level Parallelism on Modern Systems, 2020

2019

Dynamic look-ahead in the reduction to band form for the singular value decomposition.

Rocío Carratalá-Sáez

Parallel Comput., 2019

Look-ahead in the two-sided reduction to compact band forms for symmetric eigenvalue problems and the SVD.

Numer. Algorithms, 2019

A Case for Malleable Thread-Level Linear Algebra Libraries: The LU Factorization With Partial Pivoting.

Robert A. van de Geijn

IEEE Access, 2019

Teaching on Demand: an HPC Experience.

Rocío Carratalá-Sáez

Sergio Iserte

Leonel Antonio Toledo Díaz

Proceedings of the 2019 IEEE/ACM Workshop on Education for High-Performance Computing, 2019

BLAS-3 Optimized by OmpSs Regions (LASs Library).

Proceedings of the 27th Euromicro International Conference on Parallel, 2019

Tasking in Accelerators: Performance Evaluation.

Proceedings of the 20th International Conference on Parallel and Distributed Computing, 2019

Accelerating Conjugate Gradient using OmpSs.

Pedro Valero-Lara

Proceedings of the 20th International Conference on Parallel and Distributed Computing, 2019

2018

Multithreaded Dense Linear Algebra on Asymmetric Multi-core Processors.

PhD thesis, 2018

Static scheduling of the LU factorization with look-ahead on asymmetric multicore processors.

Parallel Comput., 2018

Energy balance between voltage-frequency scaling and resilience for linear algebra routines on low-power multicore architectures.

Parallel Comput., 2018

Two-sided orthogonal reductions to condensed forms on asymmetric multicore processors.

Pedro Alonso

Parallel Comput., 2018

Multi-threaded dense linear algebra libraries for low-power asymmetric multicore processors.

Chris Adeniyi-Jones

J. Comput. Sci., 2018

Reduction to Band Form for the Singular Value Decomposition on Graphics Accelerators.

Proceedings of the 9th International Workshop on Programming Models and Applications for Multicores and Manycores, 2018

2017

Time and energy modeling of a high-performance multi-threaded Cholesky factorization.

J. Supercomput., 2017

Revisiting conventional task schedulers to exploit asymmetry in multi-core architectures for dense linear algebra operations.

Parallel Comput., 2017

Two-Sided Reduction to Compact Band Forms with Look-Ahead.

CoRR, 2017

Reduction to Tridiagonal Form for Symmetric Eigenproblems on Asymmetric Multicore Processors.

Pedro Alonso

Proceedings of the 8th International Workshop on Programming Models and Applications for Multicores and Manycores, 2017

Static Versus Dynamic Task Scheduling of the LU Factorization on ARM big.LITTLE Architectures.

Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium Workshops, 2017

2016

An analytical methodology to derive power models based on hardware and software metrics.

Manuel F. Dolz

Julian M. Kunkel

Konstantinos Chasapis

Dimitrios S. Nikolopoulos

Comput. Sci. Res. Dev., 2016

Evaluating fault tolerance on asymmetric multicore systems-on-chip using iso-metrics.

Charalampos Chalios

IET Comput. Digit. Tech., 2016

Architecture-aware configuration and scheduling of matrix multiplication on asymmetric multicore processors.

Clust. Comput., 2016

Refactoring Conventional Task Schedulers to Exploit Asymmetric ARM big.LITTLE Architectures in Dense Linear Algebra.

Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium Workshops, 2016

The Impact of Panel Factorization on the Gauss-Huard Algorithm for the Solution of Linear Systems on Modern Architectures.

Pablo Ezzatti

Alfredo Remón

Proceedings of the Algorithms and Architectures for Parallel Processing, 2016

The Impact of Voltage-Frequency Scaling for the Matrix-Vector Product on the IBM POWER8.

A. Cristiano I. Malossi

Costas Bekas

Proceedings of the Euro-Par 2016: Parallel Processing, 2016

2015

Time and energy modeling of high-performance Level-3 BLAS on x86 architectures.

Simul. Model. Pract. Theory, 2015

Evaluating the performance and energy efficiency of the COSMO-ART model system.

Comput. Sci. Res. Dev., 2015

Reducing the cost of power monitoring with DC wattmeters.

M. Asunción Castaño

Comput. Sci. Res. Dev., 2015

Performance and Energy Optimization of Matrix Multiplication on Asymmetric big.LITTLE Processors.

CoRR, 2015

Multi-Threaded Dense Linear Algebra Libraries for Low-Power Asymmetric Multicore Processors.

CoRR, 2015

Performance and Fault Tolerance of Preconditioned Iterative Solvers on Low-Power ARM Architectures.

José Ignacio Aliaga

Dimitrios S. Nikolopoulos

Charalampos Chalios

Proceedings of the Parallel Computing: On the Road to Exascale, 2015

2014

Assessing Power Monitoring Approaches for Energy and Power Analysis of Computers.

Mohammed el Mehdi Diouri

Sustain. Comput. Informatics Syst., 2014

Automatic detection of power bottlenecks in parallel scientific applications.

Comput. Sci. Res. Dev., 2014

Analyzing the Energy Efficiency of the Memory Subsystem in Multicore Processors.

Jorge González-Domínguez

Proceedings of the IEEE International Symposium on Parallel and Distributed Processing with Applications, 2014

2013

Solving Some Mysteries in Power Monitoring of Servers: Take Care of Your Wattmeters!

Mohammed el Mehdi Diouri