Andrés Tomás

Orcid: 0000-0003-3969-2174

According to our database1, Andrés Tomás authored at least 41 papers between 2005 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Parallel Reduced Order Modeling for Digital Twins using High-Performance Computing Workflows.
CoRR, 2024

2023
Fast truncated SVD of sparse and dense matrices on graphics processors.
Int. J. High Perform. Comput. Appl., July, 2023

Performance-energy trade-offs of deep learning convolution algorithms on ARM processors.
J. Supercomput., June, 2023

Compressed basis GMRES on high-performance graphics processing units.
Int. J. High Perform. Comput. Appl., March, 2023

Reformulating the direct convolution for high-performance deep learning inference on ARM processors.
J. Syst. Archit., February, 2023

Sparse matrix-vector and matrix-multivector products for the truncated SVD on graphics processors.
Concurr. Comput. Pract. Exp., 2023

Tall-and-Skinny QR Factorization for Clusters of GPUs Using High-Performance Building Blocks.
Proceedings of the Euro-Par 2023: Parallel Processing Workshops - Euro-Par 2023 International Workshops, Limassol, Cyprus, August 28, 2023

2022
BestOf: an online implementation selector for the training and inference of deep neural networks.
J. Supercomput., 2022

High performance and energy efficient inference for deep learning on multicore ARM processors using general optimization techniques and BLIS.
J. Syst. Archit., 2022

Compression and load balancing for efficient sparse matrix-vector product on multicore processors and graphics processing units.
Concurr. Comput. Pract. Exp., 2022

2020
Tall-and-skinny QR factorization with approximate Householder reflectors on graphics processors.
J. Supercomput., 2020

Compressed Basis GMRES on High Performance GPUs.
CoRR, 2020

Balanced and Compressed Coordinate Layout for the Sparse Matrix-Vector Product on GPUs.
Proceedings of the Euro-Par 2020: Parallel Processing Workshops, 2020

2019
FloatX: A C++ Library for Customized Floating-Point Arithmetic.
ACM Trans. Math. Softw., 2019

Dynamic look-ahead in the reduction to band form for the singular value decomposition.
Parallel Comput., 2019

Look-ahead in the two-sided reduction to compact band forms for symmetric eigenvalue problems and the SVD.
Numer. Algorithms, 2019

Cholesky and Gram-Schmidt Orthogonalization for Tall-and-Skinny QR Factorizations on Graphics Processors.
Proceedings of the Euro-Par 2019: Parallel Processing, 2019

2018
Residual Replacement in Mixed-Precision Iterative Refinement for Sparse Linear Systems.
Proceedings of the High Performance Computing, 2018

Reduction to Band Form for the Singular Value Decomposition on Graphics Accelerators.
Proceedings of the 9th International Workshop on Programming Models and Applications for Multicores and Manycores, 2018

Fast Blocking of Householder Reflectors on Graphics Processors.
Proceedings of the 26th Euromicro International Conference on Parallel, 2018

The transprecision computing paradigm: Concept, design, and applications.
Proceedings of the 2018 Design, Automation & Test in Europe Conference & Exhibition, 2018

2017
Empirical Study and Modeling of Vehicular Communications at Intersections in the 5 GHz Band.
Mob. Inf. Syst., 2017

Two-Sided Reduction to Compact Band Forms with Look-Ahead.
CoRR, 2017

On the impact of urban intersection characteristics in vehicular to vehicular (V2V) communications.
Proceedings of the 13th International Wireless Communications and Mobile Computing Conference, 2017

Evaluating the use of sub-gigahertz wireless technologies to improve message delivery in opportunistic networks.
Proceedings of the 14th IEEE International Conference on Networking, Sensing and Control, 2017

Variable-Size Batched Gauss-Huard for Block-Jacobi Preconditioning.
Proceedings of the International Conference on Computational Science, 2017

Selecting the optimal buffer management for opportunistic networks both in pedestrian and vehicular contexts.
Proceedings of the 14th IEEE Annual Consumer Communications & Networking Conference, 2017

Mobility as the Main Enabler of Opportunistic Data Dissemination in Urban Scenarios.
Proceedings of the Ad-hoc, Mobile, and Wireless Networks, 2017

2016
Friendly-Sharing: Improving the Performance of City Sensoring through Contact-Based Messaging Applications.
Sensors, 2016

MuffinEc: Error correction for de Novo assembly via greedy partitioning and sequence alignment.
Inf. Sci., 2016

Evaluating the Impact of Data Transfer Time and Mobility Patterns in Opportunistic Networks.
Proceedings of the 2016 Intl IEEE Conferences on Ubiquitous Intelligence & Computing, 2016

Improving Message Delivery Performance in Opportunistic Networks Using a Forced-Stop Diffusion Scheme.
Proceedings of the Ad-hoc, Mobile, and Wireless Networks - 15th International Conference, 2016

2014
Inexact Sequence Mapping Study Cases: Hybrid GPU Computing and Memory Demanding Indexes.
Proceedings of the International Work-Conference on Bioinformatics and Biomedical Engineering, 2014

Robust Error Correction for De Novo Assembly via Spectral Partitioning and Sequence Alignment.
Proceedings of the International Work-Conference on Bioinformatics and Biomedical Engineering, 2014

A Fast Sparse Block Circulant Matrix Vector Product.
Proceedings of the Euro-Par 2014 Parallel Processing, 2014

2012
Using GPUs for the Exact Alignment of Short-Read Genetic Sequences by Means of the Burrows-Wheeler Transform.
IEEE ACM Trans. Comput. Biol. Bioinform., 2012

Parallelization of the QR Decomposition with Column Pivoting Using Column Cyclic Distribution on Multicore and GPU Processors.
Proceedings of the High Performance Computing for Computational Science, 2012

Advancing Large Scale Many-Body QMC Simulations on GPU Accelerated Multicore Systems.
Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium, 2012

2007
Parallel Arnoldi eigensolvers with enhanced scalability via global communications rearrangement.
Parallel Comput., 2007

2006
Evaluation of Several Variants of Explicitly Restarted Lanczos Eigensolvers and Their Parallel Implementations.
Proceedings of the High Performance Computing for Computational Science, 2006

2005
A Parallel Variant of the Gram-Schmidt Process with Reorthogonalization.
Proceedings of the Parallel Computing: Current & Future Issues of High-End Computing, 2005


  Loading...