Andrés Tomás
Orcid: 0000-0003-3969-2174
According to our database, Andrés Tomás authored at least 41 papers between 2005 and 2024.
Bibliography
2024
Parallel Reduced Order Modeling for Digital Twins using High-Performance Computing Workflows.
CoRR, 2024
2023
Int. J. High Perform. Comput. Appl., July, 2023
Performance-energy trade-offs of deep learning convolution algorithms on ARM processors.
J. Supercomput., June, 2023
Int. J. High Perform. Comput. Appl., March, 2023
Reformulating the direct convolution for high-performance deep learning inference on ARM processors.
J. Syst. Archit., February, 2023
Sparse matrix-vector and matrix-multivector products for the truncated SVD on graphics processors.
Concurr. Comput. Pract. Exp., 2023
Tall-and-Skinny QR Factorization for Clusters of GPUs Using High-Performance Building Blocks.
Proceedings of the Euro-Par 2023: Parallel Processing Workshops - Euro-Par 2023 International Workshops, Limassol, Cyprus, August 28, 2023
2022
BestOf: an online implementation selector for the training and inference of deep neural networks.
J. Supercomput., 2022
High performance and energy efficient inference for deep learning on multicore ARM processors using general optimization techniques and BLIS.
J. Syst. Archit., 2022
Compression and load balancing for efficient sparse matrix-vector product on multicore processors and graphics processing units.
Concurr. Comput. Pract. Exp., 2022
2020
Tall-and-skinny QR factorization with approximate Householder reflectors on graphics processors.
J. Supercomput., 2020
Balanced and Compressed Coordinate Layout for the Sparse Matrix-Vector Product on GPUs.
Proceedings of the Euro-Par 2020: Parallel Processing Workshops, 2020
2019
ACM Trans. Math. Softw., 2019
Dynamic look-ahead in the reduction to band form for the singular value decomposition.
Parallel Comput., 2019
Look-ahead in the two-sided reduction to compact band forms for symmetric eigenvalue problems and the SVD.
Numer. Algorithms, 2019
Cholesky and Gram-Schmidt Orthogonalization for Tall-and-Skinny QR Factorizations on Graphics Processors.
Proceedings of the Euro-Par 2019: Parallel Processing, 2019
2018
Residual Replacement in Mixed-Precision Iterative Refinement for Sparse Linear Systems.
Proceedings of the High Performance Computing, 2018
Reduction to Band Form for the Singular Value Decomposition on Graphics Accelerators.
Proceedings of the 9th International Workshop on Programming Models and Applications for Multicores and Manycores, 2018
Proceedings of the 26th Euromicro International Conference on Parallel, 2018
Proceedings of the 2018 Design, Automation & Test in Europe Conference & Exhibition, 2018
2017
Empirical Study and Modeling of Vehicular Communications at Intersections in the 5 GHz Band.
Mob. Inf. Syst., 2017
On the impact of urban intersection characteristics in vehicular to vehicular (V2V) communications.
Proceedings of the 13th International Wireless Communications and Mobile Computing Conference, 2017
Evaluating the use of sub-gigahertz wireless technologies to improve message delivery in opportunistic networks.
Proceedings of the 14th IEEE International Conference on Networking, Sensing and Control, 2017
Proceedings of the International Conference on Computational Science, 2017
Selecting the optimal buffer management for opportunistic networks both in pedestrian and vehicular contexts.
Proceedings of the 14th IEEE Annual Consumer Communications & Networking Conference, 2017
Proceedings of the Ad-hoc, Mobile, and Wireless Networks, 2017
2016
Friendly-Sharing: Improving the Performance of City Sensoring through Contact-Based Messaging Applications.
Sensors, 2016
MuffinEc: Error correction for de Novo assembly via greedy partitioning and sequence alignment.
Inf. Sci., 2016
Evaluating the Impact of Data Transfer Time and Mobility Patterns in Opportunistic Networks.
Proceedings of the 2016 Intl IEEE Conferences on Ubiquitous Intelligence & Computing, 2016
Improving Message Delivery Performance in Opportunistic Networks Using a Forced-Stop Diffusion Scheme.
Proceedings of the Ad-hoc, Mobile, and Wireless Networks - 15th International Conference, 2016
2014
Inexact Sequence Mapping Study Cases: Hybrid GPU Computing and Memory Demanding Indexes.
Proceedings of the International Work-Conference on Bioinformatics and Biomedical Engineering, 2014
Robust Error Correction for De Novo Assembly via Spectral Partitioning and Sequence Alignment.
Proceedings of the International Work-Conference on Bioinformatics and Biomedical Engineering, 2014
Proceedings of the Euro-Par 2014 Parallel Processing, 2014
2012
Using GPUs for the Exact Alignment of Short-Read Genetic Sequences by Means of the Burrows-Wheeler Transform.
IEEE ACM Trans. Comput. Biol. Bioinform., 2012
Parallelization of the QR Decomposition with Column Pivoting Using Column Cyclic Distribution on Multicore and GPU Processors.
Proceedings of the High Performance Computing for Computational Science, 2012
Advancing Large Scale Many-Body QMC Simulations on GPU Accelerated Multicore Systems.
Proceedings of the 26th IEEE International Parallel and Distributed Processing Symposium, 2012
2007
Parallel Arnoldi eigensolvers with enhanced scalability via global communications rearrangement.
Parallel Comput., 2007
2006
Evaluation of Several Variants of Explicitly Restarted Lanczos Eigensolvers and Their Parallel Implementations.
Proceedings of the High Performance Computing for Computational Science, 2006
2005
A Parallel Variant of the Gram-Schmidt Process with Reorthogonalization.
Proceedings of the Parallel Computing: Current & Future Issues of High-End Computing, 2005