Paolo Bientinesi
Orcid: 0000-0002-4972-7097Affiliations:
- Umea University, Sweden
- RWTH Aachen University, Germany (former)
According to our database1,
Paolo Bientinesi
authored at least 103 papers
between 2004 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on zbmath.org
-
on orcid.org
-
on d-nb.info
-
on dl.acm.org
On csauthors.net:
Bibliography
2024
CoRR, 2024
Analyzing and reducing the synthetic-to-real transfer gap in Music Information Retrieval: the task of automatic drum transcription.
CoRR, 2024
2023
2022
Work-Stealing Prefix Scan: Addressing Load Imbalance in Large-Scale Image Registration.
IEEE Trans. Parallel Distributed Syst., 2022
Algorithm 1026: Concurrent Alternating Least Squares for Multiple Simultaneous Canonical Polyadic Decompositions.
ACM Trans. Math. Softw., 2022
The Linear Algebra Mapping Problem. Current State of Linear Algebra Languages and Libraries.
ACM Trans. Math. Softw., 2022
Frontiers Appl. Math. Stat., 2022
Editorial: High-performance tensor computations in scientific computing and data science.
Frontiers Appl. Math. Stat., 2022
Dagstuhl Reports, 2022
Comput. Music. J., 2022
Proceedings of the 2022 IEEE 34th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD), 2022
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2022
Proceedings of the 51st International Conference on Parallel Processing, 2022
2021
ACM Trans. Math. Softw., 2021
Proceedings of the 22nd International Society for Music Information Retrieval Conference, 2021
Performance Comparison for Scientific Computations on the Edge via Relative Performance.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium Workshops, 2021
2020
Dagstuhl Reports, 2020
Concurrent Alternating Least Squares for multiple simultaneous Canonical Polyadic Decompositions.
CoRR, 2020
Accelerating Deep Learning Inference in Constrained Embedded Devices Using Hardware Loops and a Dot Product Unit.
IEEE Access, 2020
Proceedings of the PASC '20: Platform for Advanced Scientific Computing Conference, Geneva, Switzerland, June 29, 2020
2019
J. Comput. Chem., 2019
Int. J. High Perform. Comput. Appl., 2019
2018
ACM Trans. Math. Softw., 2018
J. Comput. Sci., 2018
Optimizing AIREBO: Navigating the Journey from Complex Legacy Code to High Performance.
CoRR, 2018
Proceedings of the High Performance Computing for Computational Science - VECPAR 2018, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the 2018 International Symposium on Code Generation and Optimization, 2018
Proceedings of the 2018 International Symposium on Code Generation and Optimization, 2018
2017
ACM Trans. Math. Softw., 2017
Algorithm 979: Recursive Algorithms for Dense Linear Algebra - The ReLAPACK Collection.
ACM Trans. Math. Softw., 2017
High-performance generation of the Hamiltonian and Overlap matrices in FLAPW methods.
Comput. Phys. Commun., 2017
CoRR, 2017
CoRR, 2017
Proceedings of the High Performance Computing - 32nd International Conference, 2017
Proceedings of the 16th Python in Science Conference 2017, 2017
Proceedings of the 7th Workshop on Python for High-Performance and Scientific Computing, 2017
Proceedings of the 4th ACM SIGPLAN International Workshop on Libraries, 2017
Proceedings of the International Workshop on Parallel Symbolic Computation, 2017
2016
Appl. Math. Comput., 2016
The vectorization of the tersoff multi-body potential: an exercise in performance portability.
Proceedings of the International Conference for High Performance Computing, 2016
Proceedings of the 3rd ACM SIGPLAN International Workshop on Libraries, 2016
2015
Concurr. Comput. Pract. Exp., 2015
Proceedings of the High Performance Computing - 30th International Conference, 2015
Scalable and Efficient Linear Algebra Kernel Mapping for Low Energy Consumption on the Layers CGRA.
Proceedings of the Applied Reconfigurable Computing - 11th International Symposium, 2015
2014
Computing Petaflops over Terabytes of Data: The Case of Genome-Wide Association Studies.
ACM Trans. Math. Softw., 2014
Improved Accuracy and Parallelism for MRRR-Based Eigensolvers - A Mixed Precision Approach.
SIAM J. Sci. Comput., 2014
Appl. Math. Comput., 2014
Solving sequences of generalized least-squares problems on multi-threaded architectures.
Appl. Math. Comput., 2014
Proceedings of the High Performance Computing for Computational Science - VECPAR 2014 - 11th International Conference, Eugene, OR, USA, June 30, 2014
Proceedings of the High Performance Computing Systems. Performance Modeling, Benchmarking, and Simulation, 2014
2013
SIAM J. Sci. Comput., 2013
J. Comput. Appl. Math., 2013
Int. J. High Perform. Comput. Appl., 2013
Proceedings of the 20th European MPI Users's Group Meeting, 2013
Proceedings of the Euro-Par 2013 Parallel Processing, 2013
2012
SIGMETRICS Perform. Evaluation Rev., 2012
Correlations in sequences of generalized eigenproblems arising in Density Functional Theory.
Comput. Phys. Commun., 2012
CoRR, 2012
Appl. Math. Comput., 2012
Proceedings of the High Performance Computing for Computational Science, 2012
Proceedings of the 2012 SC Companion: High Performance Computing, 2012
2011
MR<sup>3</sup>-SMP: A symmetric tridiagonal eigensolver for multi-core architectures.
Parallel Comput., 2011
Condensed forms for the symmetric eigenvalue problem on multi-threaded architectures.
Concurr. Comput. Pract. Exp., 2011
Proceedings of the 2011 ACM Symposium on Applied Computing (SAC), TaiChung, Taiwan, March 21, 2011
Proceedings of the International Conference on Computational Science and Its Applications, 2011
Proceedings of the Computer Algebra in Scientific Computing - 13th International Workshop, 2011
2010
Proceedings of the International Conference on Computational Science, 2010
Matrix Structure Exploitation in Generalized Eigenproblems Arising in Density Functional Theory
CoRR, 2010
The Algorithm of Multiple Relatively Robust Representations for Multi-core Processors.
Proceedings of the Applied Parallel and Scientific Computing, 2010
Proceedings of the Euro-Par 2010 Parallel Processing Workshops, 2010
Proceedings of the Handbook of Cloud Computing., 2010
2009
Proceedings of the Parallel Processing and Applied Mathematics, 2009
Reduction to Condensed Forms for Symmetric Eigenvalue Problems on Multi-core Architectures.
Proceedings of the Parallel Processing and Applied Mathematics, 2009
2008
ACM Trans. Math. Softw., 2008
Families of algorithms related to the inversion of a Symmetric Positive Definite matrix.
ACM Trans. Math. Softw., 2008
Proceedings of the 13th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2008
2005
Representing linear algebra algorithms in code: the FLAME application program interfaces.
ACM Trans. Math. Softw., 2005
ACM Trans. Math. Softw., 2005
A Parallel Eigensolver for Dense Symmetric Matrices Based on Multiple Relatively Robust Representations.
SIAM J. Sci. Comput., 2005
2004
Automatic Derivation of Linear Algebra Algorithms with Application to Control Theory.
Proceedings of the Applied Parallel Computing, 2004
Proceedings of the Applied Parallel Computing, 2004