Flavio Vella

Orcid: 0000-0002-5676-9228

Affiliations:
  • University of Trento, Italy


According to our database1, Flavio Vella authored at least 50 papers between 2010 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Assessing the Impact of Compiler Optimizations on GPUs Reliability.
ACM Trans. Archit. Code Optim., June, 2024

The Landscape of GPU-Centric Communication.
CoRR, 2024

Dependable Classical-Quantum Computer Systems Engineering.
CoRR, 2024

cuVegas: Accelerate Multidimensional Monte Carlo Integration through a Parallelized CUDA-based Implementation of the VEGAS Enhanced Algorithm.
CoRR, 2024

State of practice: evaluating GPU performance of state vector and tensor network methods.
CoRR, 2024

On the Efficacy of Surface Codes in Compensating for Radiation Events in Superconducting Devices.
Proceedings of the International Conference for High Performance Computing, 2024

Exploring GPU-to-GPU Communication: Insights into Supercomputer Interconnects.
Proceedings of the International Conference for High Performance Computing, 2024

High Performance Unstructured SpMM Computation Using Tensor Cores.
Proceedings of the International Conference for High Performance Computing, 2024

Scaling Expected Force: Efficient Identification of Key Nodes in Network-Based Epidemic Models.
Proceedings of the 32nd Euromicro International Conference on Parallel, 2024

2023
A Multi-GPU Aggregation-Based AMG Preconditioner for Iterative Linear Solvers.
IEEE Trans. Parallel Distributed Syst., August, 2023


High-Performance and Programmable Attentional Graph Neural Networks with Global Tensor Formulations.
Proceedings of the International Conference for High Performance Computing, 2023

The potential of high-performance computing for the Internet of Sounds.
Proceedings of the 2023 4th International Symposium on the Internet of Sounds, 2023

2022
Dataset_ScalableEnergyGamesSolversOnGPUs.
Dataset, May, 2022

Blocking Techniques for Sparse Matrix Multiplication on Tensor Accelerators.
CoRR, 2022

ProbGraph: High-Performance and High-Accuracy Graph Mining with Probabilistic Set Representations.
Proceedings of the SC22: International Conference for High Performance Computing, 2022

Blocking Sparse Matrices to Leverage Dense-Specific Multiplication.
Proceedings of the 12th IEEE/ACM Workshop on Irregular Applications: Architectures and Algorithms, 2022

Asynchronous Distributed-Memory Triangle Counting and LCC with RMA Caching.
Proceedings of the 2022 IEEE International Parallel and Distributed Processing Symposium, 2022

2021
Scalable Energy Games Solvers on GPUs.
IEEE Trans. Parallel Distributed Syst., 2021

On the Anatomy of Predictive Models for Accelerating GPU Convolution Kernels and Beyond.
ACM Trans. Archit. Code Optim., 2021

Algorithm Design for Tensor Units.
Proceedings of the Euro-Par 2021: Parallel Processing, 2021

Analysis of SARS-CoV-2 protein interactome map.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2021

2020
A Computational Model for Tensor Core Units.
Proceedings of the SPAA '20: 32nd ACM Symposium on Parallelism in Algorithms and Architectures, 2020

2019
A Computational Model for Tensor Core Units.
CoRR, 2019

GPU-Based Parallelism for ASP-Solving.
Proceedings of the Declarative Programming and Knowledge Management, 2019

Towards a Learning-Based Performance Modeling for Accelerating Deep Neural Networks.
Proceedings of the Computational Science and Its Applications - ICCSA 2019, 2019

2018
Graph analytics on modern massively parallel systems.
PhD thesis, 2018

Multilevel Parallelism for the Exploration of Large-Scale Graphs.
IEEE Trans. Multi Scale Comput. Syst., 2018

Dynamic Merging of Frontiers for Accelerating the Evaluation of Betweenness Centrality.
ACM J. Exp. Algorithmics, 2018

Strategies and systems towards grids and clouds integration: A DBMS-based solution.
Future Gener. Comput. Syst., 2018

A model-driven approach for a new generation of adaptive libraries.
CoRR, 2018

Multi-objective autotuning of MobileNets across the full software/hardware stack.
Proceedings of the 1st on Reproducible Quality-Efficient Systems Tournament on Co-designing Pareto-efficient Deep Learning, 2018

2017
Effectively and Efficiently Supporting Grid and Cloud Integration via a DBMS-based Framework.
Proceedings of the 25th Italian Symposium on Advanced Database Systems, 2017

Scaling betweenness centrality using communication-efficient sparse matrix multiplication.
Proceedings of the International Conference for High Performance Computing, 2017

Accelerating Energy Games Solvers on Modern Architectures.
Proceedings of the Seventh Workshop on Irregular Applications: Architectures and Algorithms, 2017

Transparent Caching for RMA Systems.
Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium, 2017

2016
Algorithms and Heuristics for Scalable Betweenness Centrality Computation on Multi-GPU Systems.
CoRR, 2016

Betweenness Centrality is more Parallelizable than Dense Matrix Multiplication.
CoRR, 2016

A GPU Implementation of the ASP Computation.
Proceedings of the Practical Aspects of Declarative Languages, 2016

A DBMS-Based System for Integrating Grids and Clouds: Anatomy, Models, Functionalities.
Proceedings of the International Conference on Internet of Things and Cloud Computing, 2016

Scalable betweenness centrality on multi-GPU systems.
Proceedings of the ACM International Conference on Computing Frontiers, CF'16, 2016

2015
Solutions to the st-connectivity problem using a GPU-based distributed BFS.
J. Parallel Distributed Comput., 2015

Betweenness centrality on Multi-GPU systems.
Proceedings of the 5th Workshop on Irregular Applications - Architectures and Algorithms, 2015

Parallel Execution of the ASP Computation - an Investigation on GPUs.
Proceedings of the Technical Communications of the 31st International Conference on Logic Programming (ICLP 2015), Cork, Ireland, August 31, 2015

A Simulation Framework for Efficient Resource Management on Hybrid Systems.
Proceedings of the 18th IEEE International Conference on Computational Science and Engineering, 2015

2014
On multiple learning schemata in conflict driven solvers.
Proceedings of the 15th Italian Conference on Theoretical Computer Science, 2014

2013
CUD@ASP: Experimenting with GPGPUs in ASP solving.
Proceedings of the 28th Italian Conference on Computational Logic, 2013

2012
A Simulation Framework for Scheduling Performance Evaluation on CPU-GPU Heterogeneous System.
Proceedings of the Computational Science and Its Applications - ICCSA 2012, 2012

2011
GPU Computing in EGI Environment Using a Cloud Approach.
Proceedings of the International Conference on Computational Science and Its Applications, 2011

2010
The AES Implantation Based on OpenCL for Multi/many Core Architecture.
Proceedings of the Prodeedings of the 2010 International Conference on Computational Science and Its Applications, 2010


  Loading...