Alessio Netti

According to our database, Alessio Netti authored at least 16 papers between 2017 and 2023.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2023
Mixed precision support in HPC applications: What about reliability?
J. Parallel Distributed Comput., November, 2023

HPC Hardware Design Reliability Benchmarking With HDFIT.
IEEE Trans. Parallel Distributed Syst., March, 2023

2022
Holistic and Portable Operational Data Analytics on Production HPC Systems.
PhD thesis, 2022

Operational Data Analytics in practice: Experiences from design to deployment in production HPC environments.
Parallel Comput., 2022

2021
Correlation-wise Smoothing: Lightweight Knowledge Extraction for HPC Monitoring Data.
Proceedings of the 35th IEEE International Parallel and Distributed Processing Symposium, 2021

A Conceptual Framework for HPC Operational Data Analytics.
Proceedings of the IEEE International Conference on Cluster Computing, 2021

2020
A machine learning approach to online fault classification in HPC systems.
Future Gener. Comput. Syst., 2020

AccaSim: a customizable workload management simulator for job dispatching research in HPC systems.
Clust. Comput., 2020

Characterizing HPC Performance Variation with Monitoring and Unsupervised Learning.
Proceedings of the High Performance Computing, 2020

DCDB Wintermute: Enabling Online and Holistic Operational Data Analytics on HPC Systems.
Proceedings of the HPDC '20: The 29th International Symposium on High-Performance Parallel and Distributed Computing, 2020

2019
From facility to application sensor data: modular, continuous and holistic monitoring with DCDB.
Proceedings of the International Conference for High Performance Computing, 2019

Towards a Predictive Energy Model for HPC Runtime Systems Using Supervised Learning.
Proceedings of the Euro-Par 2019: Parallel Processing Workshops, 2019

Online Fault Classification in HPC Systems Through Machine Learning.
Proceedings of the Euro-Par 2019: Parallel Processing, 2019

2018
Heterogeneity-Aware Resource Allocation in HPC Systems.
Proceedings of the High Performance Computing - 33rd International Conference, 2018

FINJ: A Fault Injection Tool for HPC Systems.
Proceedings of the Euro-Par 2018: Parallel Processing Workshops, 2018

2017
AccaSim: An HPC Simulator for Workload Management.
Proceedings of the High Performance Computing - 4th Latin American Conference, 2017

