Filippo Mantovani

Orcid: 0000-0003-3559-4825

According to our database, Filippo Mantovani authored at least 53 papers between 2006 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
RAVE: RISC-V Analyzer of Vector Executions, a QEMU tracing plugin.
CoRR, 2024

Graph Computing on Long Vector Architectures (Yes, It Works!).
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2024

Exploiting long vectors with a CFD code: a co-design show case.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2024

NVIDIA Grace Superchip Early Evaluation for HPC Applications.
Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region Workshops, 2024

2023
HPCG on long-vector architectures: Evaluation and optimization on NEC SX-Aurora and RISC-V.
Future Gener. Comput. Syst., June, 2023

Top-Down Models across CPU Architectures: Applicability and Comparison in a High-Performance Computing Environment.
Inf., 2023

Compressed Real Numbers for AI: a case-study using a RISC-V CPU.
CoRR, 2023

Acceleration with long vector architectures: Implementation and evaluation of the FFT kernel on NEC SX-Aurora and RISC-V vector extension.
Concurr. Comput. Pract. Exp., 2023

Software Development Vehicles to Enable Extended and Early Co-design: A RISC-V and HPC Case of Study.
Proceedings of the High Performance Computing, 2023

Short Reasons for Long Vectors in HPC CPUs: A Study Based on RISC-V.
Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, 2023

2022
Asymmetric HMMs for Online Ball-Bearing Health Assessments.
IEEE Internet Things J., 2022

A portable coding strategy to exploit vectorization on combustion simulations.
CoRR, 2022

2021
Efficiently running SpMV on long vector architectures.
Proceedings of the PPoPP '21: 26th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2021

Accelerating FFT Using NEC SX-Aurora Vector Engine.
Proceedings of the Euro-Par 2021: Parallel Processing Workshops, 2021

Cluster of emerging technology: evaluation of a production HPC system based on A64FX.
Proceedings of the IEEE International Conference on Cluster Computing, 2021

2020
Runtime mechanisms to survive new HPC architectures: A use case in human respiratory simulations.
Int. J. High Perform. Comput. Appl., 2020

Performance and energy consumption of HPC workloads on a cluster based on Arm ThunderX2 CPU.
Future Gener. Comput. Syst., 2020

Performance study of HPC applications on an Arm-based cluster using a generic efficiency model.
Proceedings of the 28th Euromicro International Conference on Parallel, 2020

Benchmarking of state-of-the-art HPC Clusters with a Production CFD Code.
Proceedings of the PASC '20: Platform for Advanced Scientific Computing Conference, Geneva, Switzerland, June 29, 2020

CoreNEURON: Performance and Energy Efficiency Evaluation on Intel and Arm CPUs.
Proceedings of the IEEE International Conference on Cluster Computing, 2020

2019
Containers in HPC: A Scalability and Portability Study in Production Biological Simulations.
Proceedings of the 2019 IEEE International Parallel and Distributed Processing Symposium, 2019

Design Space Exploration of Next-Generation HPC Machines.
Proceedings of the 2019 IEEE International Parallel and Distributed Processing Symposium, 2019

Open-Source Shared Memory implementation of the HPCG benchmark: analysis, improvements and evaluation on Cavium ThunderX2.
Proceedings of the 17th International Conference on High Performance Computing & Simulation, 2019

TensorFlow on State-of-the-Art HPC Clusters: A Machine Learning use Case.
Proceedings of the 19th IEEE/ACM International Symposium on Cluster, 2019

2018
Efficient CFD code implementation for the ARM-based Mont-Blanc architecture.
Future Gener. Comput. Syst., 2018

Filling the gap between education and industry: evidence-based methods for introducing undergraduate students to HPC.
Proceedings of the 2018 IEEE/ACM Workshop on Education for High-Performance Computing, 2018

Teaching HPC Systems and Parallel Programming with Small-Scale Clusters.
Proceedings of the 2018 IEEE/ACM Workshop on Education for High-Performance Computing, 2018

Advanced Performance Analysis of HPC Workloads on Cavium ThunderX.
Proceedings of the 2018 International Conference on High Performance Computing & Simulation, 2018

Computational Fluid and Particle Dynamics Simulations for Respiratory System: Runtime Optimization on an Arm Cluster.
Proceedings of the 47th International Conference on Parallel Processing, 2018

2017
Energy Analysis of a 4D Variational Data Assimilation Algorithm and Evaluation on ARM-Based HPC Systems.
Proceedings of the Parallel Processing and Applied Mathematics, 2017

Implementation of the K-Means Algorithm on Heterogeneous Devices: A Use Case Based on an Industrial Dataset.
Proceedings of the Parallel Computing is Everywhere, 2017

Multi-Node Advanced Performance and Power Analysis with Paraver.
Proceedings of the Parallel Computing is Everywhere, 2017

2016

2014
Janus II: A new generation application-driven computer for spin-system simulations.
Comput. Phys. Commun., 2014

High Performance Computing based on embedded processors.
Proceedings of the International Conference on High Performance Computing & Simulation, 2014

2013
An Optimized Lattice Boltzmann Code for BlueGene/Q.
Proceedings of the Parallel Processing and Applied Mathematics, 2013

Early Experience on Porting and Running a Lattice Boltzmann Code on the Xeon-Phi Co-Processor.
Proceedings of the International Conference on Computational Science, 2013

2012
Reconfigurable computing for Monte Carlo simulations: results and prospects of the Janus project.
CoRR, 2012



2011
Optimization of Multi-Phase Compressible Lattice Boltzmann Codes on Massively Parallel Multi-Core Systems.
Proceedings of the International Conference on Computational Science, 2011

Lattice Boltzmann method simulations on massively parallel multi-core architectures.
Proceedings of the 2011 Spring Simulation Multi-conference, 2011

A Multi-GPU Implementation of a D2Q37 Lattice Boltzmann Code.
Proceedings of the Parallel Processing and Applied Mathematics, 2011

2010
Lattice Boltzmann fluid-dynamics on the QPACE supercomputer.
Proceedings of the International Conference on Computational Science, 2010

Monte Carlo Simulations of Spin Systems on Multi-core Processors.
Proceedings of the Applied Parallel and Scientific Computing, 2010

2009
Janus: a reconfigurable system for scientific computing.
PhD thesis, 2009

Janus: An FPGA-Based System for High-Performance Scientific Computing.
Comput. Sci. Eng., 2009

Monte Carlo Simulations of Spin Glass Systems on the Cell Broadband Engine.
Proceedings of the Parallel Processing and Applied Mathematics, 2009

2008
Simulating spin systems on IANUS, an FPGA-based computer.
Comput. Phys. Commun., 2008

2007
IANUS: an FPGA-based System for High Performance Scientific Computing.
CoRR, 2007


2006
Ianus: An Adaptive FPGA Computer.
Comput. Sci. Eng., 2006

Poster reception - IANUS: scientific computing on an FPGA-based architecture.
Proceedings of the ACM/IEEE SC2006 Conference on High Performance Networking and Computing, 2006

