Fabrizio Petrini
Orcid: 0000-0002-4977-7107
According to our database, Fabrizio Petrini authored at least 137 papers between 1991 and 2024.
Bibliography
2024
Efficient Parallel Multi-Hop Reasoning: A Scalable Approach for Knowledge Graph Analysis.
CoRR, 2024
CoRR, 2024
Proceedings of the 36th ACM Symposium on Parallelism in Algorithms and Architectures, 2024
A High-Performance Design, Implementation, Deployment, and Evaluation of The Slim Fly Network.
Proceedings of the 21st USENIX Symposium on Networked Systems Design and Implementation, 2024
2023
The Intel Programmable and Integrated Unified Memory Architecture Graph Analytics Processor.
IEEE Micro, 2023
Dagstuhl Reports, 2023
Proceedings of the 35th ACM Symposium on Parallelism in Algorithms and Architectures, 2023
Characterizing the Scalability of Graph Convolutional Networks on Intel® PIUMA.
Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2023
Dynamic Tensor Linearization and Time Slicing for Efficient Factorization of Infinite Data Streams.
Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2023
2022
SU3_Bench on a Programmable Integrated Unified Memory Architecture (PIUMA) and How that Differs from Standard NUMA CPUs.
Proceedings of the High Performance Computing - 37th International Conference, 2022
Proceedings of the SC22: International Conference for High Performance Computing, 2022
Proceedings of the ICS '22: 2022 International Conference on Supercomputing, Virtual Event, June 28, 2022
Proceedings of the 29th IEEE International Conference on High Performance Computing, 2022
2021
A New Parallel Algorithm for Sinkhorn Word-Movers Distance and Its Performance on PIUMA and Xeon CPU.
CoRR, 2021
Performance Optimization of SU3_Bench on Xeon and Programmable Integrated Unified Memory Architecture.
CoRR, 2021
Lessons Learned from Accelerating Quicksilver on Programmable Integrated Unified Memory Architecture (PIUMA) and How That's Different from CPU.
Proceedings of the High Performance Computing - 36th International Conference, 2021
Proceedings of the 35th IEEE International Parallel and Distributed Processing Symposium, 2021
Proceedings of the ICS '21: 2021 International Conference on Supercomputing, 2021
Proceedings of the IEEE Symposium on High-Performance Interconnects, 2021
2020
Introduction to the TOPC Special Issue on Innovations in Systems for Irregular Applications, Part 2.
ACM Trans. Parallel Comput., 2020
Introduction to the TOPC Special Issue on Innovations in Systems for Irregular Applications, Part 1.
ACM Trans. Parallel Comput., 2020
An Efficient Shared-memory Parallel Sinkhorn-Knopp Algorithm to Compute the Word Mover's Distance.
CoRR, 2020
CoRR, 2020
Prune the Unnecessary: Parallel Pull-Push Louvain Algorithms with Automatic Edge Pruning.
Proceedings of the ICPP 2020: 49th International Conference on Parallel Processing, 2020
Proceedings of the 27th IEEE International Conference on High Performance Computing, 2020
2017
IEEE Trans. Parallel Distributed Syst., 2017
Exploring optimizations on shared-memory platforms for parallel triangle counting algorithms.
Proceedings of the 2017 IEEE High Performance Extreme Computing Conference, 2017
Proceedings of the 2017 IEEE High Performance Extreme Computing Conference, 2017
2016
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium, 2016
Proceedings of the 2016 IEEE International Parallel and Distributed Processing Symposium, 2016
Proceedings of the 2016 International Conference on Supercomputing, 2016
2015
Proceedings of the International Conference for High Performance Computing, 2015
Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium, 2015
2014
IEEE Micro, 2014
Proceedings of the Supercomputing - 29th International Conference, 2014
Proceedings of the Supercomputing - 29th International Conference, 2014
Traversing Trillions of Edges in Real Time: Graph Exploration on Large-Scale Parallel Machines.
Proceedings of the 2014 IEEE 28th International Parallel and Distributed Processing Symposium, 2014
Proceedings of the 2014 IEEE 28th International Parallel and Distributed Processing Symposium, 2014
2013
2012
IEEE Micro, 2012
Proceedings of the SC Conference on High Performance Computing Networking, 2012
Breaking the speed and scalability barriers for graph exploration on distributed-memory machines.
Proceedings of the SC Conference on High Performance Computing Networking, 2012
Performance evaluation of interthread communication mechanisms on multicore/multithreaded architectures.
Proceedings of the 21st International Symposium on High-Performance Parallel and Distributed Computing, 2012
2011
Comput. Sci. Res. Dev., 2011
Characterization of the Communication Patterns of Scientific Applications on Blue Gene/P.
Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing, 2011
Proceedings of the Euro-Par 2011 Parallel Processing - 17th International Conference, 2011
2010
DotStar: breaking the scalability and performance barriers in parsing regular expressions.
Comput. Sci. Res. Dev., 2010
IEEE Comput. Archit. Lett., 2010
Proceedings of the Conference on High Performance Computing Networking, 2010
Proceedings of the 24th IEEE International Symposium on Parallel and Distributed Processing, 2010
Proceedings of the Euro-Par 2010 - Parallel Processing, 16th International Euro-Par Conference, Ischia, Italy, August 31, 2010
Proceedings of the Scientific Computing with Multicore and Accelerators, 2010
Proceedings of the Scientific Computing with Multicore and Accelerators, 2010
2009
IEEE Trans. Parallel Distributed Syst., 2009
Comput. Sci. Res. Dev., 2009
Proceedings of the ACM/IEEE Conference on High Performance Computing, 2009
Fulcrum's FocalPoint FM4000: A Scalable, Low-Latency 10GigE Switch for High-Performance Data Centers.
Proceedings of the 17th IEEE Symposium on High Performance Interconnects, 2009
2008
IEEE Trans. Parallel Distributed Syst., 2008
Proceedings of the 22nd IEEE International Symposium on Parallel and Distributed Processing, 2008
Proceedings of the 5th Conference on Computing Frontiers, 2008
2007
Towards Fault Resilient Global Arrays.
Proceedings of the Parallel Computing: Architectures, 2007
Challenges in Mapping Graph Exploration Algorithms on Advanced Multi-core Processors.
Proceedings of the 21th International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007
Proceedings of the 21th International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007
Multicore Surprises: Lessons Learned from Optimizing Sweep3D on the Cell Broadband Engine.
Proceedings of the 21th International Parallel and Distributed Processing Symposium (IPDPS 2007), 2007
Proceedings of the 2007 IEEE International Conference on Cluster Computing, 2007
2006
IEEE Trans. Computers, 2006
Int. J. High Perform. Comput. Netw., 2006
A Locality-Aware Cooperative Cache Management Protocol to Improve Network File System Performance.
Proceedings of the 26th IEEE International Conference on Distributed Computing Systems (ICDCS 2006), 2006
2005
IEEE Trans. Parallel Distributed Syst., 2005
Transparent, Incremental Checkpointing at Kernel Level: a Foundation for Fault Tolerance for Parallel Computers.
Proceedings of the ACM/IEEE SC2005 Conference on High Performance Networking and Computing, 2005
Proceedings of the Recent Advances in Parallel Virtual Machine and Message Passing Interface, 2005
Current Practice and a Direction Forward in Checkpoint/Restart Implementations for Fault Tolerance.
Proceedings of the 19th International Parallel and Distributed Processing Symposium (IPDPS 2005), 2005
Proceedings of the 19th International Parallel and Distributed Processing Symposium (IPDPS 2005), 2005
Proceedings of the 13th Annual IEEE Symposium on High Performance Interconnects (HOTIC 2005), 2005
2004
Int. J. High Perform. Comput. Appl., 2004
Proceedings of the ACM/IEEE SC2004 Conference on High Performance Networking and Computing, 2004
Proceedings of the 18th International Parallel and Distributed Processing Symposium (IPDPS 2004), 2004
System-Level Fault-Tolerance in Large-Scale Parallel Machines with Buffered Coscheduling.
Proceedings of the 18th International Parallel and Distributed Processing Symposium (IPDPS 2004), 2004
Proceedings of the 33rd International Conference on Parallel Processing (ICPP 2004), 2004
What are the future trends in high-performance interconnects for parallel computers? [Panel 1].
Proceedings of the 12th Annual IEEE Symposium on High Performance Interconnects, 2004
Proceedings of the Euro-Par 2004 Parallel Processing, 2004
Proceedings of the Euro-Par 2004 Parallel Processing, 2004
2003
Concurr. Comput. Pract. Exp., 2003
The Case of the Missing Supercomputer Performance: Achieving Optimal Performance on the 8,192 Processors of ASCI Q.
Proceedings of the ACM/IEEE SC2003 Conference on High Performance Networking and Computing, 2003
Proceedings of the ACM/IEEE SC2003 Conference on High Performance Networking and Computing, 2003
BCS-MPI: A New Approach in the System Software Design for Large-Scale Parallel Computers.
Proceedings of the ACM/IEEE SC2003 Conference on High Performance Networking and Computing, 2003
Proceedings of the ACM/IEEE SC2003 Conference on High Performance Networking and Computing, 2003
Proceedings of the Job Scheduling Strategies for Parallel Processing, 2003
Flexible CoScheduling: Mitigating Load Imbalance and Improving Utilization of Heterogeneous Resources.
Proceedings of the 17th International Parallel and Distributed Processing Symposium (IPDPS 2003), 2003
Proceedings of the 11th Annual IEEE Symposium on High Performance Interconnects, 2003
2002
Proceedings of the 2002 ACM/IEEE conference on Supercomputing, 2002
Performance Evaluation of I/O Traffic and Placement of I/O Nodes on a High Performance Network.
Proceedings of the 16th International Parallel and Distributed Processing Symposium (IPDPS 2002), 2002
Proceedings of the 2002 IEEE International Conference on Cluster Computing (CLUSTER 2002), 2002
2001
Parallel Algorithms Appl., 2001
Proceedings of the 2001 ACM/IEEE conference on Supercomputing, 2001
Proceedings of the IEEE International Symposium on Network Computing and Applications (NCA 2001), 2001
Proceedings of the 15th International Parallel & Distributed Processing Symposium (IPDPS-01), 2001
Proceedings of the 30th International Workshops on Parallel Processing (ICPP 2001 Workshops), 2001
Proceedings of the Ninth Symposium on High Performance Interconnects, 2001
2000
Comput. Artif. Intell., 2000
Proceedings of the Job Scheduling Strategies for Parallel Processing, IPDPS 2000 Workshop, 2000
Buffered Coscheduling: A New Methodology for Multitasking Parallel Jobs on Distributed Systems.
Proceedings of the 14th International Parallel & Distributed Processing Symposium (IPDPS'00), 2000
Proceedings of the 2000 International Conference on Parallel Processing, 2000
Proceedings of the 20th International Conference on Distributed Computing Systems, 2000
1999
A New Approach to Parallel Program Development and Scheduling of Parallel Jobs on Distributed Systems.
Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications, 1999
1998
Int. J. Found. Comput. Sci., 1998
Proceedings of the 12th International Parallel Processing Symposium / 9th Symposium on Parallel and Distributed Processing (IPPS/SPDP '98), March 30, 1998
1997
Routing in Bidirectional k-ary n-cubes with the Red Rover Algorithm.
Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications, 1997
On the Reduction of Deadlock Frequency by Limiting Message Injection in Wormhole Networks.
Proceedings of the Parallel Computer Routing and Communication, 1997
Performance Analysis of Minimal Adaptive Wormhole Routing with Time-Dependent Deadlock Recovery.
Proceedings of the 11th International Parallel Processing Symposium (IPPS '97), 1997
Proceedings of the 11th International Parallel Processing Symposium (IPPS '97), 1997
Proceedings of the 1997 International Conference on Parallel Processing (ICPP '97), 1997
Proceedings of the Fourth International on High-Performance Computing, 1997
Proceedings of the Euro-Par '97 Parallel Processing, 1997
SMART: A Simulator of Massive Architectures and Topologies.
Proceedings of the IASTED International Conference on Parallel and Distributed Systems, 1997
Proceedings of the 1997 Conference on Parallel Architectures and Compilation Techniques (PACT '97), 1997
1996
Minimal vs. non Minimal Adaptive Routing on k-ary n-cubes.
Proceedings of the International Conference on Parallel and Distributed Processing Techniques and Applications, 1996
Latency and Bandwidth Requirements of Massively Parallel Programs: FFT as a Case Study.
Proceedings of the Euro-Par '96 Parallel Processing, 1996
1991
Proceedings of the Fifteenth Annual International Computer Software and Applications Conference, 1991