Understanding Stragglers in Large Model Training Using What-if Analysis.
CoRR, May 2025
Helios: Efficient Distributed Dynamic Graph Sampling for Online GNN Inference.
Proceedings of the 30th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, 2025
Helios: An Efficient Out-of-core GNN Training System on Terabyte-scale Graphs with In-memory Performance.
CoRR, 2023
Legion: Automatically Pushing the Envelope of Multi-GPU System for Billion-Scale GNN Training.
Proceedings of the 2023 USENIX Annual Technical Conference, 2023