2025
Understanding Stragglers in Large Model Training Using What-if Analysis.
CoRR, May 2025

Helios: Efficient Distributed Dynamic Graph Sampling for Online GNN Inference.
Proceedings of the 30th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, 2025

2023
Helios: An Efficient Out-of-core GNN Training System on Terabyte-scale Graphs with In-memory Performance.
CoRR, 2023

Legion: Automatically Pushing the Envelope of Multi-GPU System for Billion-Scale GNN Training.
Proceedings of the 2023 USENIX Annual Technical Conference, 2023