2025
Seed1.5-Thinking: Advancing Superb Reasoning Models with Reinforcement Learning.
CoRR, April, 2025

VAPO: Efficient and Reliable Reinforcement Learning for Advanced Reasoning Tasks.
CoRR, April, 2025

DAPO: An Open-Source LLM Reinforcement Learning System at Scale.
CoRR, March, 2025

Fully Bayesian differential Gaussian processes through stochastic differential equations.
Knowl. Based Syst., 2025

AutoCCL: Automated Collective Communication Tuning for Accelerating Distributed and Parallel DNN Training.
Proceedings of the 22nd USENIX Symposium on Networked Systems Design and Implementation, 2025

2024
Efficient Schedule Construction for Distributed Execution of Large DNN Models.
IEEE Trans. Parallel Distributed Syst., December, 2024

Adaptive Time-Varying Routing for Energy Saving and Load Balancing in Wireless Body Area Networks.
IEEE Trans. Mob. Comput., January, 2024

Natural Language Fine-Tuning.
CoRR, 2024

Flexible Bayesian Last Layer Models Using Implicit Priors and Diffusion Posterior Sampling.
CoRR, 2024

nnScaler: Constraint-Guided Parallelization Plan Generation for Deep Learning Training.
Proceedings of the 18th USENIX Symposium on Operating Systems Design and Implementation, 2024

Tessel: Boosting Distributed Execution of Large DNN Models via Flexible Schedule Search.
Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2024

Aceso: Efficient Parallel DNN Training through Iterative Bottleneck Alleviation.
Proceedings of the Nineteenth European Conference on Computer Systems, 2024

2023
SuperScaler: Supporting Flexible DNN Parallelization via a Unified Abstraction.
CoRR, 2023

2021
Efficient Data Loader for Fast Sampling-Based GNN Training on Large Graphs.
IEEE Trans. Parallel Distributed Syst., 2021

Flexible high-resolution object detection on edge devices with tunable latency.
Proceedings of ACM MobiCom '21: The 27th Annual International Conference on Mobile Computing and Networking, 2021

Accelerating GNN training with locality-aware partial execution.
Proceedings of APSys '21: 12th ACM SIGOPS Asia-Pacific Workshop on Systems, 2021

2020
PaGraph: Scaling GNN training on large graphs via computation-aware caching.
Proceedings of SoCC '20: ACM Symposium on Cloud Computing, 2020

2019
Occlumency: Privacy-preserving Remote Deep-learning Inference Using SGX.
Proceedings of the 25th Annual International Conference on Mobile Computing and Networking, 2019

2014
LED Virtual Simulation based on Web3D.
J. Multim., 2014