STR: Hybrid Tensor Re-Generation to Break Memory Wall for DNN Training.
IEEE Trans. Parallel Distributed Syst., August, 2023
SmartMoE: Efficiently Training Sparsely-Activated Models through Combining Offline and Online Parallelization.
Proceedings of the 2023 USENIX Annual Technical Conference, 2023
MespaConfig: Memory-Sparing Configuration Auto-Tuning for Co-Located In-Memory Cluster Computing Jobs.
IEEE Trans. Serv. Comput., 2022
A Swap Dominated Tensor Re-Generation Strategy for Training Deep Learning Models.
Proceedings of the 2022 IEEE International Parallel and Distributed Processing Symposium, 2022
MM-CPred: A Multi-task Predictive Model for Continuous-Time Event Sequences with Mixture Learning Losses.
Proceedings of the Database Systems for Advanced Applications, 2021
Scaleplus: Towards Fast Scaling of Distributed Streaming Dataflows.
Proceedings of the IEEE International Conference on Parallel & Distributed Processing with Applications, 2020
An Approach for Process Model Extraction by Multi-grained Text Classification.
Proceedings of the Advanced Information Systems Engineering, 2020
Workload-Adaptive Configuration Tuning for Hierarchical Cloud Schedulers.
IEEE Trans. Parallel Distributed Syst., 2019
AdaptiveConfig: Run-Time Configuration of Cluster Schedulers for Cloud Short-Running Jobs.
Proceedings of the 38th IEEE International Conference on Distributed Computing Systems, 2018
CloudMix: Generating Diverse and Reducible Workloads for Cloud Systems.
Proceedings of the 2017 IEEE 10th International Conference on Cloud Computing (CLOUD), 2017