2025
Every FLOP Counts: Scaling a 300B Mixture-of-Experts LING LLM without Premium GPUs.
CoRR, March, 2025

2024
Similarity is Not All You Need: Endowing Retrieval Augmented Generation with Multi Layered Thoughts.
CoRR, 2024

AntBatchInfer: Elastic Batch Inference in the Kubernetes Cluster.
CoRR, 2024

AntDT: A Self-Adaptive Distributed Training Framework for Leader and Straggler Nodes.
CoRR, 2024

M<sub>2</sub>-Encoder: Advancing Bilingual Image-Text Understanding by Large-scale Efficient Pretraining.
CoRR, 2024

Rethinking Memory and Communication Costs for Efficient Data Parallel Training of Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

AntDT: A Self-Adaptive Distributed Training Framework for Leader and Straggler Nodes.
Proceedings of the 40th IEEE International Conference on Data Engineering, 2024

2023
An Adaptive Placement and Parallelism Framework for Accelerating RLHF Training.
CoRR, 2023

Rethinking Memory and Communication Cost for Efficient Large Language Model Training.
CoRR, 2023

G-Meta: Distributed Meta Learning in GPU Clusters for Large-Scale Recommender Systems.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

2022
A Study on the Determinants of Stock Returns, in Comparison of the Fama-French Models.
Proceedings of the IC4E 2022: 13th International Conference on E-Education, E-Business, E-Management, and E-Learning, Tokyo, Japan, January 14, 2022

2020
Trust in AutoML: exploring information needs for establishing trust in automated machine learning systems.
Proceedings of the IUI '20: 25th International Conference on Intelligent User Interfaces, 2020