2025

Every FLOP Counts: Scaling a 300B Mixture-of-Experts LING LLM without Premium GPUs.

Ling Team

Binwei Zeng

CoRR, March, 2025

2024

Similarity is Not All You Need: Endowing Retrieval Augmented Generation with Multi Layered Thoughts.

CoRR, 2024

AntBatchInfer: Elastic Batch Inference in the Kubernetes Cluster.

CoRR, 2024

AntDT: A Self-Adaptive Distributed Training Framework for Leader and Straggler Nodes.

CoRR, 2024

M2-Encoder: Advancing Bilingual Image-Text Understanding by Large-scale Efficient Pretraining.

CoRR, 2024

Rethinking Memory and Communication Costs for Efficient Data Parallel Training of Large Language Models.

Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

AntDT: A Self-Adaptive Distributed Training Framework for Leader and Straggler Nodes.

Proceedings of the 40th IEEE International Conference on Data Engineering, 2024

2023

An Adaptive Placement and Parallelism Framework for Accelerating RLHF Training.

CoRR, 2023

Rethinking Memory and Communication Cost for Efficient Large Language Model Training.

CoRR, 2023

G-Meta: Distributed Meta Learning in GPU Clusters for Large-Scale Recommender Systems.

Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

2022

A Study on the Determinants of Stock Returns, in Comparison of the Fama-French Models.

Proceedings of the IC4E 2022: 13th International Conference on E-Education, E-Business, E-Management, and E-Learning, Tokyo, Japan, January 14, 2022

2020

Trust in AutoML: exploring information needs for establishing trust in automated machine learning systems.

Proceedings of the IUI '20: 25th International Conference on Intelligent User Interfaces, 2020