Similarity is Not All You Need: Endowing Retrieval Augmented Generation with Multi Layered Thoughts.
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2024
AntBatchInfer: Elastic Batch Inference in the Kubernetes Cluster.
CoRR, 2024
AntDT: A Self-Adaptive Distributed Training Framework for Leader and Straggler Nodes.
,
,
,
,
,
,
,
,
,
,
CoRR, 2024
M<sub>2</sub>-Encoder: Advancing Bilingual Image-Text Understanding by Large-scale Efficient Pretraining.
CoRR, 2024
Rethinking Memory and Communication Costs for Efficient Data Parallel Training of Large Language Models.
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
AntDT: A Self-Adaptive Distributed Training Framework for Leader and Straggler Nodes.
,
,
,
,
,
,
,
,
,
,
Proceedings of the 40th IEEE International Conference on Data Engineering, 2024
An Adaptive Placement and Parallelism Framework for Accelerating RLHF Training.
CoRR, 2023
Rethinking Memory and Communication Cost for Efficient Large Language Model Training.
,
,
,
,
,
,
,
,
,
,
CoRR, 2023
G-Meta: Distributed Meta Learning in GPU Clusters for Large-Scale Recommender Systems.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023
A Study on the Determinants of Stock Returns, in Comparison of the Fama-French Models.
Proceedings of the IC4E 2022: 13th International Conference on E-Education, E-Business, E-Management, and E-Learning, Tokyo, Japan, January 14, 2022
Trust in AutoML: exploring information needs for establishing trust in automated machine learning systems.
Proceedings of the IUI '20: 25th International Conference on Intelligent User Interfaces, 2020