×
2024
P/D-Serve: Serving Disaggregated Large Language Model at Scale.
[DOI]
Yibo Jin
,
Tao Wang
,
Huimin Lin
,
Mingyang Song
,
Peiyang Li
,
Yipeng Ma
,
Yicheng Shan
,
Zhengfan Yuan
,
Cailong Li
,
Yajing Sun
,
Tiandeng Wu
,
Xing Chu
,
Ruizhi Huan
,
Li Ma
,
Xiao You
,
Wenting Zhou
,
Yunpeng Ye
,
Wen Liu
,
Xiangkun Xu
,
Yongsheng Zhang
,
Tiantian Dong
,
Jiawei Zhu
,
Zhe Wang
,
Xijian Ju
,
Jianxun Song
,
Haoliang Cheng
,
Xiaojing Li
,
Jiandong Ding
,
Hefei Guo
,
Zhengyong Zhang
CoRR, 2024