2024
P/D-Serve: Serving Disaggregated Large Language Model at Scale.
CoRR, 2024