2024
Context Compression and Extraction: Efficiency Inference of Large Language Models.
Junyao Zhou, Ruiqing Du, Yushan Tan, Jintao Yang, Zonghao Yang, Wei Luo, Zhunchen Luo, Xian Zhou, Wenpeng Hu
Proceedings of the Advanced Intelligent Computing Technology and Applications, 2024