2024

Context Compression and Extraction: Efficiency Inference of Large Language Models.

[DOI]

Proceedings of the Advanced Intelligent Computing Technology and Applications, 2024