Qingxuan Kang
Orcid: 0009-0005-5272-0231
According to our database, Qingxuan Kang authored at least 6 papers between 2023 and 2024.
Bibliography
2024
AttentionStore: Cost-effective Attention Reuse across Multi-turn Conversations in Large Language Model Serving.
CoRR, 2024
Proceedings of the 2024 USENIX Annual Technical Conference, 2024
Cost-Efficient Large Language Model Serving for Multi-turn Conversations with CachedAttention.
Proceedings of the 2024 USENIX Annual Technical Conference, 2024
2023
CoRR, 2023
Proceedings of the 50th Annual International Symposium on Computer Architecture, 2023