Wonbeom Lee
According to our database, Wonbeom Lee authored at least 2 papers in 2024.
Bibliography
2024
InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management.
Proceedings of the 18th USENIX Symposium on Operating Systems Design and Implementation, 2024
Tender: Accelerating Large Language Models via Tensor Decomposition and Runtime Requantization.
Proceedings of the 51st ACM/IEEE Annual International Symposium on Computer Architecture, 2024