Woosuk Kwon

Orcid: 0009-0008-8870-4892

According to our database, Woosuk Kwon authored at least 8 papers between 2016 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Optimizing Speculative Decoding for Serving Large Language Models Using Goodput.
CoRR, 2024

2023
Efficient Memory Management for Large Language Model Serving with PagedAttention.
Proceedings of the 29th Symposium on Operating Systems Principles, 2023

SkyPilot: An Intercloud Broker for Sky Computing.
Proceedings of the 20th USENIX Symposium on Networked Systems Design and Implementation, 2023

2022
A Fast Post-Training Pruning Framework for Transformers.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Learned Token Pruning for Transformers.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

2020
Nimble: Lightweight and Parallel GPU Task Scheduling for Deep Learning.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Graphene: Strong yet Lightweight Row Hammer Protection.
Proceedings of the 53rd Annual IEEE/ACM International Symposium on Microarchitecture, 2020

2016
The ATSC Link-layer Protocol (ALP): Design and Efficiency Evaluation.
IEEE Trans. Broadcast., 2016
