Kuntai Du

Orcid: 0000-0002-3964-4079

According to our database1, Kuntai Du authored at least 18 papers between 2020 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Do Large Language Models Need a Content Delivery Network?
CoRR, 2024

CacheBlend: Fast Large Language Model Serving for RAG with Cached Knowledge Fusion.
CoRR, 2024

Earth+: on-board satellite imagery compression leveraging historical earth observations.
CoRR, 2024

Chatterbox: Robust Transport for LLM Token Streaming under Unstable Network.
CoRR, 2024

CacheGen: KV Cache Compression and Streaming for Fast Large Language Model Serving.
Proceedings of the ACM SIGCOMM 2024 Conference, 2024

ChameleonAPI: Automatic and Efficient Customization of Neural Networks for ML Applications.
Proceedings of the 18th USENIX Symposium on Operating Systems Design and Implementation, 2024

GRACE: Loss-Resilient Real-Time Video through Neural Codecs.
Proceedings of the 21st USENIX Symposium on Networked Systems Design and Implementation, 2024

Eloquent: A More Robust Transmission Scheme for LLM Token Streaming.
Proceedings of the 2024 SIGCOMM Workshop on Networks for AI Computing, 2024

2023
Run-Time Prevention of Software Integration Failures of Machine Learning APIs.
Proc. ACM Program. Lang., October, 2023

CacheGen: Fast Context Loading for Language Model Applications.
CoRR, 2023

Automatic and Efficient Customization of Neural Networks for ML Applications.
CoRR, 2023

OneAdapt: Fast Adaptation for Deep Learning Applications via Backpropagation.
Proceedings of the 2023 ACM Symposium on Cloud Computing, SoCC 2023, 2023

2022
AccMPEG: Optimizing Video Encoding for Video Analytics.
CoRR, 2022

Understanding the potential of server-driven edge video analytics.
Proceedings of the HotMobile '22: The 23rd International Workshop on Mobile Computing Systems and Applications, Tempe, Arizona, USA, March 9, 2022

AccMPEG: Optimizing Video Encoding for Accurate Video Analytics.
Proceedings of the Fifth Conference on Machine Learning and Systems, 2022

Minimizing packet retransmission for real-time video analytics.
Proceedings of the 13th Symposium on Cloud Computing, SoCC 2022, 2022

2020
Server-Driven Video Streaming for Deep Learning Inference.
Proceedings of the SIGCOMM '20: Proceedings of the 2020 Annual conference of the ACM Special Interest Group on Data Communication on the applications, 2020

Renovating road signs for infrastructure-to-vehicle networking: a visible light backscatter communication and networking approach.
Proceedings of the MobiCom '20: The 26th Annual International Conference on Mobile Computing and Networking, 2020


  Loading...