2025
Earth+: On-Board Satellite Imagery Compression Leveraging Historical Earth Observations.
Proceedings of the 30th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2025
2024
IEEE Netw., January, 2024
RAGServe: Fast Quality-Aware RAG Systems with Configuration Adaptation.
CoRR, 2024
LLMSteer: Improving Long-Context LLM Inference by Steering Attention on Reused Contexts.
CoRR, 2024
Loss-tolerant neural video codec aware congestion control for real time video communication.
CoRR, 2024
DroidSpeak: Enhancing Cross-LLM Communication.
CoRR, 2024
SwiftQueue: Optimizing Low-Latency Applications with Swift Packet Queuing.
CoRR, 2024
Do Large Language Models Need a Content Delivery Network?
CoRR, 2024
CacheBlend: Fast Large Language Model Serving for RAG with Cached Knowledge Fusion.
CoRR, 2024
Earth+: on-board satellite imagery compression leveraging historical earth observations.
CoRR, 2024
Large Language Model Adaptation for Networking.
CoRR, 2024
Chatterbox: Robust Transport for LLM Token Streaming under Unstable Network.
CoRR, 2024
NetLLM: Adapting Large Language Models for Networking.
Proceedings of the ACM SIGCOMM 2024 Conference, 2024
CacheGen: KV Cache Compression and Streaming for Fast Large Language Model Serving.
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the ACM SIGCOMM 2024 Conference, 2024
ChameleonAPI: Automatic and Efficient Customization of Neural Networks for ML Applications.
Proceedings of the 18th USENIX Symposium on Operating Systems Design and Implementation, 2024
ARTEMIS: Adaptive Bitrate Ladder Optimization for Live Video Streaming.
Proceedings of the 21st USENIX Symposium on Networked Systems Design and Implementation, 2024
GRACE: Loss-Resilient Real-Time Video through Neural Codecs.
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the 21st USENIX Symposium on Networked Systems Design and Implementation, 2024
Towards Domain-Specific Network Transport for Distributed DNN Training.
Proceedings of the 21st USENIX Symposium on Networked Systems Design and Implementation, 2024
Eloquent: A More Robust Transmission Scheme for LLM Token Streaming.
Proceedings of the 2024 SIGCOMM Workshop on Networks for AI Computing, 2024
Concierge: Towards Accuracy-Driven Bandwidth Allocation for Video Analytics Applications in Edge Network.
Proceedings of the IEEE International Conference on Edge Computing and Communications, 2024
Fed2PKD: Bridging Model Diversity in Federated Learning via Two-Pronged Knowledge Distillation.
Proceedings of the 17th IEEE International Conference on Cloud Computing, 2024
2023
CocoSketch: High-Performance Sketch-Based Measurement Over Arbitrary Partial Key Query.
IEEE/ACM Trans. Netw., December, 2023
Run-Time Prevention of Software Integration Failures of Machine Learning APIs.
Proc. ACM Program. Lang., October, 2023
Enabling Perception-Driven Optimization in Networking.
SIGMETRICS Perform. Evaluation Rev., September, 2023
Enabling Edge-Cloud Video Analytics for Robotics Applications.
IEEE Trans. Cloud Comput., 2023
VidPlat: A Tool for Fast Crowdsourcing of Quality-of-Experience Measurements.
CoRR, 2023
CacheGen: Fast Context Loading for Language Model Applications.
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2023
Automatic and Efficient Customization of Neural Networks for ML Applications.
CoRR, 2023
Grace++: Loss-Resilient Real-Time Video Communication under High Network Latency.
CoRR, 2023
RECL: Responsive Resource-Efficient Continuous Learning for Video Analytics.
Proceedings of the 20th USENIX Symposium on Networked Systems Design and Implementation, 2023
Towards Optimal Preemptive GPU Time-Sharing for Edge Model Serving.
Proceedings of the 9th International Workshop on Container Technologies and Container Clouds, 2023
Gemini: Divide-and-Conquer for Practical Learning-Based Internet Congestion Control.
Proceedings of the IEEE INFOCOM 2023, 2023
Estimating WebRTC Video QoE Metrics Without Using Application Headers.
Proceedings of the 2023 ACM on Internet Measurement Conference, 2023
OneAdapt: Fast Adaptation for Deep Learning Applications via Backpropagation.
Proceedings of the 2023 ACM Symposium on Cloud Computing, SoCC 2023, 2023
Online Profiling and Adaptation of Quality Sensitivity for Internet Video.
Proceedings of the 2023 ACM Symposium on Cloud Computing, SoCC 2023, 2023
Raising the Level of Abstraction for Time-State Analytics With the Timeline Framework.
Proceedings of the 13th Conference on Innovative Data Systems Research, 2023
2022
Enabling Personalized Video Quality Optimization with VidHoc.
CoRR, 2022
GRACE: Loss-Resilient Real-Time Video Communication Using Data-Scalable Autoencoder.
CoRR, 2022
AccMPEG: Optimizing Video Encoding for Video Analytics.
CoRR, 2022
Automatic Curriculum Generation for Learning Adaptation in Networking.
CoRR, 2022
Understanding the potential of server-driven edge video analytics.
Proceedings of the HotMobile '22: The 23rd International Workshop on Mobile Computing Systems and Applications, Tempe, Arizona, USA, March 9, 2022
Genet: automatic curriculum generation for learning adaptation in networking.
Proceedings of the SIGCOMM '22: ACM SIGCOMM 2022 Conference, Amsterdam, The Netherlands, August 22, 2022
Privid: Practical, Privacy-Preserving Video Analytics Queries.
Proceedings of the 19th USENIX Symposium on Networked Systems Design and Implementation, 2022
Ekya: Continuous Learning of Video Analytics Models on Edge Compute Servers.
Proceedings of the 19th USENIX Symposium on Networked Systems Design and Implementation, 2022
Bandwidth-Efficient Multi-video Prefetching for Short Video Streaming.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022
AccMPEG: Optimizing Video Encoding for Accurate Video Analytics.
Proceedings of the Fifth Conference on Machine Learning and Systems, 2022
FedDGIC: Reliable and Efficient Asynchronous Federated Learning with Gradient Compensation.
Proceedings of the 28th IEEE International Conference on Parallel and Distributed Systems, 2022
Minimizing packet retransmission for real-time video analytics.
Proceedings of the 13th Symposium on Cloud Computing, SoCC 2022, 2022
2021
BDS+: An Inter-Datacenter Data Replication System With Dynamic Bandwidth Separation.
IEEE/ACM Trans. Netw., 2021
CocoSketch: high-performance sketch-based measurement over arbitrary partial key query.
Proceedings of the ACM SIGCOMM 2021 Conference, Virtual Event, USA, August 23-27, 2021., 2021
SENSEI: Aligning Video Streaming Quality with Dynamic User Sensitivity.
Proceedings of the 18th USENIX Symposium on Networked Systems Design and Implementation, 2021
Precise error estimation for sketch-based flow measurement.
Proceedings of the IMC '21: ACM Internet Measurement Conference, 2021
Towards Performance Clarity of Edge Video Analytics.
Proceedings of the 6th IEEE/ACM Symposium on Edge Computing, 2021
Sayer: Using Implicit Feedback to Optimize System Policies.
Proceedings of the SoCC '21: ACM Symposium on Cloud Computing, 2021
2020
Domain-specific Communication Optimization for Distributed DNN Training.
CoRR, 2020
A New Abstraction for Internet QoE Optimization.
CoRR, 2020
Server-Driven Video Streaming for Deep Learning Inference.
Proceedings of the SIGCOMM '20: Proceedings of the 2020 Annual conference of the ACM Special Interest Group on Data Communication on the applications, 2020
Spatula: Efficient cross-camera video analytics on large camera networks.
Proceedings of the 5th IEEE/ACM Symposium on Edge Computing, 2020
2019
Scaling Video Analytics Systems to Large Camera Deployments.
Proceedings of the 20th International Workshop on Mobile Computing Systems and Applications, 2019
E2E: embracing user heterogeneity to improve quality of experience on the web.
Proceedings of the ACM Special Interest Group on Data Communication, 2019
Zooming in on wide-area latencies to a global cloud provider.
Proceedings of the ACM Special Interest Group on Data Communication, 2019
Pano: optimizing 360° video streaming with a better understanding of quality perception.
Proceedings of the ACM Special Interest Group on Data Communication, 2019
Networked Cameras Are the New Big Data Clusters.
Proceedings of the 2019 Workshop on Hot Topics in Video Analytics and Intelligent Edges, 2019
Bridging the Edge-Cloud Barrier for Real-time Advanced Vision Analytics.
Proceedings of the 11th USENIX Workshop on Hot Topics in Cloud Computing, 2019
Rethinking Transport Layer Design for Distributed Machine Learning.
Proceedings of the 3rd Asia-Pacific Workshop on Networking, 2019
2018
Machine Learning for Networking: Workflow, Advances and Opportunities.
IEEE Netw., 2018
ReXCam: Resource-Efficient, Cross-Camera Video Analytics at Enterprise Scale.
CoRR, 2018
Seeding Deep Learning using Wireless Localization.
CoRR, 2018
Chameleon: scalable adaptation of video analytics.
Proceedings of the 2018 Conference of the ACM Special Interest Group on Data Communication, 2018
Reinventing Video Streaming for Distributed Vision Analytics.
Proceedings of the 10th USENIX Workshop on Hot Topics in Cloud Computing, 2018
BDS: a centralized near-optimal overlay network for inter-datacenter data replication.
Proceedings of the Thirteenth EuroSys Conference, 2018
Demystifying Deep Learning in Networking.
Proceedings of the 2nd Asia-Pacific Workshop on Networking, 2018
2017
Pytheas: Enabling Data-Driven Quality of Experience Optimization Using Group-Based Exploration-Exploitation.
Proceedings of the 14th USENIX Symposium on Networked Systems Design and Implementation, 2017
Biases in Data-Driven Networking, and What to Do About Them.
Proceedings of the 16th ACM Workshop on Hot Topics in Networks, Palo Alto, CA, USA, 2017
Unleashing the Potential of Data-Driven Networking.
Proceedings of the Communication Systems and Networks - 9th International Conference, 2017
2016
CS2P: Improving Video Bitrate Selection and Adaptation with Data-Driven Throughput Prediction.
Proceedings of the ACM SIGCOMM 2016 Conference, Florianopolis, Brazil, August 22-26, 2016, 2016
Via: Improving Internet Telephony Call Quality Using Predictive Relay Selection.
,
,
,
,
,
,
,
,
,
,
Proceedings of the ACM SIGCOMM 2016 Conference, Florianopolis, Brazil, August 22-26, 2016, 2016
CFA: A Practical Prediction System for Video QoE Optimization.
Proceedings of the 13th USENIX Symposium on Networked Systems Design and Implementation, 2016
2015
Analyzing TCP Throughput Stability and Predictability with Implications for Adaptive Video Streaming.
CoRR, 2015
DDA: Cross-Session Throughput Prediction with Applications to Video Bitrate Selection.
CoRR, 2015
Practical, Real-time Centralized Control for CDN-based Live Video Delivery.
Proceedings of the 2015 ACM Conference on Special Interest Group on Data Communication, 2015
C3: Internet-Scale Control Plane for Video Quality Optimization.
Proceedings of the 12th USENIX Symposium on Networked Systems Design and Implementation, 2015
2014
Improving Fairness, Efficiency, and Stability in HTTP-Based Adaptive Video Streaming With Festive.
IEEE/ACM Trans. Netw., 2014
TFA: A Tunable Finite Automaton for Pattern Matching in Network Intrusion Detection Systems.
IEEE J. Sel. Areas Commun., 2014
Kangaroo: Accelerating String Matching by Running Multiple Collaborative Finite State Machines.
IEEE J. Sel. Areas Commun., 2014
Enabling near real-time central control for live video delivery in CDNs.
Proceedings of the ACM SIGCOMM 2014 Conference, 2014
Using Video-Based Measurements to Generate a Real-Time Network Traffic Map.
Proceedings of the 13th ACM Workshop on Hot Topics in Networks, 2014
EONA: Experience-Oriented Network Architecture.
Proceedings of the 13th ACM Workshop on Hot Topics in Networks, 2014
2013
StriFA: Stride Finite Automata for High-Speed Regular Expression Matching in Network Intrusion Detection Systems.
IEEE Syst. J., 2013
Shedding light on the structure of internet video quality problems in the wild.
Proceedings of the Conference on emerging Networking Experiments and Technologies, 2013
2012
MOIST: A Scalable and Parallel Moving Object Indexer with School Tracking .
Proc. VLDB Endow., 2012
Managing DFA History with Queue for Deflation DFA.
J. Netw. Syst. Manag., 2012
A case for a coordinated internet video control plane.
Proceedings of the ACM SIGCOMM 2012 Conference, 2012
Tracking millions of flows in high speed networks for application identification.
Proceedings of the IEEE INFOCOM 2012, Orlando, FL, USA, March 25-30, 2012, 2012
Reducing power of traffic manager in routers via dynamic on/off-chip scheduling.
Proceedings of the IEEE INFOCOM 2012, Orlando, FL, USA, March 25-30, 2012, 2012
Scalable Name Lookup in NDN Using Effective Name Component Encoding.
Proceedings of the 2012 IEEE 32nd International Conference on Distributed Computing Systems, 2012
Deadline-aware data plane for internet video.
Proceedings of the 2012 ACM conference on CoNEXT student workshop, 2012
2011
S3: Smart selection of sampling function for passive network measurement.
Proceedings of the IEEE 36th Conference on Local Computer Networks, 2011
Measurements on movie distribution behavior in Peer-to-Peer networks.
Proceedings of the 12th IFIP/IEEE International Symposium on Integrated Network Management, 2011
StriD²FA: Scalable Regular Expression Matching for Deep Packet Inspection.
Proceedings of IEEE International Conference on Communications, 2011
Parallel Name Lookup for Named Data Networking.
Proceedings of the Global Communications Conference, 2011
2010
Parallel DFA Architecture for Ultra High Throughput DFA-Based Pattern Matching.
IEICE Trans. Inf. Syst., 2010
NetShield: massive semantics-based vulnerability signature matching for high-speed networks.
Proceedings of the ACM SIGCOMM 2010 Conference on Applications, 2010
Deflation DFA: Remembering History is Adequate.
Proceedings of IEEE International Conference on Communications, 2010
Pattern-Based DFA for Memory-Efficient and Scalable Multiple Regular Expression Matching.
Proceedings of IEEE International Conference on Communications, 2010
Parallel Architecture for High Throughput DFA-Based Deep Packet Inspection.
Proceedings of IEEE International Conference on Communications, 2010
Cache-Based Scalable Deep Packet Inspection with Predictive Automaton.
Proceedings of the Global Communications Conference, 2010
Independent Parallel Compact Finite Automatons for Accelerating Multi-String Matching.
Proceedings of the Global Communications Conference, 2010
Skip Finite Automaton: A Content Scanning Engine to Secure Enterprise Networks.
Proceedings of the Global Communications Conference, 2010
A2C: Anti-Attack Counters for Traffic Measurement.
Proceedings of the Global Communications Conference, 2010
2009
Module-Based Finite Automata: A Scalable and Memory-Efficient Architecture for Multi-pattern Matching in Deep Packet Inspection.
Proceedings of the Communication and Networking, 2009
SPC-FA: synergic parallel compact finite automaton to accelerate multi-string matching with low memory.
Proceedings of the 2009 ACM/IEEE Symposium on Architecture for Networking and Communications Systems, 2009