Peng Cheng

Orcid: 0000-0003-4014-4757

Affiliations:
  • Microsoft Research Asia, Beijing, China
  • Tsinghua University, Beijing, China (PhD 2015)


According to our database1, Peng Cheng authored at least 58 papers between 2013 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Automated Proof Generation for Rust Code via Self-Evolution.
CoRR, 2024

Gradient-Mask Tuning Elevates the Upper Limits of LLM Performance.
CoRR, 2024

Toward CXL-Native Memory Tiering via Device-Side Profiling.
CoRR, 2024

Anubis: Towards Reliable Cloud AI Infrastructure via Proactive Validation.
CoRR, 2024

SuperBench: Improving Cloud AI Infrastructure Reliability with Proactive Validation.
Proceedings of the 2024 USENIX Annual Technical Conference, 2024

NeoMem: Hardware/Software Co-Design for CXL-Native Memory Tiering.
Proceedings of the 57th IEEE/ACM International Symposium on Microarchitecture, 2024

SmartNIC-Enabled Live Migration for Storage-Optimized VMs.
Proceedings of the 15th ACM SIGOPS Asia-Pacific Workshop on Systems, 2024

2023
Meili: Enabling SmartNIC as a Service in the Cloud.
CoRR, 2023

FP8-LM: Training FP8 Large Language Models.
CoRR, 2023

SPFresh: Incremental In-Place Update for Billion-Scale Vector Search.
Proceedings of the 29th Symposium on Operating Systems Principles, 2023

Poster: Meili: Towards SmartNIC as a Service.
Proceedings of the ACM SIGCOMM 2023 Conference, 2023

ARK: GPU-driven Code Execution for Distributed Deep Learning.
Proceedings of the 20th USENIX Symposium on Networked Systems Design and Implementation, 2023

Tutel: Adaptive Mixture-of-Experts at Scale.
Proceedings of the Sixth Conference on Machine Learning and Systems, 2023

Query Processing on Gaming Consoles.
Proceedings of the 19th International Workshop on Data Management on New Hardware, 2023

ElasticFlow: An Elastic Serverless Training Platform for Distributed Deep Learning.
Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023

Polaris: Enhancing CXL-based Memory Expanders with Memory-side Prefetching.
Proceedings of the Advanced Parallel Processing Technologies, 2023

SegaNet: An Advanced IoT Cloud Gateway for Performant and Priority-Oriented Message Delivery.
Proceedings of the 7th Asia-Pacific Workshop on Networking, 2023

SlimeMold: Hardware Load Balancer at Scale in Datacenter.
Proceedings of the 7th Asia-Pacific Workshop on Networking, 2023

MINA: Auto-scale In-network Aggregation for Machine Learning Service.
Proceedings of the 7th Asia-Pacific Workshop on Networking, 2023

2022
NetKernel: Making Network Stack Part of the Virtualized Infrastructure.
IEEE/ACM Trans. Netw., 2022

Moneo: Monitoring Fine-grained Metrics Nonintrusively in AI Infrastructure.
ACM SIGOPS Oper. Syst. Rev., 2022

Tutel: Adaptive Mixture-of-Experts at Scale.
CoRR, 2022

PilotFish: Harvesting Free Cycles of Cloud Gaming with Deep Learning Training.
Proceedings of the 2022 USENIX Annual Technical Conference, 2022

An Adaptive Deep RL Method for Non-Stationary Environments with Piecewise Stable Context.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

RuleCache: Accelerating Web Application Firewalls by On-line Learning Traffic Patterns.
Proceedings of the IEEE International Conference on Web Services, 2022

Moneo: Non-intrusive Fine-grained Monitor for AI Infrastructure.
Proceedings of the IEEE International Conference on Communications, 2022

PipeDevice: a hardware-software co-design approach to intra-host container communication.
Proceedings of the 18th International Conference on emerging Networking EXperiments and Technologies, 2022

A Disaggregate Data Collecting Approach for Loss-Tolerant Applications.
Proceedings of the 6th Asia-Pacific Workshop on Networking, 2022

OpenNetLab: Open Platform for RL-based Congestion Control for Real-Time Communications.
Proceedings of the 6th Asia-Pacific Workshop on Networking, 2022

2021
CrossoverScheduler: Overlapping Multiple Distributed Training Applications in a Crossover Manner.
CoRR, 2021

NFD: Using Behavior Models to Develop Cross-Platform Network Functions.
Proceedings of the 40th IEEE Conference on Computer Communications, 2021

Enhanced control path for repeated TCP connections.
Proceedings of the APSys '21: 12th ACM SIGOPS Asia-Pacific Workshop on Systems, 2021

Accelerating GNN training with locality-aware partial execution.
Proceedings of the APSys '21: 12th ACM SIGOPS Asia-Pacific Workshop on Systems, 2021

Towards user-defined SLA in cloud flash storage.
Proceedings of the APSys '21: 12th ACM SIGOPS Asia-Pacific Workshop on Systems, 2021

2020
Observing and Mitigating Micro-Burst Traffic in Data Center Networks.
IEEE/ACM Trans. Netw., 2020

Simulating Performance of ML Systems with Offline Profiling.
CoRR, 2020

NetKernel: Making Network Stack Part of the Virtualized Infrastructure.
Proceedings of the 2020 USENIX Annual Technical Conference, 2020

2019
Tagger: Practical PFC Deadlock Prevention in Data Center Networks.
IEEE/ACM Trans. Netw., 2019

MP-RDMA: Enabling RDMA With Multi-Path Transport in Datacenters.
IEEE/ACM Trans. Netw., 2019

BotGraph: Web Bot Detection Based on Sitemap.
CoRR, 2019

NetKernel: Making Network Stack Part of the Virtualized Infrastructure.
CoRR, 2019

Direct Universal Access: Making Data Center Resources Available to FPGA.
Proceedings of the 16th USENIX Symposium on Networked Systems Design and Implementation, 2019

DLBooster: Boosting End-to-End Deep Learning Workflows with Offloading Data Preprocessing Pipelines.
Proceedings of the 48th International Conference on Parallel Processing, 2019

2018
FUSO: Fast Multi-Path Loss Recovery for Data Center Networks.
IEEE/ACM Trans. Netw., 2018

Multi-Path Transport for RDMA in Datacenters.
Proceedings of the 15th USENIX Symposium on Networked Systems Design and Implementation, 2018

Micro-Burst in Data Centers: Observations, Analysis, and Mitigations.
Proceedings of the 2018 IEEE 26th International Conference on Network Protocols, 2018

2017
Performance analysis of randomized data fetching in cluster computing.
Proceedings of the 25th IEEE/ACM International Symposium on Quality of Service, 2017

Network Stack as a Service in the Cloud.
Proceedings of the 16th ACM Workshop on Hot Topics in Networks, Palo Alto, CA, USA, 2017

The Feniks FPGA Operating System for Cloud Computing.
Proceedings of the 8th Asia-Pacific Workshop on Systems, Mumbai, India, September 2, 2017, 2017

Memory Efficient Loss Recovery for Hardware-based Transport in Datacenter.
Proceedings of the First Asia-Pacific Workshop on Networking, 2017

2016
Micro-burst in Data Centers: Observations, Implications, and Applications.
CoRR, 2016

Fast and Cautious: Leveraging Multi-path Diversity for Transport Loss Recovery in Data Centers.
Proceedings of the 2016 USENIX Annual Technical Conference, 2016

ClickNP: Highly flexible and High-performance Network Processing with Reconfigurable Hardware.
Proceedings of the ACM SIGCOMM 2016 Conference, Florianopolis, Brazil, August 22-26, 2016, 2016

Deadlocks in Datacenter Networks: Why Do They Form, and How to Avoid Them.
Proceedings of the 15th ACM Workshop on Hot Topics in Networks, 2016

TFC: token flow control in data center networks.
Proceedings of the Eleventh European Conference on Computer Systems, 2016

2015
Slowing Little Quickens More: Improving DCTCP for Massive Concurrent Flows.
Proceedings of the 44th International Conference on Parallel Processing, 2015

2014
Catch the Whole Lot in an Action: Rapid Precise Packet Loss Notification in Data Center.
Proceedings of the 11th USENIX Symposium on Networked Systems Design and Implementation, 2014

2013
Ease the Queue Oscillation: Analysis and Enhancement of DCTCP.
Proceedings of the IEEE 33rd International Conference on Distributed Computing Systems, 2013


  Loading...