Wei Bai

Orcid: 0000-0002-8898-8070

Affiliations:
  • Hong Kong University of Science and Technology (HKUST)


According to our database1, Wei Bai authored at least 47 papers between 2014 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Distributed Network Telemetry With Resource Efficiency and Full Accuracy.
IEEE/ACM Trans. Netw., June, 2024

Lazarus: Resilient and Elastic Training of Mixture-of-Experts Models with Adaptive Expert Placement.
CoRR, 2024

POSTER: Opportunistic Credit-Based Transport for Reconfigurable Data Center Networks with Tidal.
Proceedings of the ACM SIGCOMM 2024 Conference: Posters and Demos, 2024

Uniform-Cost Multi-Path Routing for Reconfigurable Data Center Networks.
Proceedings of the ACM SIGCOMM 2024 Conference, 2024

Harmonic: Hardware-assisted RDMA Performance Isolation for Public Clouds.
Proceedings of the 21st USENIX Symposium on Networked Systems Design and Implementation, 2024

Reverie: Low Pass Filter-Based Switch Buffer Sharing for Datacenters with RDMA and TCP Traffic.
Proceedings of the 21st USENIX Symposium on Networked Systems Design and Implementation, 2024

Towards Domain-Specific Network Transport for Distributed DNN Training.
Proceedings of the 21st USENIX Symposium on Networked Systems Design and Implementation, 2024

Rethinking Transport Protocols for Reconfigurable Data Centers: An Empirical Study.
Proceedings of the 1st SIGCOMM Workshop on Hot Topics in Optical Technologies and Applications in Networking, 2024

2023
Enabling ECN for Datacenter Networks With RTT Variations.
IEEE Trans. Cloud Comput., 2023

Understanding the Micro-Behaviors of Hardware Offloaded Network Stacks with Lumina.
Proceedings of the ACM SIGCOMM 2023 Conference, 2023

Understanding RDMA Microarchitecture Resources for Performance Isolation.
Proceedings of the 20th USENIX Symposium on Networked Systems Design and Implementation, 2023


Towards a Manageable Intra-Host Network.
Proceedings of the 19th Workshop on Hot Topics in Operating Systems, 2023

FlexPass: A Case for Flexible Credit-based Transport for Datacenter Networks.
Proceedings of the Eighteenth European Conference on Computer Systems, 2023

2022
Congestion Control for Cross-Datacenter Networks.
IEEE/ACM Trans. Netw., 2022

Aeolus: A Building Block for Proactive Transport in Datacenter Networks.
IEEE/ACM Trans. Netw., 2022

2021
RepNet: Cutting Latency with Flow Replication in Data Center Networks.
IEEE Trans. Serv. Comput., 2021

Accelerating End-to-End Deep Learning Workflow With Codesign of Data Preprocessing and Scheduling.
IEEE Trans. Parallel Distributed Syst., 2021

One More Config is Enough: Saving (DC)TCP for High-Speed Extremely Shallow-Buffered Datacenters.
IEEE/ACM Trans. Netw., 2021

Providing Bandwidth Guarantees, Work Conservation and Low Latency Simultaneously in the Cloud.
IEEE Trans. Cloud Comput., 2021

Towards timeout-less transport in commodity datacenter networks.
Proceedings of the EuroSys '21: Sixteenth European Conference on Computer Systems, 2021

2020
Domain-specific Communication Optimization for Distributed DNN Training.
CoRR, 2020

OmniMon: Re-architecting Network Telemetry with Resource Efficiency and Full Accuracy.
Proceedings of the SIGCOMM '20: Proceedings of the 2020 Annual conference of the ACM Special Interest Group on Data Communication on the applications, 2020

Aeolus: A Building Block for Proactive Transport in Datacenters.
Proceedings of the SIGCOMM '20: Proceedings of the 2020 Annual conference of the ACM Special Interest Group on Data Communication on the applications, 2020

2019
Accelerating Rule-matching Systems with Learned Rankers.
Proceedings of the 2019 USENIX Annual Technical Conference, 2019

DLBooster: Boosting End-to-End Deep Learning Workflows with Offloading Data Preprocessing Pipelines.
Proceedings of the 48th International Conference on Parallel Processing, 2019

FlowShader: a Generalized Framework for GPU-accelerated VNF Flow Processing.
Proceedings of the 27th IEEE International Conference on Network Protocols, 2019

Rethinking Transport Layer Design for Distributed Machine Learning.
Proceedings of the 3rd Asia-Pacific Workshop on Networking, 2019

2018
Augmenting Proactive Congestion Control with Aeolus.
Proceedings of the 2nd Asia-Pacific Workshop on Networking, 2018

2017
Guaranteeing Deadlines for Inter-Data Center Transfers.
IEEE/ACM Trans. Netw., 2017

PIAS: Practical Information-Agnostic Flow Scheduling for Commodity Data Centers.
IEEE/ACM Trans. Netw., 2017

Resilient Datacenter Load Balancing in the Wild.
Proceedings of the Conference of the ACM Special Interest Group on Data Communication, 2017

Rate-aware flow scheduling for commodity data center networks.
Proceedings of the 2017 IEEE Conference on Computer Communications, 2017

Combining ECN and RTT for Datacenter Transport.
Proceedings of the First Asia-Pacific Workshop on Networking, 2017

Congestion Control for High-speed Extremely Shallow-buffered Datacenter Networks.
Proceedings of the First Asia-Pacific Workshop on Networking, 2017

2016
Towards Comprehensive Traffic Forecasting in Cloud Computing: Design and Application.
IEEE/ACM Trans. Netw., 2016

Explicit Path Control in Commodity Data Centers: Design and Applications.
IEEE/ACM Trans. Netw., 2016

Scheduling Mix-flows in Commodity Datacenters with Karuna.
Proceedings of the ACM SIGCOMM 2016 Conference, Florianopolis, Brazil, August 22-26, 2016, 2016

Enabling ECN in Multi-Service Multi-Queue Data Centers.
Proceedings of the 13th USENIX Symposium on Networked Systems Design and Implementation, 2016

Enabling ECN over Generic Packet Scheduling.
Proceedings of the 12th International on Conference on emerging Networking EXperiments and Technologies, 2016

2015
Information-Agnostic Flow Scheduling for Commodity Data Centers.
Proceedings of the 12th USENIX Symposium on Networked Systems Design and Implementation, 2015

Rapier: Integrating routing and scheduling for coflow-aware data center networks.
Proceedings of the 2015 IEEE Conference on Computer Communications, 2015

Guaranteeing deadlines for inter-datacenter transfers.
Proceedings of the Tenth European Conference on Computer Systems, 2015

2014
RepFlow on node.js: Cutting Tail Latency in Data Center Networks at the Applications Layer.
CoRR, 2014

HadoopWatch: A first step towards comprehensive traffic forecasting in cloud computing.
Proceedings of the 2014 IEEE Conference on Computer Communications, 2014

PAC: Taming TCP Incast Congestion Using Proactive ACK Control.
Proceedings of the 22nd IEEE International Conference on Network Protocols, 2014

PIAS: Practical Information-Agnostic Flow Scheduling for Data Center Networks.
Proceedings of the 13th ACM Workshop on Hot Topics in Networks, 2014


  Loading...