Xin Jin

Orcid: 0000-0001-8741-5847

Affiliations:

Peking University, China
Johns Hopkins University, Department of Computer Science, Baltimore, MD, USA (former)
Princeton University, Department of Computer Science, NJ, USA (PhD 2016)

According to our database¹, Xin Jin authored at least 106 papers between 2011 and 2024.

Collaborative distances:

Dijkstra number² of three.
Erdős number³ of three.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Bibliography

2024

Efficient, Scalable, and Sustainable DNN Training on SoC-Clustered Edge Servers.

[BibT_eX]

[DOI]

IEEE Trans. Mob. Comput., December, 2024

Pyxis: Scheduling Mixed Tasks in Disaggregated Datacenters.

[BibT_eX]

[DOI]

IEEE Trans. Parallel Distributed Syst., September, 2024

DistMind: Efficient Resource Disaggregation for Deep Learning Workloads.

[BibT_eX]

[DOI]

IEEE/ACM Trans. Netw., June, 2024

FLASH: Heterogeneity-Aware Federated Learning at Scale.

[BibT_eX]

[DOI]

IEEE Trans. Mob. Comput., January, 2024

RLHFuse: Efficient RLHF Training for Large Language Models with Inter- and Intra-Stage Fusion.

[BibT_eX]

[DOI]

CoRR, 2024

DistTrain: Addressing Model and Data Heterogeneity with Disaggregated Training for Multimodal Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

Efficient Training of Large Language Models on Distributed Infrastructures: A Survey.

[BibT_eX]

[DOI]

CoRR, 2024

LoongTrain: Efficient Training of Long-Sequence LLMs with Head-Context Parallelism.

[BibT_eX]

[DOI]

CoRR, 2024

FLUX: Fast Software-based Communication Overlap On GPUs Through Kernel Fusion.

[BibT_eX]

[DOI]

CoRR, 2024

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone.

[BibT_eX]

[DOI]

Caio César Teodoro Mendes

CoRR, 2024

RAGCache: Efficient Knowledge Caching for Retrieval-Augmented Generation.

[BibT_eX]

[DOI]

CoRR, 2024

InternEvo: Efficient Long-sequence Large Language Model Training via Hybrid Parallelism and Redundant Sharding.

[BibT_eX]

[DOI]

CoRR, 2024

A Survey of Resource-efficient LLM and Multimodal Foundation Models.

[BibT_eX]

[DOI]

CoRR, 2024

LoongServe: Efficiently Serving Long-Context Large Language Models with Elastic Sequence Parallelism.

[BibT_eX]

[DOI]

Proceedings of the ACM SIGOPS 30th Symposium on Operating Systems Principles, 2024

DistServe: Disaggregating Prefill and Decoding for Goodput-optimized Large Language Model Serving.

[BibT_eX]

[DOI]

Proceedings of the 18th USENIX Symposium on Operating Systems Design and Implementation, 2024

dLoRA: Dynamically Orchestrating Requests and Adapters for LoRA LLM Serving.

[BibT_eX]

[DOI]

Proceedings of the 18th USENIX Symposium on Operating Systems Design and Implementation, 2024

Burstable Cloud Block Storage with Data Processing Units.

[BibT_eX]

[DOI]

Proceedings of the 18th USENIX Symposium on Operating Systems Design and Implementation, 2024

Fast Vector Query Processing for Large Datasets Beyond GPU Memory with Reordered Pipelining.

[BibT_eX]

[DOI]

Proceedings of the 21st USENIX Symposium on Networked Systems Design and Implementation, 2024

Jolteon: Unleashing the Promise of Serverless for Serverless Workflows.

[BibT_eX]

[DOI]

Zili Zhang

Chao Jin

Xin Jin

Proceedings of the 21st USENIX Symposium on Networked Systems Design and Implementation, 2024

MegaScale: Scaling Large Language Model Training to More Than 10, 000 GPUs.

[BibT_eX]

[DOI]

Proceedings of the 21st USENIX Symposium on Networked Systems Design and Implementation, 2024

Unison: A Parallel-Efficient and User-Transparent Network Simulation Kernel.

[BibT_eX]

[DOI]

Proceedings of the Nineteenth European Conference on Computer Systems, 2024

SoCFlow: Efficient and Scalable DNN Training on SoC-Clustered Edge Servers.

[BibT_eX]

[DOI]

Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024

2023

Rise of Distributed Deep Learning Training in the Big Model Era: From a Software Engineering Perspective.

[BibT_eX]

[DOI]

ACM Trans. Softw. Eng. Methodol., November, 2023

Rise of the Planet of Serverless Computing: A Systematic Review.

[BibT_eX]

[DOI]

ACM Trans. Softw. Eng. Methodol., September, 2023

<i>FaaSLight</i>: General Application-level Cold-start Latency Optimization for Function-as-a-Service in Serverless Computing.

[BibT_eX]

[DOI]

ACM Trans. Softw. Eng. Methodol., September, 2023

Enabling Edge-Cloud Video Analytics for Robotics Applications.

[BibT_eX]

[DOI]

IEEE Trans. Cloud Comput., 2023

Scalable and Efficient Full-Graph GNN Training for Large Graphs.

[BibT_eX]

[DOI]

Proc. ACM Manag. Data, 2023

LLMCad: Fast and Scalable On-device Large Language Model Inference.

[BibT_eX]

[DOI]

CoRR, 2023

Fast Distributed Inference Serving for Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2023

Energy-Efficient GPU Clusters Scheduling for Deep Learning.

[BibT_eX]

[DOI]

CoRR, 2023

MuxFlow: Efficient and Safe GPU Sharing in Large-Scale Production Deep Learning Clusters.

[BibT_eX]

[DOI]

CoRR, 2023

Automated Verification of an In-Production DNS Authoritative Engine.

[BibT_eX]

[DOI]

Proceedings of the 29th Symposium on Operating Systems Principles, 2023

Halfmoon: Log-Optimal Fault-Tolerant Stateful Serverless Computing.

[BibT_eX]

[DOI]

Sheng Qi

Xuanzhe Liu

Xin Jin

Proceedings of the 29th Symposium on Operating Systems Principles, 2023

Oobleck: Resilient Distributed Training of Large Models Using Pipeline Templates.

[BibT_eX]

[DOI]

Proceedings of the 29th Symposium on Operating Systems Principles, 2023

Klotski: Efficient and Safe Network Migration of Large Production Datacenters.

[BibT_eX]

[DOI]

Proceedings of the ACM SIGCOMM 2023 Conference, 2023

Understanding the Micro-Behaviors of Hardware Offloaded Network Stacks with Lumina.

[BibT_eX]

[DOI]

Proceedings of the ACM SIGCOMM 2023 Conference, 2023

XRON: A Hybrid Elastic Cloud Overlay Network for Video Conferencing at Planetary Scale.

[BibT_eX]

[DOI]

Proceedings of the ACM SIGCOMM 2023 Conference, 2023

Ditto: Efficient Serverless Analytics with Elastic Parallelism.

[BibT_eX]

[DOI]

Proceedings of the ACM SIGCOMM 2023 Conference, 2023

AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving.

[BibT_eX]

[DOI]

Proceedings of the 17th USENIX Symposium on Operating Systems Design and Implementation, 2023

Fast, Approximate Vector Queries on Very Large Unstructured Datasets.

[BibT_eX]

[DOI]

Proceedings of the 20th USENIX Symposium on Networked Systems Design and Implementation, 2023

Transparent GPU Sharing in Container Clouds for Deep Learning Workloads.

[BibT_eX]

[DOI]

Proceedings of the 20th USENIX Symposium on Networked Systems Design and Implementation, 2023

Niagara: Scheduling DNN Inference Services on Heterogeneous Edge Processors.

[BibT_eX]

[DOI]

Proceedings of the Service-Oriented Computing - 21st International Conference, 2023

Disaggregated RAID Storage in Modern Datacenters.

[BibT_eX]

[DOI]

Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023

ElasticFlow: An Elastic Serverless Training Platform for Distributed Deep Learning.

[BibT_eX]

[DOI]

Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023

2022

MiCS: Near-linear Scaling for Training Gigantic Model on Public Cloud.

[BibT_eX]

[DOI]

Proc. VLDB Endow., 2022

Orloj: Predictably Serving Unpredictable DNNs.

[BibT_eX]

[DOI]

CoRR, 2022

LambdaLite: Application-Level Optimization for Cold Start Latency in Serverless Computing.

[BibT_eX]

[DOI]

CoRR, 2022

Mandheling: Mixed-Precision On-Device DNN Training with DSP Offloading.

[BibT_eX]

[DOI]

CoRR, 2022

Meissa: scalable network testing for programmable data planes.

[BibT_eX]

[DOI]

Proceedings of the SIGCOMM '22: ACM SIGCOMM 2022 Conference, Amsterdam, The Netherlands, August 22, 2022

Multi-resource interleaving for deep learning training.

[BibT_eX]

[DOI]

Proceedings of the SIGCOMM '22: ACM SIGCOMM 2022 Conference, Amsterdam, The Netherlands, August 22, 2022

NetVRM: Virtual Register Memory for Programmable Networks.

[BibT_eX]

[DOI]

Proceedings of the 19th USENIX Symposium on Networked Systems Design and Implementation, 2022

Melon: breaking the memory wall for resource-efficient on-device machine learning.

[BibT_eX]

[DOI]

Proceedings of the MobiSys '22: The 20th Annual International Conference on Mobile Systems, Applications and Services, Portland, Oregon, 27 June 2022, 2022

Mandheling: mixed-precision on-device DNN training with DSP offloading.

[BibT_eX]

[DOI]

Proceedings of the ACM MobiCom '22: The 28th Annual International Conference on Mobile Computing and Networking, Sydney, NSW, Australia, October 17, 2022

Multi-objective congestion control.

[BibT_eX]

[DOI]

Proceedings of the EuroSys '22: Seventeenth European Conference on Computer Systems, Rennes, France, April 5, 2022

Optimizing half precision Winograd convolution on ARM many-core processors.

[BibT_eX]

[DOI]

Proceedings of the APSys '22: 13th ACM SIGOPS Asia-Pacific Workshop on Systems, Virtual Event, Singapore, August 23, 2022

2021

Demystifying Developers' Issues in Distributed Training of Deep Learning Software.

[BibT_eX]

[DOI]

CoRR, 2021

Jaqen: A High-Performance Switch-Native Approach for Detecting and Mitigating Volumetric DDoS Attacks with Programmable Switches.

[BibT_eX]

[DOI]

Proceedings of the 30th USENIX Security Symposium, 2021

Runtime Recovery of Web Applications under Zero-Day ReDoS Attacks.

[BibT_eX]

[DOI]

Proceedings of the 42nd IEEE Symposium on Security and Privacy, 2021

An empirical study on challenges of application development in serverless computing.

[BibT_eX]

[DOI]

Proceedings of the ESEC/FSE '21: 29th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2021

Network planning with deep reinforcement learning.

[BibT_eX]

[DOI]

Hang Zhu

Varun Gupta

Satyajeet Singh Ahuja

Yuandong Tian

Ying Zhang

Xin Jin

Proceedings of the ACM SIGCOMM 2021 Conference, Virtual Event, USA, August 23-27, 2021., 2021

Programmable packet scheduling with a single queue.

[BibT_eX]

[DOI]

Proceedings of the ACM SIGCOMM 2021 Conference, Virtual Event, USA, August 23-27, 2021., 2021

Cost-effective data analytics across multiple cloud regions.

[BibT_eX]

[DOI]

Proceedings of the SIGCOMM '21: ACM SIGCOMM 2021 Conference, 2021

Twenty Years After: Hierarchical Core-Stateless Fair Queueing.

[BibT_eX]

[DOI]

Proceedings of the 18th USENIX Symposium on Networked Systems Design and Implementation, 2021

Ship Compute or Ship Data? Why Not Both?

[BibT_eX]

[DOI]

Proceedings of the 18th USENIX Symposium on Networked Systems Design and Implementation, 2021

2020

RackSched: A Microsecond-Scale Scheduler for Rack-Scale Computers (Technical Report).

[BibT_eX]

[DOI]

CoRR, 2020

Is Network the Bottleneck of Distributed Training?

[BibT_eX]

[DOI]

Proceedings of the 2020 Workshop on Network Meets AI & ML, 2020

NetLock: Fast, Centralized Lock Management Using Programmable Switches.

[BibT_eX]

[DOI]

Proceedings of the SIGCOMM '20: Proceedings of the 2020 Annual conference of the ACM Special Interest Group on Data Communication on the applications, 2020

RackSched: A Microsecond-Scale Scheduler for Rack-Scale Computers.

[BibT_eX]

[DOI]

Proceedings of the 14th USENIX Symposium on Operating Systems Design and Implementation, 2020

Pegasus: Tolerating Skewed Workloads in Distributed Storage with In-Network Coherence Directories.

[BibT_eX]

[DOI]

Proceedings of the 14th USENIX Symposium on Operating Systems Design and Implementation, 2020

PipeSwitch: Fast Pipelined Context Switching for Deep Learning Applications.

[BibT_eX]

[DOI]

Proceedings of the 14th USENIX Symposium on Operating Systems Design and Implementation, 2020

On Efficient Constructions of Checkpoints.

[BibT_eX]

[DOI]

Proceedings of the 37th International Conference on Machine Learning, 2020

Multitenancy for Fast and Programmable Networks in the Cloud.

[BibT_eX]

[DOI]

Proceedings of the 12th USENIX Workshop on Hot Topics in Cloud Computing, 2020

Concerto: cooperative network-wide telemetry with controllable error rate.

[BibT_eX]

[DOI]

Proceedings of the APSys '20: 11th ACM SIGOPS Asia-Pacific Workshop on Systems, 2020

2019

Harmonia: Near-Linear Scalability for Replicated Storage with In-Network Conflict Detection.

[BibT_eX]

[DOI]

Proc. VLDB Endow., 2019

DistCache: Provable Load Balancing for Large-Scale Storage Systems with Distributed Caching.

[BibT_eX]

[DOI]

Proceedings of the 2019 USENIX Annual Technical Conference, 2019

Neural packet classification.

[BibT_eX]

[DOI]

Proceedings of the ACM Special Interest Group on Data Communication, 2019

Flash: efficient dynamic routing for offchain networks.

[BibT_eX]

[DOI]

Proceedings of the 15th International Conference on Emerging Networking Experiments And Technologies, 2019

QPipe: quantiles sketch fully in the data plane.

[BibT_eX]

[DOI]

Proceedings of the 15th International Conference on Emerging Networking Experiments And Technologies, 2019

2018

NetChain: Scale-Free Sub-RTT Coordination (Extended Version).

[BibT_eX]

[DOI]

CoRR, 2018

AWStream: adaptive wide-area streaming analytics.

[BibT_eX]

[DOI]

Proceedings of the 2018 Conference of the ACM Special Interest Group on Data Communication, 2018

ASAP: Fast, Approximate Graph Pattern Mining at Scale.

[BibT_eX]

[DOI]

Anand Padmanabha Iyer

Zaoxing Liu

Xin Jin

Shivaram Venkataraman

Vladimir Braverman

Ion Stoica

Proceedings of the 13th USENIX Symposium on Operating Systems Design and Implementation, 2018

NetChain: Scale-Free Sub-RTT Coordination.

[BibT_eX]

[DOI]

Proceedings of the 15th USENIX Symposium on Networked Systems Design and Implementation, 2018

Proactive Video Push for Optimizing Bandwidth Consumption in Hybrid CDN-P2P VoD Systems.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Conference on Computer Communications, 2018

Towards Fast and Scalable Graph Pattern Mining.

[BibT_eX]

[DOI]

Anand Padmanabha Iyer

Zaoxing Liu

Xin Jin

Shivaram Venkataraman

Vladimir Braverman

Ion Stoica

Proceedings of the 10th USENIX Workshop on Hot Topics in Cloud Computing, 2018

DumbNet: a smart data center network fabric with dumb switches.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth EuroSys Conference, 2018

2017

SnapLink: Fast and Accurate Vision-Based Appliance Control in Large Commercial Buildings.

[BibT_eX]

[DOI]

Proc. ACM Interact. Mob. Wearable Ubiquitous Technol., 2017

NetCache: Balancing Key-Value Stores with Fast In-Network Caching.

[BibT_eX]

[DOI]

Proceedings of the 26th Symposium on Operating Systems Principles, 2017

SketchVisor: Robust Network Measurement for Software Packet Processing.

[BibT_eX]

[DOI]

Proceedings of the Conference of the ACM Special Interest Group on Data Communication, 2017

Competitive analysis for online scheduling in software-defined optical WAN.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Conference on Computer Communications, 2017

Catalyst: Unlocking the Power of Choice to Speed up Network Updates.

[BibT_eX]

[DOI]

Rohan Gandhi

Ori Rottenstreich

Xin Jin

Proceedings of the 13th International Conference on emerging Networking EXperiments and Technologies, 2017

2016

Dynamic Control of Software-Defined Networks

[BibT_eX]

[DOI]

Xin Jin

PhD thesis, 2016

Your Data Center Switch is Trying Too Hard.

[BibT_eX]

[DOI]

Xin Jin

Nathan Farrington

Jennifer Rexford

Proceedings of the Symposium on SDN Research, 2016

Optimizing Bulk Transfers with Software-Defined Optical WAN.

[BibT_eX]

[DOI]

Proceedings of the ACM SIGCOMM 2016 Conference, Florianopolis, Brazil, August 22-26, 2016, 2016

A 12-rack, 180-server datacenter network (DCN) using multiwavelength optical switching and full stack optimization.

[BibT_eX]

[DOI]

Proceedings of the Optical Fiber Communications Conference and Exhibition, 2016

Increasing large-scale data center capacity by statistical power control.

[BibT_eX]

[DOI]

Proceedings of the Eleventh European Conference on Computer Systems, 2016

2015

Can Accurate Predictions Improve Video Streaming in Cellular Networks?

[BibT_eX]

[DOI]

Proceedings of the 16th International Workshop on Mobile Computing Systems and Applications, 2015

CoVisor: A Compositional Hypervisor for Software-Defined Networks.

[BibT_eX]

[DOI]

Proceedings of the 12th USENIX Symposium on Networked Systems Design and Implementation, 2015

2014

Incremental update for a compositional SDN hypervisor.

[BibT_eX]

[DOI]

Xin Jin

Jennifer Rexford

David Walker

Proceedings of the third workshop on Hot topics in software defined networking, 2014

Dynamic scheduling of network updates.

[BibT_eX]

[DOI]

Proceedings of the ACM SIGCOMM 2014 Conference, 2014

2013

SoftCell: Taking Control of Cellular Core Networks

[BibT_eX]

[DOI]

CoRR, 2013

Intra-data-center traffic engineering with ensemble routing.

[BibT_eX]

[DOI]

Proceedings of the IEEE INFOCOM 2013, Turin, Italy, April 14-19, 2013, 2013

SoftCell: scalable and flexible cellular core network architecture.

[BibT_eX]

[DOI]

Proceedings of the Conference on emerging Networking Experiments and Technologies, 2013

2012

Virtual Switching Without a Hypervisor for a More Secure Cloud.

[BibT_eX]

[DOI]

Xin Jin

Eric Keller

Jennifer Rexford

Proceedings of the 2nd USENIX Workshop on Hot Topics in Management of Internet, 2012

2011

Quantitative Analysis of the VANET Connectivity: Theory and Application.

[BibT_eX]

[DOI]

Xin Jin

Weijie Su

Wei Yan

Proceedings of the 73rd IEEE Vehicular Technology Conference, 2011

Relative Link Quality Assessment and Hybrid Routing Scheme for Wireless Mesh Networks.

[BibT_eX]

[DOI]

Proceedings of IEEE International Conference on Communications, 2011

A study of the VANET connectivity by percolation theory.

[BibT_eX]

[DOI]

Xin Jin

Weijie Su

Yan Wei

Proceedings of the 2011 IEEE Consumer Communications and Networking Conference, 2011

Xin Jin

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...