Xin Jin

Orcid: 0000-0001-8741-5847

Affiliations:
  • Peking University, China
  • Johns Hopkins University, Department of Computer Science, Baltimore, MD, USA (former)
  • Princeton University, Department of Computer Science, NJ, USA (PhD 2016)


According to our database1, Xin Jin authored at least 106 papers between 2011 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Efficient, Scalable, and Sustainable DNN Training on SoC-Clustered Edge Servers.
IEEE Trans. Mob. Comput., December, 2024

Pyxis: Scheduling Mixed Tasks in Disaggregated Datacenters.
IEEE Trans. Parallel Distributed Syst., September, 2024

DistMind: Efficient Resource Disaggregation for Deep Learning Workloads.
IEEE/ACM Trans. Netw., June, 2024

FLASH: Heterogeneity-Aware Federated Learning at Scale.
IEEE Trans. Mob. Comput., January, 2024

RLHFuse: Efficient RLHF Training for Large Language Models with Inter- and Intra-Stage Fusion.
CoRR, 2024

DistTrain: Addressing Model and Data Heterogeneity with Disaggregated Training for Multimodal Large Language Models.
CoRR, 2024

Efficient Training of Large Language Models on Distributed Infrastructures: A Survey.
CoRR, 2024

LoongTrain: Efficient Training of Long-Sequence LLMs with Head-Context Parallelism.
CoRR, 2024

FLUX: Fast Software-based Communication Overlap On GPUs Through Kernel Fusion.
CoRR, 2024

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone.
CoRR, 2024

RAGCache: Efficient Knowledge Caching for Retrieval-Augmented Generation.
CoRR, 2024

InternEvo: Efficient Long-sequence Large Language Model Training via Hybrid Parallelism and Redundant Sharding.
CoRR, 2024

A Survey of Resource-efficient LLM and Multimodal Foundation Models.
CoRR, 2024

LoongServe: Efficiently Serving Long-Context Large Language Models with Elastic Sequence Parallelism.
Proceedings of the ACM SIGOPS 30th Symposium on Operating Systems Principles, 2024

DistServe: Disaggregating Prefill and Decoding for Goodput-optimized Large Language Model Serving.
Proceedings of the 18th USENIX Symposium on Operating Systems Design and Implementation, 2024

dLoRA: Dynamically Orchestrating Requests and Adapters for LoRA LLM Serving.
Proceedings of the 18th USENIX Symposium on Operating Systems Design and Implementation, 2024

Burstable Cloud Block Storage with Data Processing Units.
Proceedings of the 18th USENIX Symposium on Operating Systems Design and Implementation, 2024

Fast Vector Query Processing for Large Datasets Beyond GPU Memory with Reordered Pipelining.
Proceedings of the 21st USENIX Symposium on Networked Systems Design and Implementation, 2024

Jolteon: Unleashing the Promise of Serverless for Serverless Workflows.
Proceedings of the 21st USENIX Symposium on Networked Systems Design and Implementation, 2024

MegaScale: Scaling Large Language Model Training to More Than 10, 000 GPUs.
Proceedings of the 21st USENIX Symposium on Networked Systems Design and Implementation, 2024

Unison: A Parallel-Efficient and User-Transparent Network Simulation Kernel.
Proceedings of the Nineteenth European Conference on Computer Systems, 2024

SoCFlow: Efficient and Scalable DNN Training on SoC-Clustered Edge Servers.
Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024

2023
Rise of Distributed Deep Learning Training in the Big Model Era: From a Software Engineering Perspective.
ACM Trans. Softw. Eng. Methodol., November, 2023

Rise of the Planet of Serverless Computing: A Systematic Review.
ACM Trans. Softw. Eng. Methodol., September, 2023

<i>FaaSLight</i>: General Application-level Cold-start Latency Optimization for Function-as-a-Service in Serverless Computing.
ACM Trans. Softw. Eng. Methodol., September, 2023

Enabling Edge-Cloud Video Analytics for Robotics Applications.
IEEE Trans. Cloud Comput., 2023

Scalable and Efficient Full-Graph GNN Training for Large Graphs.
Proc. ACM Manag. Data, 2023

LLMCad: Fast and Scalable On-device Large Language Model Inference.
CoRR, 2023

Fast Distributed Inference Serving for Large Language Models.
CoRR, 2023

Energy-Efficient GPU Clusters Scheduling for Deep Learning.
CoRR, 2023

MuxFlow: Efficient and Safe GPU Sharing in Large-Scale Production Deep Learning Clusters.
CoRR, 2023

Automated Verification of an In-Production DNS Authoritative Engine.
Proceedings of the 29th Symposium on Operating Systems Principles, 2023

Halfmoon: Log-Optimal Fault-Tolerant Stateful Serverless Computing.
Proceedings of the 29th Symposium on Operating Systems Principles, 2023

Oobleck: Resilient Distributed Training of Large Models Using Pipeline Templates.
Proceedings of the 29th Symposium on Operating Systems Principles, 2023

Klotski: Efficient and Safe Network Migration of Large Production Datacenters.
Proceedings of the ACM SIGCOMM 2023 Conference, 2023

Understanding the Micro-Behaviors of Hardware Offloaded Network Stacks with Lumina.
Proceedings of the ACM SIGCOMM 2023 Conference, 2023

XRON: A Hybrid Elastic Cloud Overlay Network for Video Conferencing at Planetary Scale.
Proceedings of the ACM SIGCOMM 2023 Conference, 2023

Ditto: Efficient Serverless Analytics with Elastic Parallelism.
Proceedings of the ACM SIGCOMM 2023 Conference, 2023

AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving.
Proceedings of the 17th USENIX Symposium on Operating Systems Design and Implementation, 2023

Fast, Approximate Vector Queries on Very Large Unstructured Datasets.
Proceedings of the 20th USENIX Symposium on Networked Systems Design and Implementation, 2023

Transparent GPU Sharing in Container Clouds for Deep Learning Workloads.
Proceedings of the 20th USENIX Symposium on Networked Systems Design and Implementation, 2023

Niagara: Scheduling DNN Inference Services on Heterogeneous Edge Processors.
Proceedings of the Service-Oriented Computing - 21st International Conference, 2023

Disaggregated RAID Storage in Modern Datacenters.
Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023

ElasticFlow: An Elastic Serverless Training Platform for Distributed Deep Learning.
Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023

2022
MiCS: Near-linear Scaling for Training Gigantic Model on Public Cloud.
Proc. VLDB Endow., 2022

Orloj: Predictably Serving Unpredictable DNNs.
CoRR, 2022

LambdaLite: Application-Level Optimization for Cold Start Latency in Serverless Computing.
CoRR, 2022

Mandheling: Mixed-Precision On-Device DNN Training with DSP Offloading.
CoRR, 2022

Meissa: scalable network testing for programmable data planes.
Proceedings of the SIGCOMM '22: ACM SIGCOMM 2022 Conference, Amsterdam, The Netherlands, August 22, 2022

Multi-resource interleaving for deep learning training.
Proceedings of the SIGCOMM '22: ACM SIGCOMM 2022 Conference, Amsterdam, The Netherlands, August 22, 2022

NetVRM: Virtual Register Memory for Programmable Networks.
Proceedings of the 19th USENIX Symposium on Networked Systems Design and Implementation, 2022

Melon: breaking the memory wall for resource-efficient on-device machine learning.
Proceedings of the MobiSys '22: The 20th Annual International Conference on Mobile Systems, Applications and Services, Portland, Oregon, 27 June 2022, 2022

Mandheling: mixed-precision on-device DNN training with DSP offloading.
Proceedings of the ACM MobiCom '22: The 28th Annual International Conference on Mobile Computing and Networking, Sydney, NSW, Australia, October 17, 2022

Multi-objective congestion control.
Proceedings of the EuroSys '22: Seventeenth European Conference on Computer Systems, Rennes, France, April 5, 2022

Optimizing half precision Winograd convolution on ARM many-core processors.
Proceedings of the APSys '22: 13th ACM SIGOPS Asia-Pacific Workshop on Systems, Virtual Event, Singapore, August 23, 2022

2021
Demystifying Developers' Issues in Distributed Training of Deep Learning Software.
CoRR, 2021

Jaqen: A High-Performance Switch-Native Approach for Detecting and Mitigating Volumetric DDoS Attacks with Programmable Switches.
Proceedings of the 30th USENIX Security Symposium, 2021

Runtime Recovery of Web Applications under Zero-Day ReDoS Attacks.
Proceedings of the 42nd IEEE Symposium on Security and Privacy, 2021

An empirical study on challenges of application development in serverless computing.
Proceedings of the ESEC/FSE '21: 29th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2021

Network planning with deep reinforcement learning.
Proceedings of the ACM SIGCOMM 2021 Conference, Virtual Event, USA, August 23-27, 2021., 2021

Programmable packet scheduling with a single queue.
Proceedings of the ACM SIGCOMM 2021 Conference, Virtual Event, USA, August 23-27, 2021., 2021

Cost-effective data analytics across multiple cloud regions.
Proceedings of the SIGCOMM '21: ACM SIGCOMM 2021 Conference, 2021

Twenty Years After: Hierarchical Core-Stateless Fair Queueing.
Proceedings of the 18th USENIX Symposium on Networked Systems Design and Implementation, 2021

Ship Compute or Ship Data? Why Not Both?
Proceedings of the 18th USENIX Symposium on Networked Systems Design and Implementation, 2021

2020
RackSched: A Microsecond-Scale Scheduler for Rack-Scale Computers (Technical Report).
CoRR, 2020

Is Network the Bottleneck of Distributed Training?
Proceedings of the 2020 Workshop on Network Meets AI & ML, 2020

NetLock: Fast, Centralized Lock Management Using Programmable Switches.
Proceedings of the SIGCOMM '20: Proceedings of the 2020 Annual conference of the ACM Special Interest Group on Data Communication on the applications, 2020

RackSched: A Microsecond-Scale Scheduler for Rack-Scale Computers.
Proceedings of the 14th USENIX Symposium on Operating Systems Design and Implementation, 2020

Pegasus: Tolerating Skewed Workloads in Distributed Storage with In-Network Coherence Directories.
Proceedings of the 14th USENIX Symposium on Operating Systems Design and Implementation, 2020

PipeSwitch: Fast Pipelined Context Switching for Deep Learning Applications.
Proceedings of the 14th USENIX Symposium on Operating Systems Design and Implementation, 2020

On Efficient Constructions of Checkpoints.
Proceedings of the 37th International Conference on Machine Learning, 2020

Multitenancy for Fast and Programmable Networks in the Cloud.
Proceedings of the 12th USENIX Workshop on Hot Topics in Cloud Computing, 2020

Concerto: cooperative network-wide telemetry with controllable error rate.
Proceedings of the APSys '20: 11th ACM SIGOPS Asia-Pacific Workshop on Systems, 2020

2019
Harmonia: Near-Linear Scalability for Replicated Storage with In-Network Conflict Detection.
Proc. VLDB Endow., 2019

DistCache: Provable Load Balancing for Large-Scale Storage Systems with Distributed Caching.
Proceedings of the 2019 USENIX Annual Technical Conference, 2019

Neural packet classification.
Proceedings of the ACM Special Interest Group on Data Communication, 2019

Flash: efficient dynamic routing for offchain networks.
Proceedings of the 15th International Conference on Emerging Networking Experiments And Technologies, 2019

QPipe: quantiles sketch fully in the data plane.
Proceedings of the 15th International Conference on Emerging Networking Experiments And Technologies, 2019

2018
NetChain: Scale-Free Sub-RTT Coordination (Extended Version).
CoRR, 2018

AWStream: adaptive wide-area streaming analytics.
Proceedings of the 2018 Conference of the ACM Special Interest Group on Data Communication, 2018

ASAP: Fast, Approximate Graph Pattern Mining at Scale.
Proceedings of the 13th USENIX Symposium on Operating Systems Design and Implementation, 2018

NetChain: Scale-Free Sub-RTT Coordination.
Proceedings of the 15th USENIX Symposium on Networked Systems Design and Implementation, 2018

Proactive Video Push for Optimizing Bandwidth Consumption in Hybrid CDN-P2P VoD Systems.
Proceedings of the 2018 IEEE Conference on Computer Communications, 2018

Towards Fast and Scalable Graph Pattern Mining.
Proceedings of the 10th USENIX Workshop on Hot Topics in Cloud Computing, 2018

DumbNet: a smart data center network fabric with dumb switches.
Proceedings of the Thirteenth EuroSys Conference, 2018

2017
SnapLink: Fast and Accurate Vision-Based Appliance Control in Large Commercial Buildings.
Proc. ACM Interact. Mob. Wearable Ubiquitous Technol., 2017

NetCache: Balancing Key-Value Stores with Fast In-Network Caching.
Proceedings of the 26th Symposium on Operating Systems Principles, 2017

SketchVisor: Robust Network Measurement for Software Packet Processing.
Proceedings of the Conference of the ACM Special Interest Group on Data Communication, 2017

Competitive analysis for online scheduling in software-defined optical WAN.
Proceedings of the 2017 IEEE Conference on Computer Communications, 2017

Catalyst: Unlocking the Power of Choice to Speed up Network Updates.
Proceedings of the 13th International Conference on emerging Networking EXperiments and Technologies, 2017

2016
Dynamic Control of Software-Defined Networks
PhD thesis, 2016

Your Data Center Switch is Trying Too Hard.
Proceedings of the Symposium on SDN Research, 2016

Optimizing Bulk Transfers with Software-Defined Optical WAN.
Proceedings of the ACM SIGCOMM 2016 Conference, Florianopolis, Brazil, August 22-26, 2016, 2016

A 12-rack, 180-server datacenter network (DCN) using multiwavelength optical switching and full stack optimization.
Proceedings of the Optical Fiber Communications Conference and Exhibition, 2016

Increasing large-scale data center capacity by statistical power control.
Proceedings of the Eleventh European Conference on Computer Systems, 2016

2015
Can Accurate Predictions Improve Video Streaming in Cellular Networks?
Proceedings of the 16th International Workshop on Mobile Computing Systems and Applications, 2015

CoVisor: A Compositional Hypervisor for Software-Defined Networks.
Proceedings of the 12th USENIX Symposium on Networked Systems Design and Implementation, 2015

2014
Incremental update for a compositional SDN hypervisor.
Proceedings of the third workshop on Hot topics in software defined networking, 2014

Dynamic scheduling of network updates.
Proceedings of the ACM SIGCOMM 2014 Conference, 2014

2013
SoftCell: Taking Control of Cellular Core Networks
CoRR, 2013

Intra-data-center traffic engineering with ensemble routing.
Proceedings of the IEEE INFOCOM 2013, Turin, Italy, April 14-19, 2013, 2013

SoftCell: scalable and flexible cellular core network architecture.
Proceedings of the Conference on emerging Networking Experiments and Technologies, 2013

2012
Virtual Switching Without a Hypervisor for a More Secure Cloud.
Proceedings of the 2nd USENIX Workshop on Hot Topics in Management of Internet, 2012

2011
Quantitative Analysis of the VANET Connectivity: Theory and Application.
Proceedings of the 73rd IEEE Vehicular Technology Conference, 2011

Relative Link Quality Assessment and Hybrid Routing Scheme for Wireless Mesh Networks.
Proceedings of IEEE International Conference on Communications, 2011

A study of the VANET connectivity by percolation theory.
Proceedings of the 2011 IEEE Consumer Communications and Networking Conference, 2011


  Loading...