Haibin Lin

Orcid: 0000-0003-4879-5335

According to our database, Haibin Lin authored at least 35 papers between 2013 and 2024.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2024

Minder: Faulty Machine Detection for Large-scale Distributed Model Training.

CoRR, 2024

SDP4Bit: Toward 4-bit Communication Quantization in Sharded Data Parallelism for LLM Training.

Co-authors include Chengming Zhang.

CoRR, 2024

HybridFlow: A Flexible and Efficient RLHF Framework.

Co-authors include Guangming Sheng.

CoRR, 2024

Optimus: Accelerating Large-Scale Multi-Modal LLM Training by Bubble Exploitation.

CoRR, 2024

ByteCheckpoint: A Unified Checkpointing System for LLM Development.

CoRR, 2024

FLUX: Fast Software-based Communication Overlap On GPUs Through Kernel Fusion.

Co-authors include Chengquan Jiang.

CoRR, 2024

LLM-PQ: Serving LLM on Heterogeneous Clusters with Phase-Aware Partition and Adaptive Quantization.

CoRR, 2024

POSTER: LLM-PQ: Serving LLM on Heterogeneous Clusters with Phase-Aware Partition and Adaptive Quantization.

Proceedings of the 29th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, 2024

MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs.

Proceedings of the 21st USENIX Symposium on Networked Systems Design and Implementation, 2024

QSync: Quantization-Minimized Synchronous Distributed Training Across Hybrid Devices.

Proceedings of the IEEE International Parallel and Distributed Processing Symposium, 2024

LEMON: Lossless model expansion.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

CDMPP: A Device-Model Agnostic Framework for Latency Prediction of Tensor Programs.

Proceedings of the Nineteenth European Conference on Computer Systems, 2024

2023

Hi-Speed DNN Training with Espresso: Unleashing the Full Potential of Gradient Compression with Near-Optimal Usage Strategies.

Co-authors include T. S. Eugene Ng.

Proceedings of the Eighteenth European Conference on Computer Systems, 2023

2022

Espresso: Revisiting Gradient Compression from the System Perspective.

Co-authors include T. S. Eugene Ng.

CoRR, 2022

dPRO: A Generic Profiling and Optimization System for Expediting Distributed DNN Training.

CoRR, 2022

SAPipe: Staleness-Aware Pipeline for Data Parallel DNN Training.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

dPRO: A Generic Performance Diagnosis and Optimization Toolkit for Expediting Distributed DNN Training.

Proceedings of the Fifth Conference on Machine Learning and Systems, 2022

ResNeSt: Split-Attention Networks.

Co-authors include Alexander J. Smola.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

The World of 5G - Volume 5: Intelligent Medicine

World Scientific, ISBN: 9789811244216, 2022

2021

Compressed Communication for Distributed Training: Adaptive Methods and System.

CoRR, 2021

2020

GluonCV and GluonNLP: Deep Learning in Computer Vision and Natural Language Processing.

J. Mach. Learn. Res., 2020

Accelerated Large Batch Optimization of BERT Pretraining in 54 minutes.

CoRR, 2020

Is Network the Bottleneck of Distributed Training?

Proceedings of the 2020 Workshop on Network Meets AI & ML, 2020

CSER: Communication-efficient SGD with Error Reset.

Co-authors include Oluwasanmi Koyejo.

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Temporal-Contextual Recommendation in Real-Time.

Co-authors include Balakrishnan (Murali) Narayanaswamy.

Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

2019

Local AdaAlter: Communication-Efficient Stochastic Gradient Descent with Adaptive Learning Rates.

Co-authors include Oluwasanmi Koyejo.

CoRR, 2019

Deep Graph Library: Towards Efficient and Scalable Deep Learning on Graphs.

Co-authors include Alexander J. Smola.

CoRR, 2019

GluonCV and GluonNLP: Deep Learning in Computer Vision and Natural Language Processing.

CoRR, 2019

Dynamic Mini-batch SGD for Elastic Distributed Training: Learning in the Limbo of Resources.

CoRR, 2019

Just-in-Time Dynamic-Batching.

CoRR, 2019

Dive into Deep Learning for Natural Language Processing.

Co-authors include Alexander J. Smola.

Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

2017

Self-Driving Database Management Systems.

Co-authors include Prashanth Menon, Siddharth Santurkar, and Anthony Tomasic.

Proceedings of the 8th Biennial Conference on Innovative Data Systems Research, 2017

2013

Salience-based feature preserving resizing for 3D models.

Proceedings of the SIGGRAPH Asia 2013, 2013

3D reconstruction of complex geometric solids from 2D line drawings.

Proceedings of the SIGGRAPH Asia 2013, 2013

Visual Saliency Guided Global and Local Resizing for 3D Models.

Proceedings of the 2013 International Conference on Computer-Aided Design and Computer Graphics, 2013
