Hao Zhang

Orcid: 0009-0003-8392-3977

Affiliations:
  • University of California, San Diego, CA, USA
  • University of California, Berkeley, CA, USA


According to our database, Hao Zhang authored at least 24 papers between 2021 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.


Bibliography

2024
Efficient LLM Scheduling by Learning to Rank.
CoRR, 2024

MPC-Minimized Secure LLM Inference.
CoRR, 2024

Empowering 1000 tokens/second on-device LLM prefilling with mllm-NPU.
CoRR, 2024

Optimizing Speculative Decoding for Serving Large Language Models Using Goodput.
CoRR, 2024

Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length.
CoRR, 2024

Toward Inference-optimal Mixture-of-Expert Large Language Models.
CoRR, 2024

MuxServe: Flexible Multiplexing for Efficient Multiple LLM Serving.
CoRR, 2024

DistServe: Disaggregating Prefill and Decoding for Goodput-optimized Large Language Model Serving.
Proceedings of the 18th USENIX Symposium on Operating Systems Design and Implementation, 2024

Online Speculative Decoding.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Break the Sequential Dependency of LLM Inference Using Lookahead Decoding.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

MuxServe: Flexible Spatial-Temporal Multiplexing for Multiple LLM Serving.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation Dataset.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

AdaMoE: Token-Adaptive Routing with Null Experts for Mixture-of-Experts Language Models.
Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

WiP: Efficient LLM Prefilling with Mobile NPU.
Proceedings of the Workshop on Edge and Mobile Foundation Models, 2024

2023
LightSeq: Sequence Level Parallelism for Distributed Training of Long Context Transformers.
CoRR, 2023

Efficient Memory Management for Large Language Model Serving with PagedAttention.
Proceedings of the 29th Symposium on Operating Systems Principles, 2023

AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving.
Proceedings of the 17th USENIX Symposium on Operating Systems Design and Implementation, 2023

Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena.
Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

On Optimizing the Communication of Model Parallelism.
Proceedings of the Sixth Conference on Machine Learning and Systems, 2023

2022
Alpa: Automating Inter- and Intra-Operator Parallelism for Distributed Deep Learning.
CoRR, 2022

Alpa: Automating Inter- and Intra-Operator Parallelism for Distributed Deep Learning.
Proceedings of the 16th USENIX Symposium on Operating Systems Design and Implementation, 2022

2021
Simple and Automatic Distributed Machine Learning on Ray.
Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD '21), 2021

TeraPipe: Token-Level Pipeline Parallelism for Training Large-Scale Language Models.
Proceedings of the 38th International Conference on Machine Learning, 2021
