Heyang Qin

Orcid: 0000-0003-0994-502X

According to our database1, Heyang Qin authored at least 12 papers between 2019 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone.
CoRR, 2024

DeepSpeed-FastGen: High-throughput Text Generation for LLMs via MII and DeepSpeed-Inference.
CoRR, 2024

ZeRO++: Extremely Efficient Collective Communication for Large Model Training.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
DeepSpeed-VisualChat: Multi-Round Multi-Image Interleave Chat via Multi-Modal Causal Attention.
CoRR, 2023

DeepSpeed-Chat: Easy, Fast and Affordable RLHF Training of ChatGPT-like Models at All Scales.
CoRR, 2023

ZeRO++: Extremely Efficient Collective Communication for Giant Model Training.
CoRR, 2023

2022
Scalable and Efficient Machine Learning as a Service.
PhD thesis, 2022

Nemo: An Open-Source Transformer-Supercharged Benchmark for Fine-Grained Wildfire Smoke Detection.
Remote. Sens., 2022

2021
SimiGrad: Fine-Grained Adaptive Batching for Large Scale Training using Gradient Similarity Measurement.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

The Age of Correlated Features in Supervised Learning based Forecasting.
Proceedings of the 2021 IEEE Conference on Computer Communications Workshops, 2021

2020
Reinforcement-Learning-Empowered MLaaS Scheduling for Serving Intelligent Internet of Things.
IEEE Internet Things J., 2020

2019
Swift machine learning model serving scheduling: a region based reinforcement learning approach.
Proceedings of the International Conference for High Performance Computing, 2019


  Loading...