Shuai Zheng

Orcid: 0000-0003-3093-6486

Affiliations:

Amazon Web Services
Hong Kong University of Science and Technology, Department of Computer Science and Engineering (former)
Chinese Academy of Sciences, Institute of Computing Technology (former)

According to our database¹, Shuai Zheng authored at least 33 papers between 2012 and 2024.

Collaborative distances:

Dijkstra number² of three.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Bibliography

2024

DISTMM: Accelerating Distributed Multimodal Model Training.

[BibT_eX]

[DOI]

Proceedings of the 21st USENIX Symposium on Networked Systems Design and Implementation, 2024

Lancet: Accelerating Mixture-of-Experts Training via Whole Graph Computation-Communication Overlapping.

[BibT_eX]

[DOI]

Proceedings of the Seventh Annual Conference on Machine Learning and Systems, 2024

DynaPipe: Optimizing Multi-task Training through Dynamic Pipelines.

[BibT_eX]

[DOI]

Proceedings of the Nineteenth European Conference on Computer Systems, 2024

Slapo: A Schedule Language for Progressive Optimization of Large Deep Learning Model Training.

[BibT_eX]

[DOI]

Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024

2023

Decoupled Model Schedule for Deep Learning Training.

[BibT_eX]

[DOI]

CoRR, 2023

GEMINI: Fast Failure Recovery in Distributed Training with In-Memory Checkpoints.

[BibT_eX]

[DOI]

Proceedings of the 29th Symposium on Operating Systems Principles, 2023

Prompt Pre-Training with Twenty-Thousand Classes for Open-Vocabulary Visual Recognition.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

2022

MiCS: Near-linear Scaling for Training Gigantic Model on Public Cloud.

[BibT_eX]

[DOI]

Proc. VLDB Endow., 2022

SPT: Semi-Parametric Prompt Tuning for Multitask Prompted Learning.

[BibT_eX]

[DOI]

CoRR, 2022

SMILE: Scaling Mixture-of-Experts with Efficient Bi-level Routing.

[BibT_eX]

[DOI]

CoRR, 2022

Alexa Teacher Model: Pretraining and Distilling Multi-Billion-Parameter Encoders for Natural Language Understanding Systems.

[BibT_eX]

[DOI]

CoRR, 2022

DCAF-BERT: A Distilled Cachable Adaptable Factorized Model For Improved Ads CTR Prediction.

[BibT_eX]

[DOI]

Proceedings of the Companion of The Web Conference 2022, Virtual Event / Lyon, France, April 25, 2022

Alexa Teacher Model: Pretraining and Distilling Multi-Billion-Parameter Encoders for Natural Language Understanding Systems.

[BibT_eX]

[DOI]

Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Partial and Asymmetric Contrastive Learning for Out-of-Distribution Detection in Long-Tailed Recognition.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2022

Removing Batch Normalization Boosts Adversarial Training.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2022

2020

GluonCV and GluonNLP: Deep Learning in Computer Vision and Natural Language Processing.

[BibT_eX]

[DOI]

J. Mach. Learn. Res., 2020

Accelerated Large Batch Optimization of BERT Pretraining in 54 minutes.

[BibT_eX]

[DOI]

CoRR, 2020

CSER: Communication-efficient SGD with Error Reset.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

2019

GluonCV and GluonNLP: Deep Learning in Computer Vision and Natural Language Processing.

[BibT_eX]

[DOI]

CoRR, 2019

Blockwise Adaptivity: Faster Training and Better Generalization in Deep Learning.

[BibT_eX]

[DOI]

Shuai Zheng

James T. Kwok

CoRR, 2019

Give Me a Hint! Navigating Image Databases Using Human-in-the-Loop Feedback.

[BibT_eX]

[DOI]

Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2019

Communication-Efficient Distributed Blockwise Momentum SGD with Error-Feedback.

[BibT_eX]

[DOI]

Shuai Zheng

Ziyue Huang

James T. Kwok

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

2018

ModaNet: A Large-scale Street Fashion Dataset with Polygon Annotations.

[BibT_eX]

[DOI]

Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Lightweight Stochastic Optimization for Minimizing Finite Sums with Infinite Data.

[BibT_eX]

[DOI]

Shuai Zheng

James Tin-Yau Kwok

Proceedings of the 35th International Conference on Machine Learning, 2018

Conditional Image-Text Embedding Networks.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2018, 2018

2017

Follow the Moving Leader in Deep Learning.

[BibT_eX]

[DOI]

Shuai Zheng

James T. Kwok

Proceedings of the 34th International Conference on Machine Learning, 2017

2016

Fast-and-Light Stochastic ADMM.

[BibT_eX]

[DOI]

Shuai Zheng

James T. Kwok

Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Asynchronous Distributed Semi-Stochastic Gradient Optimization.

[BibT_eX]

[DOI]

Ruiliang Zhang

Shuai Zheng

James T. Kwok

Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

Fast Nonsmooth Regularized Risk Minimization with Continuation.

[BibT_eX]

[DOI]

Shuai Zheng

Ruiliang Zhang

James T. Kwok

Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015

Fast Distributed Asynchronous SGD with Variance Reduction.

[BibT_eX]

[DOI]

Ruiliang Zhang

Shuai Zheng

James T. Kwok

CoRR, 2015

2014

Accurate Integration of Aerosol Predictions by Smoothing on a Manifold.

[BibT_eX]

[DOI]

Shuai Zheng

James T. Kwok

Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

2013

Flexible navigation in smartphones and tablets using scalable storyboards.

[BibT_eX]

[DOI]

Shuai Zheng

Luis Herranz

Shuqiang Jiang

Proceedings of the International Conference on Multimedia Retrieval, 2013

2012

Basis Technology at TAC 2012 Entity Linking.

[BibT_eX]

[DOI]

Proceedings of the Fifth Text Analysis Conference, 2012

Shuai Zheng

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...