Shuai Zheng

Orcid: 0000-0003-3093-6486

Affiliations:
  • Amazon Web Services
  • Hong Kong University of Science and Technology, Department of Computer Science and Engineering (former)
  • Chinese Academy of Sciences, Institute of Computing Technology (former)


According to our database, Shuai Zheng authored at least 33 papers between 2012 and 2024.

Bibliography

2024
DISTMM: Accelerating Distributed Multimodal Model Training.
Proceedings of the 21st USENIX Symposium on Networked Systems Design and Implementation, 2024

Lancet: Accelerating Mixture-of-Experts Training via Whole Graph Computation-Communication Overlapping.
Proceedings of the Seventh Annual Conference on Machine Learning and Systems, 2024

DynaPipe: Optimizing Multi-task Training through Dynamic Pipelines.
Proceedings of the Nineteenth European Conference on Computer Systems, 2024

Slapo: A Schedule Language for Progressive Optimization of Large Deep Learning Model Training.
Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024

2023
Decoupled Model Schedule for Deep Learning Training.
CoRR, 2023

GEMINI: Fast Failure Recovery in Distributed Training with In-Memory Checkpoints.
Proceedings of the 29th Symposium on Operating Systems Principles, 2023

Prompt Pre-Training with Twenty-Thousand Classes for Open-Vocabulary Visual Recognition.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

2022
MiCS: Near-linear Scaling for Training Gigantic Model on Public Cloud.
Proc. VLDB Endow., 2022

SPT: Semi-Parametric Prompt Tuning for Multitask Prompted Learning.
CoRR, 2022

SMILE: Scaling Mixture-of-Experts with Efficient Bi-level Routing.
CoRR, 2022

Alexa Teacher Model: Pretraining and Distilling Multi-Billion-Parameter Encoders for Natural Language Understanding Systems.
CoRR, 2022

DCAF-BERT: A Distilled Cachable Adaptable Factorized Model For Improved Ads CTR Prediction.
Proceedings of the Companion of The Web Conference 2022, Virtual Event / Lyon, France, April 25, 2022


Partial and Asymmetric Contrastive Learning for Out-of-Distribution Detection in Long-Tailed Recognition.
Proceedings of the International Conference on Machine Learning, 2022

Removing Batch Normalization Boosts Adversarial Training.
Proceedings of the International Conference on Machine Learning, 2022

2020
GluonCV and GluonNLP: Deep Learning in Computer Vision and Natural Language Processing.
J. Mach. Learn. Res., 2020

Accelerated Large Batch Optimization of BERT Pretraining in 54 minutes.
CoRR, 2020

CSER: Communication-efficient SGD with Error Reset.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

2019
GluonCV and GluonNLP: Deep Learning in Computer Vision and Natural Language Processing.
CoRR, 2019

Blockwise Adaptivity: Faster Training and Better Generalization in Deep Learning.
CoRR, 2019

Give Me a Hint! Navigating Image Databases Using Human-in-the-Loop Feedback.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2019

Communication-Efficient Distributed Blockwise Momentum SGD with Error-Feedback.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

2018
ModaNet: A Large-scale Street Fashion Dataset with Polygon Annotations.
Proceedings of the 2018 ACM Multimedia Conference, 2018

Lightweight Stochastic Optimization for Minimizing Finite Sums with Infinite Data.
Proceedings of the 35th International Conference on Machine Learning, 2018

Conditional Image-Text Embedding Networks.
Proceedings of the European Conference on Computer Vision (ECCV 2018), 2018

2017
Follow the Moving Leader in Deep Learning.
Proceedings of the 34th International Conference on Machine Learning, 2017

2016
Fast-and-Light Stochastic ADMM.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Asynchronous Distributed Semi-Stochastic Gradient Optimization.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

Fast Nonsmooth Regularized Risk Minimization with Continuation.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Fast Distributed Asynchronous SGD with Variance Reduction.
CoRR, 2015

2014
Accurate Integration of Aerosol Predictions by Smoothing on a Manifold.
Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014

2013
Flexible navigation in smartphones and tablets using scalable storyboards.
Proceedings of the International Conference on Multimedia Retrieval, 2013

2012
Basis Technology at TAC 2012 Entity Linking.
Proceedings of the Fifth Text Analysis Conference, 2012

