Sheng Zhao
Orcid: 0000-0002-9624-5381
According to our database1,
Sheng Zhao
authored at least 116 papers
between 1996 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2024
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024
Human-Vehicle Shared Steering Control for Obstacle Avoidance: A Reference-Free Approach With Reinforcement Learning.
IEEE Trans. Intell. Transp. Syst., November, 2024
Big Data Cogn. Comput., September, 2024
IEEE Trans. Pattern Anal. Mach. Intell., June, 2024
CoRR, 2024
Laugh Now Cry Later: Controlling Time-Varying Emotional States of Flow-Matching-Based Zero-Shot Text-to-Speech.
CoRR, 2024
VALL-E R: Robust and Efficient Zero-Shot Text-to-Speech Synthesis via Monotonic Alignment.
CoRR, 2024
CoRR, 2024
VALL-E 2: Neural Codec Language Models are Human Parity Zero-Shot Text to Speech Synthesizers.
CoRR, 2024
CoRR, 2024
CoVoMix: Advancing Zero-Shot Speech Generation for Human-like Multi-talker Conversations.
CoRR, 2024
RALL-E: Robust Codec Language Modeling with Chain-of-Thought Prompting for Text-to-Speech Synthesis.
CoRR, 2024
NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models.
CoRR, 2024
UniStyle: Unified Style Modeling for Speaking Style Captioning and Stylistic Speech Synthesis.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
NaturalSpeech 2: Latent Diffusion Models are Natural and Zero-Shot Speech and Singing Synthesizers.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
2023
Two-Stage Optimal Trajectory Planning Based on Resilience Adjustment Model for Virtually Coupled Trains.
IEEE Trans. Intell. Transp. Syst., December, 2023
IEEE J. Sel. Top. Signal Process., November, 2023
Robust adaptive Unscented Kalman Filter with gross error detection and identification for power system forecasting-aided state estimation.
J. Frankl. Inst., September, 2023
The First High-quality Reference Genome of Sika Deer Provides Insights into High-tannin Adaptation.
Genom. Proteom. Bioinform., 2023
CoRR, 2023
CoRR, 2023
NaturalSpeech 2: Latent Diffusion Models are Natural and Zero-Shot Speech and Singing Synthesizers.
CoRR, 2023
Speak Foreign Languages with Your Own Voice: Cross-Lingual Neural Codec Language Modeling.
CoRR, 2023
CoRR, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
DAE-Talker: High Fidelity Speech-Driven Talking Face Generation with Diffusion Autoencoder.
Proceedings of the 31st ACM International Conference on Multimedia, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
LeanSpeech: The Microsoft Lightweight Speech Synthesis System for Limmits Challenge 2023.
Proceedings of the IEEE International Conference on Acoustics, 2023
Improving Contextual Spelling Correction by External Acoustics Attention and Semantic Aware Data Augmentation.
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the 18th Blizzard Challenge Workshop, Grenoble, France, August 29, 2023, 2023
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
2022
Towards Contextual Spelling Correction for Customization of End-to-End Speech Recognition Systems.
IEEE ACM Trans. Audio Speech Lang. Process., 2022
CoRR, 2022
BinauralGrad: A Two-Stage Conditional Diffusion Probabilistic Model for Binaural Audio Synthesis.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
MeloForm: Generating Melody with Musical Form based on Expert Systems and Neural Networks.
Proceedings of the 23rd International Society for Music Information Retrieval Conference, 2022
Mixed-Phoneme BERT: Improving BERT with Mixed Phoneme and Sup-Phoneme Representations for Text to Speech.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
DelightfulTTS 2: End-to-End Speech Synthesis with Adversarial Vector-Quantized Auto-Encoders.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
A Study on the Efficacy of Model Pre-Training In Developing Neural Text-to-Speech System.
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Infergrad: Improving Diffusion Models for Vocoder by Considering Inference in Training.
Proceedings of the IEEE International Conference on Acoustics, 2022
2021
Design and Adaptive Control of Matrix Transformer Based Indirect Converter for Large-Capacity Circuit Breaker Testing Application.
IEEE Trans. Ind. Electron., 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
A Light-Weight Contextual Spelling Correction Model for Customizing Transducer-Based Speech Recognition Systems.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 9th International Conference on Learning Representations, 2021
Proceedings of the 9th International Conference on Learning Representations, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the Blizzard Challenge 2021, virtual, October 23, 2021, 2021
2020
Vital Sign Detection during Large-Scale and Fast Body Movements Based on an Adaptive Noise Cancellation Algorithm Using a Single Doppler Radar Sensor.
Sensors, 2020
IEICE Electron. Express, 2020
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Developing RNN-T Models Surpassing High-Performance Hybrid Models with Customization Capability.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
MoBoAligner: A Neural Alignment Model for Non-Autoregressive TTS with Monotonic Boundary Search.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
2019
Correlation Analysis of Breast Cancer DWI Combined with DCE-MRI Imaging Features with Molecular Subtypes and Prognostic Factors.
J. Medical Syst., 2019
J. Medical Syst., 2019
A Methodology of Timing Co-Evolutionary Path Optimization for Accident Emergency Rescue Considering Future Environmental Uncertainty.
IEEE Access, 2019
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019
A Resilience Adjustment Method for Real-time Cooperative Optimization of High-speed Trains.
Proceedings of the 2019 IEEE Intelligent Transportation Systems Conference, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019
Proceedings of the 36th International Conference on Machine Learning, 2019
Dilated Residual Network with Multi-head Self-attention for Speech Emotion Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2019
Knowledge Distillation from Bert in Pre-Training and Fine-Tuning for Polyphone Disambiguation.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
2018
Proceedings of the 21st International Conference on Intelligent Transportation Systems, 2018
Proceedings of the 8th IEEE International Conference on Consumer Electronics - Berlin, 2018
2016
High-Precision Vehicle Navigation in Urban Environments Using an MEM's IMU and Single-Frequency GPS Receiver.
IEEE Trans. Intell. Transp. Syst., 2016
Computationally Efficient Carrier Integer Ambiguity Resolution in Multiepoch GPS/INS: A Common-Position-Shift Approach.
IEEE Trans. Control. Syst. Technol., 2016
Synthesis and Characterization of Magnetic Polyvinyl Alcohol (PVA) Hydrogel Microspheres for the Embolization of Blood Vessel.
IEEE Trans. Biomed. Eng., 2016
2015
Int. J. Online Eng., 2015
2014
Proceedings of the 53rd IEEE Conference on Decision and Control, 2014
2013
Quaternion-based trajectory tracking control of VTOL-UAVs using command filtered backstepping.
Proceedings of the American Control Conference, 2013
Proceedings of the IEEE International Conference on Control Applications, 2013
2012
Int. J. Robotics Autom., 2012
Proceedings of the 11th IEEE International Conference on Trust, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the 2012 Second International Conference on Cloud and Green Computing, 2012
2011
Proceedings of the 50th IEEE Conference on Decision and Control and European Control Conference, 2011
Proceedings of the American Control Conference, 2011
2010
Proceedings of the American Control Conference, 2010
2008
2005
J. Comput. Biol., 2005
2003
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003
2002
Proceedings of the 2002 International Symposium on Chinese Spoken Language Processing, 2002
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
Proceedings of the First Workshop on Chinese Language Processing, 2002
1997
Proceedings of International Conference on Neural Networks (ICNN'97), 1997
1996
A Protein Class Database Organized with ProSite Protein Groups and PIR Superfamilies.
J. Comput. Biol., 1996
Comput. Appl. Biosci., 1996