Yichong Leng

ORCID: 0009-0003-3440-074X

According to our database, Yichong Leng authored at least 30 papers between 2019 and 2024.

Bibliography

2024
NaturalSpeech: End-to-End Text-to-Speech Synthesis With Human-Level Quality.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2024

Qwen2-Audio Technical Report.
CoRR, 2024

NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models.
CoRR, 2024

Sentence-Level or Token-Level? A Comprehensive Study on Knowledge Distillation.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

NaturalSpeech 2: Latent Diffusion Models are Natural and Zero-Shot Speech and Singing Synthesizers.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

PromptTTS 2: Describing and Generating Voices with Text Prompt.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

AIR-Bench: Benchmarking Large Audio-Language Models via Generative Comprehension.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
PromptTTS 2: Describing and Generating Voices with Text Prompt.
CoRR, 2023

Extract and Attend: Improving Entity Translation in Neural Machine Translation.
CoRR, 2023

Retriever and Ranker Framework with Probabilistic Hard Negative Sampling for Code Search.
CoRR, 2023

NaturalSpeech 2: Latent Diffusion Models are Natural and Zero-Shot Speech and Singing Synthesizers.
CoRR, 2023

PromptTTS: Controllable Text-to-Speech with Text Descriptions.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2023

Extract and Attend: Improving Entity Translation in Neural Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

SoftCorrect: Error Correction with Soft Detection for Automatic Speech Recognition.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to Speech.
CoRR, 2022

Transcormer: Transformer for Sentence Scoring with Sliding Language Modeling.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

BinauralGrad: A Two-Stage Conditional Diffusion Probabilistic Model for Binaural Audio Synthesis.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Analyzing and Mitigating Interference in Neural Architecture Search.
Proceedings of the International Conference on Machine Learning, 2022

A Study on the Efficacy of Model Pre-Training In Developing Neural Text-to-Speech System.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2022

Mask the Correct Tokens: An Embarrassingly Simple Approach for Error Correction.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

2021
FastCorrect 2: Fast Error Correction on Multiple Candidates for Automatic Speech Recognition.
CoRR, 2021

FastCorrect: Fast Error Correction with Edit Alignment for Automatic Speech Recognition.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Speech-T: Transducer for Text to Speech and Beyond.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

MBNET: MOS Prediction for Synthesized Speech with Mean-Bias Network.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2021

FastCorrect 2: Fast Error Correction on Multiple Candidates for Automatic Speech Recognition.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

2019
A Study of Multilingual Neural Machine Translation.
CoRR, 2019

Microsoft Research Asia's Systems for WMT19.
CoRR, 2019

Microsoft Research Asia's Systems for WMT19.
Proceedings of the Fourth Conference on Machine Translation, 2019

Unsupervised Pivot Translation for Distant Languages.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

