Chen Zhang

Orcid: 0009-0004-3596-4683

Affiliations:
  • Zhejiang University, Hangzhou, China


According to our database, Chen Zhang authored at least 24 papers between 2019 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
NaturalSpeech: End-to-End Text-to-Speech Synthesis With Human-Level Quality.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2024

2023
C2G2: Controllable Co-speech Gesture Generation with Latent Diffusion Model.
CoRR, 2023

Mega-TTS: Zero-Shot Text-to-Speech at Scale with Intrinsic Inductive Bias.
CoRR, 2023

Ada-TTA: Towards Adaptive High-Quality Text-to-Talking Avatar Synthesis.
CoRR, 2023

Make-An-Audio 2: Temporal-Enhanced Text-to-Audio Generation.
CoRR, 2023

SongDriver2: Real-time Emotion-based Music Arrangement with Soft Transition.
CoRR, 2023

Bag of Tricks for Unsupervised Text-to-Speech.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

LeanSpeech: The Microsoft Lightweight Speech Synthesis System for Limmits Challenge 2023.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2023

2022
VideoDubber: Machine Translation with Speech-Aware Length Control for Video Dubbing.
CoRR, 2022

SDMuse: Stochastic Differential Music Editing and Generation via Hybrid Representation.
CoRR, 2022

S3T: Self-Supervised Pre-training with Swin Transformer for Music Classification.
CoRR, 2022

Towards Effective Multi-Modal Interchanges in Zero-Resource Sounding Object Localization.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

ReLyMe: Improving Lyric-to-Melody Generation by Incorporating Lyric-Melody Relationships.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

SongDriver: Real-time Music Accompaniment Generation without Logical Latency nor Exposure Bias.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

PDAugment: Data Augmentation by Pitch and Duration Adjustments for Automatic Lyrics Transcription.
Proceedings of the 23rd International Society for Music Information Retrieval Conference, 2022

S3T: Self-Supervised Pre-Training with Swin Transformer For Music Classification.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2022

TeleMelody: Lyric-to-Melody Generation with a Template-Based Two-Stage Method.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Automatic Song Translation for Tonal Languages.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

2021
Denoispeech: Denoising Text to Speech with Frame-Level Noise Modeling.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2021

UWSpeech: Speech to Speech Translation for Unwritten Languages.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
FastLR: Non-Autoregressive Lipreading Model with Integrate-and-Fire.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Task-Level Curriculum Learning for Non-Autoregressive Neural Machine Translation.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

SimulSpeech: End-to-End Simultaneous Speech to Text Translation.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Discriminative and Correlative Partial Multi-Label Learning.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

