Minchuan Chen

Orcid: 0009-0001-1512-6672

According to our database, Minchuan Chen authored at least 18 papers between 2019 and 2024.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2024
EfficientTTS 2: Variational End-to-End Text-to-Speech Synthesis and Voice Conversion.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

DFlow: A Generative Model Combining Denoising AutoEncoder and Normalizing Flow for High Fidelity Waveform Generation.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

ESVC: Combining Adaptive Style Fusion and Multi-Level Feature Disentanglement for Expressive Singing Voice Conversion.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Relative Boundary Modeling: A High-Resolution Cricket Bowl Release Detection Framework with I3D Features.
Proceedings of the 6th International Workshop on Multimedia Content Analysis in Sports, 2023

Exploring Loss Function and Rank Fusion for Enhanced Person Re-identification.
Proceedings of the 6th International Workshop on Multimedia Content Analysis in Sports, 2023

Image- and Instance-Level Data Augmentation for Occluded Instance Segmentation.
Proceedings of the 6th International Workshop on Multimedia Content Analysis in Sports, 2023

Exploring multi-task learning and data augmentation in dementia detection with self-supervised pretrained models.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

2022
A compact transformer-based GAN vocoder.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

2021
EfficientSing: A Chinese Singing Voice Synthesis System Using Duration-Free Acoustic Model and HiFi-GAN Vocoder.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Improving Polyphone Disambiguation for Mandarin Chinese by Combining Mix-Pooling Strategy and Window-Based Attention.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

EfficientTTS: An Efficient and High-Quality Text-to-Speech Architecture.
Proceedings of the 38th International Conference on Machine Learning, 2021

Unsupervised Learning for Multi-Style Speech Synthesis with Limited Data.
Proceedings of the IEEE International Conference on Acoustics, 2021

Improving Neural Text Normalization with Partial Parameter Generator and Pointer-Generator Network.
Proceedings of the IEEE International Conference on Acoustics, 2021

PHMOSpell: Phonological and Morphological Knowledge Guided Chinese Spelling Check.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
Non-Parallel Voice Conversion with Fewer Labeled Data by Conditional Generative Adversarial Networks.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Nonparallel Emotional Speech Conversion Using VAE-GAN.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Flow-TTS: A Non-Autoregressive Network for Text to Speech Based on Flow.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Cross-Lingual, Multi-Speaker Text-To-Speech Synthesis Using Neural Speaker Embedding.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

