We stand with Ukraine

We stand with Ukraine

Minchuan Chen

Orcid: 0009-0001-1512-6672

According to our database¹, Minchuan Chen authored at least 18 papers between 2019 and 2024.

Collaborative distances:

Dijkstra number² of five.
Erdős number³ of four.

Timeline

2019

2020

2021

2022

2023

2024

0

1

2

3

4

5

6

7

1

2

4

1

6

3

1

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2024

EfficientTTS 2: Variational End-to-End Text-to-Speech Synthesis and Voice Conversion.

[BibT_eX]

[DOI]

,

,

,

,

,

IEEE ACM Trans. Audio Speech Lang. Process., 2024

DFlow: A Generative Model Combining Denoising AutoEncoder and Normalizing Flow for High Fidelity Waveform Generation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the Forty-first International Conference on Machine Learning, 2024

ESVC: Combining Adaptive Style Fusion and Multi-Level Feature Disentanglement for Expressive Singing Voice Conversion.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the IEEE International Conference on Acoustics, 2024

2023

Relative Boundary Modeling: A High-Resolution Cricket Bowl Release Detection Framework with I3D Features.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

Proceedings of the 6th International Workshop on Multimedia Content Analysis in Sports, 2023

Exploring Loss Function and Rank Fusion for Enhanced Person Re-identification.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

Proceedings of the 6th International Workshop on Multimedia Content Analysis in Sports, 2023

Image- and Instance-Level Data Augmentation for Occluded Instance Segmentation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the 6th International Workshop on Multimedia Content Analysis in Sports, 2023

Exploring multi-task learning and data augmentation in dementia detection with self-supervised pretrained models.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

2022

A compact transformer-based GAN vocoder.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

2021

EfficientSing: A Chinese Singing Voice Synthesis System Using Duration-Free Acoustic Model and HiFi-GAN Vocoder.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Improving Polyphone Disambiguation for Mandarin Chinese by Combining Mix-Pooling Strategy and Window-Based Attention.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

EfficientTTS: An Efficient and High-Quality Text-to-Speech Architecture.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the 38th International Conference on Machine Learning, 2021

Unsupervised Learning for Multi-Style Speech Synthesis with Limited Data.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the IEEE International Conference on Acoustics, 2021

Improving Neural Text Normalization with Partial Parameter Generator and Pointer-Generator Network.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the IEEE International Conference on Acoustics, 2021

PHMOSpell: Phonological and Morphological Knowledge Guided Chinese Spelling Check.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020

Non-Parallel Voice Conversion with Fewer Labeled Data by Conditional Generative Adversarial Networks.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Nonparallel Emotional Speech Conversion Using VAE-GAN.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Flow-TTS: A Non-Autoregressive Network for Text to Speech Based on Flow.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019

Cross-Lingual, Multi-Speaker Text-To-Speech Synthesis Using Neural Speaker Embedding.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Loading...