HierSpeech: Bridging the Gap between Text and Speech by Hierarchical Variational Inference using Self-supervised Representations for Speech Synthesis.

[BibT_eX]

[DOI]

Sang-Hoon Lee

Seung-Bin Kim

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Language Model-Based Emotion Prediction Methods for Emotional Speech Synthesis Systems.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

TTS-by-TTS 2: Data-Selective Augmentation for Neural Speech Synthesis Using Ranking Support Vector Machine with Variational Autoencoder.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Effective Data Augmentation Methods for Neural Text-to-Speech Systems.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Electronics, Information, and Communication, 2022

Linear Prediction-based Parallel WaveGAN Speech Synthesis.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Electronics, Information, and Communication, 2022

2021

Improved Parallel Wavegan Vocoder with Perceptually Weighted Spectrogram Loss.

[BibT_eX]

[DOI]

Proceedings of the IEEE Spoken Language Technology Workshop, 2021

LiteTTS: A Lightweight Mel-Spectrogram-Free Text-to-Wave Synthesizer Based on Generative Adversarial Networks.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

High-Fidelity Parallel WaveGAN with Multi-Band Harmonic-Plus-Noise Model.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Parallel Waveform Synthesis Based on Generative Adversarial Networks with Voicing-Aware Conditional Discriminators.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

TTS-by-TTS: TTS-Driven Data Augmentation for Fast and High-Quality Speech Synthesis.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

2020

Neural Text-to-Speech with a Modeling-by-Generation Excitation Vocoder.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Improving LPCNET-Based Text-to-Speech with Linear Prediction-Structured Mixture Density Network.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

ExcitGlow: Improving a WaveGlow-based Neural Vocoder with Linear Prediction Analysis.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

LP-WaveNet: Linear Prediction-based WaveNet Speech Synthesis.

[BibT_eX]

[DOI]

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

2019

Parameter Enhancement for MELP Speech Codec in Noisy Communication Environment.

[BibT_eX]

[DOI]

Min-Jae Hwang

Hong-Goo Kang

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

2018

SVD-Based Adaptive QIM Watermarking on Stereo Audio Signals.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2018

LP-WaveNet: Linear Prediction-based WaveNet Speech Synthesis.

[BibT_eX]

[DOI]

CoRR, 2018

A Unified Framework for the Generation of Glottal Signals in Deep Learning-based Parametric Speech Synthesis Systems.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Modeling-By-Generation-Structured Noise Compensation Algorithm for Glottal Vocoding Speech Synthesis System.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Min-Jae Hwang

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...