Eunwoo Song
According to our database1,
Eunwoo Song
authored at least 39 papers
between 2013 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2024
Training Universal Vocoders with Feature Smoothing-Based Augmentation Methods for High-Quality TTS Systems.
CoRR, 2024
Enhancing Multilingual TTS with Voice Conversion Based Data Augmentation and Posterior Embedding.
Proceedings of the IEEE International Conference on Acoustics, 2024
2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Period VITS: Variational Inference with Explicit Pitch Modeling for End-To-End Emotional Speech Synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2023
2022
HierSpeech: Bridging the Gap between Text and Speech by Hierarchical Variational Inference using Self-supervised Representations for Speech Synthesis.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Language Model-Based Emotion Prediction Methods for Emotional Speech Synthesis Systems.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Cross-Speaker Emotion Transfer for Low-Resource Text-to-Speech Using Non-Parallel Voice Conversion with Pitch-Shift Data Augmentation.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
TTS-by-TTS 2: Data-Selective Augmentation for Neural Speech Synthesis Using Ranking Support Vector Machine with Variational Autoencoder.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the International Conference on Electronics, Information, and Communication, 2022
Proceedings of the International Conference on Electronics, Information, and Communication, 2022
2021
Proceedings of the IEEE Spoken Language Technology Workshop, 2021
LiteTTS: A Lightweight Mel-Spectrogram-Free Text-to-Wave Synthesizer Based on Generative Adversarial Networks.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Parallel Waveform Synthesis Based on Generative Adversarial Networks with Voicing-Aware Conditional Discriminators.
Proceedings of the IEEE International Conference on Acoustics, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
2020
Proceedings of the 22nd IEEE International Workshop on Multimedia Signal Processing, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Parallel Wavegan: A Fast Waveform Generation Model Based on Generative Adversarial Networks with Multi-Resolution Spectrogram.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Improving LPCNET-Based Text-to-Speech with Linear Prediction-Structured Mixture Density Network.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
ExcitGlow: Improving a WaveGlow-based Neural Vocoder with Linear Prediction Analysis.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020
2019
Effective parameter estimation methods for an ExcitNet model in generative text-to-speech systems.
CoRR, 2019
Probability Density Distillation with Generative Adversarial Networks for High-Quality Parallel Waveform Generation.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 27th European Signal Processing Conference, 2019
2018
Speaker-adaptive neural vocoders for statistical parametric speech synthesis systems.
CoRR, 2018
Acoustic Modeling Using Adversarially Trained Variational Recurrent Neural Network for Speech Synthesis.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
A Unified Framework for the Generation of Glottal Signals in Deep Learning-based Parametric Speech Synthesis Systems.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Modeling-By-Generation-Structured Noise Compensation Algorithm for Glottal Vocoding Speech Synthesis System.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
2017
Effective Spectral and Excitation Modeling Techniques for LSTM-RNN-Based Speech Synthesis Systems.
IEEE ACM Trans. Audio Speech Lang. Process., 2017
Perceptual quality and modeling accuracy of excitation parameters in DLSTM-based speech synthesis systems.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017
2016
Improved Time-Frequency Trajectory Excitation Vocoder for DNN-Based Speech Synthesis.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Multi-class learning algorithm for deep neural network-based statistical parametric speech synthesis.
Proceedings of the 24th European Signal Processing Conference, 2016
Area-efficient one-cycle correction scheme for timing errors in flip-flop based pipelines.
Proceedings of the IEEE Asian Solid-State Circuits Conference, 2016
2015
Deep neural network-based statistical parametric speech synthesis system using improved time-frequency trajectory excitation model.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Improved time-frequency trajectory excitation modeling for a statistical parametric speech synthesis system.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Proceedings of the 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2015
2014
Proceedings of the 19th International Conference on Digital Signal Processing, 2014
2013
Speech enhancement for pathological voice using time-frequency trajectory excitation modeling.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013