Min-Jae Hwang

Orcid: 0000-0002-7376-009X

According to our database1, Min-Jae Hwang authored at least 23 papers between 2018 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Characterizing and Efficiently Accelerating Multimodal Generation Model Inference.
CoRR, 2024

Textless Acoustic Model with Self-Supervised Distillation for Noise-Robust Expressive Speech-to-Speech Translation.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
Seamless: Multilingual Expressive and Streaming Speech Translation.
CoRR, 2023

SeamlessM4T-Massively Multilingual & Multimodal Machine Translation.
CoRR, 2023

2022
HierSpeech: Bridging the Gap between Text and Speech by Hierarchical Variational Inference using Self-supervised Representations for Speech Synthesis.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Language Model-Based Emotion Prediction Methods for Emotional Speech Synthesis Systems.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

TTS-by-TTS 2: Data-Selective Augmentation for Neural Speech Synthesis Using Ranking Support Vector Machine with Variational Autoencoder.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Effective Data Augmentation Methods for Neural Text-to-Speech Systems.
Proceedings of the International Conference on Electronics, Information, and Communication, 2022

Linear Prediction-based Parallel WaveGAN Speech Synthesis.
Proceedings of the International Conference on Electronics, Information, and Communication, 2022

2021
Improved Parallel Wavegan Vocoder with Perceptually Weighted Spectrogram Loss.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

LiteTTS: A Lightweight Mel-Spectrogram-Free Text-to-Wave Synthesizer Based on Generative Adversarial Networks.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

High-Fidelity Parallel WaveGAN with Multi-Band Harmonic-Plus-Noise Model.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Parallel Waveform Synthesis Based on Generative Adversarial Networks with Voicing-Aware Conditional Discriminators.
Proceedings of the IEEE International Conference on Acoustics, 2021

TTS-by-TTS: TTS-Driven Data Augmentation for Fast and High-Quality Speech Synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Neural Text-to-Speech with a Modeling-by-Generation Excitation Vocoder.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Improving LPCNET-Based Text-to-Speech with Linear Prediction-Structured Mixture Density Network.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

ExcitGlow: Improving a WaveGlow-based Neural Vocoder with Linear Prediction Analysis.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

LP-WaveNet: Linear Prediction-based WaveNet Speech Synthesis.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

2019
Parameter Enhancement for MELP Speech Codec in Noisy Communication Environment.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

2018
SVD-Based Adaptive QIM Watermarking on Stereo Audio Signals.
IEEE Trans. Multim., 2018

LP-WaveNet: Linear Prediction-based WaveNet Speech Synthesis.
CoRR, 2018

A Unified Framework for the Generation of Glottal Signals in Deep Learning-based Parametric Speech Synthesis Systems.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Modeling-By-Generation-Structured Noise Compensation Algorithm for Glottal Vocoding Speech Synthesis System.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018


  Loading...