Byoung Jin Choi

Orcid: 0000-0003-1319-8215

According to our database, Byoung Jin Choi authored at least 16 papers between 2018 and 2024.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2024
Transfer Learning for Low-Resource, Multi-Lingual, and Zero-Shot Multi-Speaker Text-to-Speech.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Variable-Length Speaker Conditioning in Flow-Based Text-to-Speech.
IEEE Signal Process. Lett., 2024

MakeSinger: A Semi-Supervised Training Method for Data-Efficient Singing Voice Synthesis via Classifier-free Diffusion Guidance.
CoRR, 2024

Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction.
CoRR, 2024

2023
Transduce and Speak: Neural Transducer for Text-To-Speech with Semantic Token Prediction.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
SNAC: Speaker-Normalized Affine Coupling Layer in Flow-Based Architecture for Zero-Shot Multi-Speaker Text-to-Speech.
IEEE Signal Process. Lett., 2022

A Controllable Multi-Lingual Multi-Speaker Multi-Style Text-to-Speech Synthesis With Multivariate Information Minimization.
IEEE Signal Process. Lett., 2022

Adversarial Speaker-Consistency Learning Using Untranscribed Speech Data for Zero-Shot Multi-Speaker Text-to-Speech.
CoRR, 2022

Transfer Learning Framework for Low-Resource Text-to-Speech using a Large-Scale Unlabeled Speech Corpus.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

2021
Expressive Text-to-Speech Using Style Tag.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Diff-TTS: A Denoising Diffusion Model for Text-to-Speech.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

2020
Memory Attention: Robust Alignment Using Gating Mechanism for End-to-End Speech Synthesis.
IEEE Signal Process. Lett., 2020

Convex feasibility problems on uniformly convex metric spaces.
Optim. Methods Softw., 2020

WaveNODE: A Continuous Normalizing Flow for Speech Synthesis.
CoRR, 2020

Reformer-TTS: Neural Speech Synthesis with Reformer Network.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

2018
Acoustic Modeling Using Adversarially Trained Variational Recurrent Neural Network for Speech Synthesis.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

