Seyun Um

Orcid: 0000-0002-2229-6741

Affiliations:
  • Yonsei University, Seoul, South Korea


According to our database, Seyun Um authored at least 15 papers between 2020 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of five.

Bibliography

2024
SpoofCeleb: Speech Deepfake Detection and SASV In The Wild.
CoRR, 2024

Text-To-Speech Synthesis In The Wild.
CoRR, 2024

SC-ERM: Speaker-Centric Learning for Speech Emotion Recognition.
Proceedings of the International Conference on Electronics, Information, and Communication, 2024

2023
SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems.
IEEE Signal Process. Lett., 2023

Adversarial Learning of Intermediate Acoustic Feature for End-to-End Lightweight Text-to-Speech.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Facetron: A Multi-Speaker Face-to-Speech Model Based on Cross-Modal Latent Representations.
Proceedings of the 31st European Signal Processing Conference, 2023

Consideration of Varying Training Lengths for Short-Duration Speaker Verification.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

2022
ReCAB-VAE: Gumbel-Softmax Variational Inference Based on Analytic Divergence.
CoRR, 2022

AILTTS: Adversarial Learning of Intermediate Acoustic Feature for End-to-End Lightweight Text-to-Speech.
CoRR, 2022

Length-Normalized Representation Learning for Speech Signals.
IEEE Access, 2022

FluentTTS: Text-dependent Fine-grained Style Control for Multi-style TTS.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Light-Weight Speaker Verification with Global Context Information.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

2021
Facetron: Multi-speaker Face-to-Speech Model based on Cross-modal Latent Representations.
CoRR, 2021

LiteTTS: A Lightweight Mel-Spectrogram-Free Text-to-Wave Synthesizer Based on Generative Adversarial Networks.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

2020
Emotional Speech Synthesis with Rich and Granularized Control.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

