Seung-Bin Kim

ORCID: 0000-0002-2287-9111

According to our database, Seung-Bin Kim authored at least 10 papers between 2022 and 2025.

Collaborative distances:

Dijkstra number of five.
Erdős number of four.

Bibliography

2025

FLowHigh: Towards Efficient and High-Quality Audio Super-Resolution with Single-Step Flow Matching.

CoRR, January, 2025

JELLY: Joint Emotion Recognition and Context Reasoning with LLMs for Conversational Speech Synthesis.

CoRR, January, 2025

2024

Audio Super-Resolution With Robust Speech Representation Learning of Masked Autoencoder.

IEEE ACM Trans. Audio Speech Lang. Process., 2024

EmoSphere++: Emotion-Controllable Zero-Shot Text-to-Speech via Emotion-Adaptive Spherical Vector.

CoRR, 2024

EmoSphere-TTS: Emotional Style and Intensity Modeling via Spherical Emotion Vector for Controllable Emotional Text-to-Speech.

CoRR, 2024

PromotiCon: Prompt-based Emotion Controllable Text-to-Speech via Prompt Generation and Matching.

Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics, 2024

TranSentence: Speech-to-Speech Translation via Language-Agnostic Sentence-Level Speech Encoding without Language-Parallel Data.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2024

2023

HierSpeech++: Bridging the Gap between Semantic and Acoustic Representation of Speech by Hierarchical Variational Inference for Zero-shot Speech Synthesis.

CoRR, 2023

2022

HierSpeech: Bridging the Gap between Text and Speech by Hierarchical Variational Inference using Self-supervised Representations for Speech Synthesis.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

EMOQ-TTS: Emotion Intensity Quantization for Fine-Grained Controllable Emotional Text-to-Speech.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2022