Seung-Bin Kim
Orcid: 0000-0002-2287-9111
According to our database1,
Seung-Bin Kim
authored at least 10 papers
between 2022 and 2025.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2025
FLowHigh: Towards Efficient and High-Quality Audio Super-Resolution with Single-Step Flow Matching.
CoRR, January, 2025
JELLY: Joint Emotion Recognition and Context Reasoning with LLMs for Conversational Speech Synthesis.
CoRR, January, 2025
2024
Audio Super-Resolution With Robust Speech Representation Learning of Masked Autoencoder.
IEEE ACM Trans. Audio Speech Lang. Process., 2024
EmoSphere++: Emotion-Controllable Zero-Shot Text-to-Speech via Emotion-Adaptive Spherical Vector.
CoRR, 2024
EmoSphere-TTS: Emotional Style and Intensity Modeling via Spherical Emotion Vector for Controllable Emotional Text-to-Speech.
CoRR, 2024
PromotiCon: Prompt-based Emotion Controllable Text-to-Speech via Prompt Generation and Matching.
Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics, 2024
TranSentence: speech-to-speech Translation via Language-Agnostic Sentence-Level Speech Encoding without Language-Parallel Data.
Proceedings of the IEEE International Conference on Acoustics, 2024
2023
HierSpeech++: Bridging the Gap between Semantic and Acoustic Representation of Speech by Hierarchical Variational Inference for Zero-shot Speech Synthesis.
CoRR, 2023
2022
HierSpeech: Bridging the Gap between Text and Speech by Hierarchical Variational Inference using Self-supervised Representations for Speech Synthesis.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
EMOQ-TTS: Emotion Intensity Quantization for Fine-Grained Controllable Emotional Text-to-Speech.
Proceedings of the IEEE International Conference on Acoustics, 2022