Early Joint Learning of Emotion Information Makes MultiModal Model Understand You Better.
CoRR, 2024
Early Joint Learning of Emotion Information Makes MultiModal Model Understand You Better.
Proceedings of the 2nd International Workshop on Multimodal and Responsible Affective Computing, 2024
ARVC: An Auto-Regressive Voice Conversion System Without Parallel Training Data.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Residual Convolutional CTC Networks for Automatic Speech Recognition.
CoRR, 2017