FunAudioLLM: Voice Understanding and Generation Foundation Models for Natural Interaction Between Humans and LLMs.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2024
FunASR: A Fundamental End-to-End Speech Recognition Toolkit.
,
,
,
,
,
,
,
,
,
,
CoRR, 2023
Hybrid CTC-Attention based End-to-End Speech Recognition using Subword Units.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018