FireRedASR: Open-Source Industrial-Grade Mandarin Speech Recognition Models from Encoder-Decoder to LLM Integration.
CoRR, January, 2025
FireRedTTS: A Foundation Text-To-Speech Framework for Industry-Level Generative Speech Applications.
CoRR, 2024
Parameter-Efficient Conformers via Sharing Sparsely-Gated Experts for End-to-End Speech Recognition.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Investigating LSTM for punctuation prediction.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016