Towards Unsupervised Speech Recognition Without Pronunciation Models.
CoRR, 2024
Speech Self-Supervised Learning Using Diffusion Model Synthetic Data.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Listen, Decipher and Sign: Toward Unsupervised Speech-to-Sign Language Recognition.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
Domain Generalization for Language-Independent Automatic Speech Recognition.
Frontiers Artif. Intell., 2022
Improving Self-Supervised Speech Representations by Disentangling Speakers.
CoRR, 2022
Unsupervised Text-to-Speech Synthesis by Unsupervised Automatic Speech Recognition.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
WavPrompt: Towards Few-Shot Spoken Language Understanding with Frozen Language Models.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
ContentVec: An Improved Self-Supervised Speech Representation by Disentangling Speakers.
Proceedings of the International Conference on Machine Learning, 2022
Zero-Shot Cross-Lingual Phonetic Recognition with External Language Embedding.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
The Time-Course of Phoneme Category Adaptation in Deep Neural Networks.
Proceedings of the Statistical Language and Speech Processing, 2019