2024
Building a Taiwanese Mandarin Spoken Language Model: A First Attempt.
CoRR, 2024

REBORN: Reinforcement-Learned Boundary Segmentation with Iterative Training for Unsupervised ASR.
CoRR, 2024

Leave No Knowledge Behind During Knowledge Distillation: Towards Practical and Effective Knowledge Distillation For Code-Switching ASR Using Realistic Data.
Proceedings of the IEEE Spoken Language Technology Workshop, 2024

REBORN: Reinforcement-Learned Boundary Segmentation with Iterative Training for Unsupervised ASR.
Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

2023
Improving Cascaded Unsupervised Speech Translation with Denoising Back-translation.
CoRR, 2023

Introducing Semantics into Speech Encoders.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Introducing Semantics into Speech Encoders.
CoRR, 2022

Improving Generalizability of Distilled Self-Supervised Speech Processing Models Under Distorted Settings.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

2021
Mandarin-English Code-switching Speech Recognition with Self-supervised Speech Representation Models.
CoRR, 2021