2024

Towards Unsupervised Speech Recognition Without Pronunciation Models.

Junrui Ni

Liming Wang

Yang Zhang

Kaizhi Qian

Heting Gao

Mark Hasegawa-Johnson

Chang D. Yoo

CoRR, 2024

Speech Self-Supervised Learning Using Diffusion Model Synthetic Data.

Mark A. Hasegawa-Johnson

Shiyu Chang

Yang Zhang

Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023

Listen, Decipher and Sign: Toward Unsupervised Speech-to-Sign Language Recognition.

Mark Hasegawa-Johnson

Chang Dong Yoo

Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022

Domain Generalization for Language-Independent Automatic Speech Recognition.

Mark Hasegawa-Johnson

Frontiers in Artificial Intelligence, 2022

Improving Self-Supervised Speech Representations by Disentangling Speakers.

Mark Hasegawa-Johnson

Shiyu Chang

CoRR, 2022

Unsupervised Text-to-Speech Synthesis by Unsupervised Automatic Speech Recognition.

Mark Hasegawa-Johnson

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

WavPrompt: Towards Few-Shot Spoken Language Understanding with Frozen Language Models.

Mark Hasegawa-Johnson

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

ContentVec: An Improved Self-Supervised Speech Representation by Disentangling Speakers.

Mark Hasegawa-Johnson

Shiyu Chang

Proceedings of the International Conference on Machine Learning, 2022

2021

Zero-Shot Cross-Lingual Phonetic Recognition with External Language Embedding.

Mark Hasegawa-Johnson

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30 - September 3, 2021

2019

The Time-Course of Phoneme Category Adaptation in Deep Neural Networks.

Junrui Ni

Mark Hasegawa-Johnson

Odette Scharenborg

Proceedings of the International Conference on Statistical Language and Speech Processing (SLSP), 2019