2025

CodecFake-Omni: A Large-Scale Codec-based Deepfake Speech Dataset.

[DOI]

Jiawei Du

Xuanjun Chen

CoRR, January, 2025

2024

REBORN: Reinforcement-Learned Boundary Segmentation with Iterative Training for Unsupervised ASR.

[DOI]

Liang-Hsuan Tseng

En-Pei Hu

David Cheng-Han Chiang

CoRR, 2024

REBORN: Reinforcement-Learned Boundary Segmentation with Iterative Training for Unsupervised ASR.

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

CodecFake: Enhancing Anti-Spoofing Models Against Deepfake Audios from Codec-Based Speech Synthesis Systems.

[DOI]

Haibin Wu

Yuan Tseng

Hung-yi Lee

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

AV-SUPERB: A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models.

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

2023

AV-SUPERB: A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models.

[DOI]

CoRR, 2023

Cascading and Direct Approaches to Unsupervised Constituency Parsing on Spoken Sentences.

[DOI]

Yuan Tseng

Cheng-I Jeff Lai

Hung-Yi Lee

Proceedings of the IEEE International Conference on Acoustics, 2023

2022

On the Utility of Self-Supervised Models for Prosody-Related Tasks.

[DOI]

Proceedings of the IEEE Spoken Language Technology Workshop, 2022