CodecFake-Omni: A Large-Scale Codec-based Deepfake Speech Dataset.
,
,
,
,
,
,
,
,
,
,
CoRR, January, 2025
REBORN: Reinforcement-Learned Boundary Segmentation with Iterative Training for Unsupervised ASR.
CoRR, 2024
REBORN: Reinforcement-Learned Boundary Segmentation with Iterative Training for Unsupervised ASR.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
CodecFake: Enhancing Anti-Spoofing Models Against Deepfake Audios from Codec-Based Speech Synthesis Systems.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024
AV-SUPERB: A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the IEEE International Conference on Acoustics, 2024
AV-SUPERB: A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2023
Cascading and Direct Approaches to Unsupervised Constituency Parsing on Spoken Sentences.
Proceedings of the IEEE International Conference on Acoustics, 2023
On the Utility of Self-Supervised Models for Prosody-Related Tasks.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022