LauraTSE: Target Speaker Extraction using Auto-Regressive Decoder-Only Language Models.
CoRR, April, 2025
Universal Speaker Embedding Free Target Speaker Extraction and Personal Voice Activity Detection.
CoRR, January, 2025
Universal Speaker Embedding Free Target Speaker Extraction and Personal Voice Activity Detection.
Comput. Speech Lang., 2025
TSELM: Target Speaker Extraction using Discrete Tokens and Language Models.
CoRR, 2024
USEF-TSE: Universal Speaker Embedding Free Target Speaker Extraction.
CoRR, 2024
Efficient Personal Voice Activity Detection with Wake Word Reference Speech.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2024
SEF-Net: Speaker Embedding Free Target Speaker Extraction Network.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Robust Audio Anti-spoofing Countermeasure with Joint Training of Front-end and Back-end Models.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Low-complexity Multi-Channel Speaker Extraction with Pure Speech Cues.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023
Simultaneous Speech Extraction for Multiple Target Speakers under the Meeting Scenarios (V1).
CoRR, 2022