2025
LauraTSE: Target Speaker Extraction using Auto-Regressive Decoder-Only Language Models.
CoRR, April, 2025

Universal Speaker Embedding Free Target Speaker Extraction and Personal Voice Activity Detection.
CoRR, January, 2025

Universal Speaker Embedding Free Target Speaker Extraction and Personal Voice Activity Detection.
Comput. Speech Lang., 2025

2024
TSELM: Target Speaker Extraction using Discrete Tokens and Language Models.
CoRR, 2024

USEF-TSE: Universal Speaker Embedding Free Target Speaker Extraction.
CoRR, 2024

Efficient Personal Voice Activity Detection with Wake Word Reference Speech.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2024

2023
SEF-Net: Speaker Embedding Free Target Speaker Extraction Network.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Robust Audio Anti-spoofing Countermeasure with Joint Training of Front-end and Back-end Models.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Low-complexity Multi-Channel Speaker Extraction with Pure Speech Cues.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

2022
Simultaneous Speech Extraction for Multiple Target Speakers under the Meeting Scenarios(V1).
CoRR, 2022