2025

LauraTSE: Target Speaker Extraction using Auto-Regressive Decoder-Only Language Models.

[DOI]

Beilong Tang

Bang Zeng

Ming Li

CoRR, April, 2025

Universal Speaker Embedding Free Target Speaker Extraction and Personal Voice Activity Detection.

[DOI]

Bang Zeng

Ming Li

CoRR, January, 2025

Universal Speaker Embedding Free Target Speaker Extraction and Personal Voice Activity Detection.

[DOI]

Bang Zeng

Ming Li

Comput. Speech Lang., 2025

2024

TSELM: Target Speaker Extraction using Discrete Tokens and Language Models.

[DOI]

Beilong Tang

Bang Zeng

Ming Li

CoRR, 2024

USEF-TSE: Universal Speaker Embedding Free Target Speaker Extraction.

[DOI]

Bang Zeng

Ming Li

CoRR, 2024

Efficient Personal Voice Activity Detection with Wake Word Reference Speech.

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2024

2023

SEF-Net: Speaker Embedding Free Target Speaker Extraction Network.

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Robust Audio Anti-spoofing Countermeasure with Joint Training of Front-end and Back-end Models.

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Low-complexity Multi-Channel Speaker Extraction with Pure Speech Cues.

[DOI]

Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

2022

Simultaneous Speech Extraction for Multiple Target Speakers under the Meeting Scenarios(V1).

[DOI]

CoRR, 2022