Blind Estimation of Sub-band Acoustic Parameters from Ambisonics Recordings using Spectro-Spatial Covariance Features.
CoRR, 2024
Binaural Selective Attention Model for Target Speaker Extraction.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024
Speaking in Wavelet Domain: A Simple and Efficient Approach to Speed up Speech Diffusion Model.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
What is Learnt by the LEArnable Front-end (LEAF)? Adapting Per-Channel Energy Normalisation (PCEN) to Noisy Conditions.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023