Multi-Cue Guided Semi-Supervised Learning Toward Target Speaker Separation in Real Environments.
IEEE ACM Trans. Audio Speech Lang. Process., 2024
Compressing speaker extraction model with ultra-low precision quantization and knowledge distillation.
Neural Networks, 2022
LiMuSE: Lightweight Multi-Modal Speaker Extraction.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022
Audio-Visual Speech Separation with Visual Features Enhanced by Adversarial Training.
Proceedings of the International Joint Conference on Neural Networks, 2021
Towards Modeling Auditory Restoration in Noisy Environments.
Proceedings of the International Joint Conference on Neural Networks, 2021
Online Audio-Visual Speech Separation with Generative Adversarial Training.
Proceedings of the ICCAI '21: 2021 7th International Conference on Computing and Artificial Intelligence, Tianjin China, April 23, 2021
Wase: Learning When to Attend for Speaker Extraction in Cocktail Party Environments.
Proceedings of the IEEE International Conference on Acoustics, 2021
A biologically plausible supervised learning method for spiking neural networks using the symmetric STDP rule.
Neural Networks, 2020
Audio-visual Speech Separation with Adversarially Disentangled Visual Representation.
CoRR, 2020
A Unified Framework for Low-Latency Speaker Extraction in Cocktail Party Environments.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
LISNN: Improving Spiking Neural Networks with Lateral Interactions for Robust Object Recognition.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020