Yong Xu
Orcid: 0000-0003-4944-6890Affiliations:
- Tencent America LLC, Seattle, USA
- University of Surrey, Centre for Vision, Speech and Signal Processing, Guildford, UK (former)
- University of Science and Technology of China, Hefei, China (PhD 2015)
According to our database1,
Yong Xu
authored at least 90 papers
between 2012 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
-
on github.com
On csauthors.net:
Bibliography
2024
Labelled Non-Zero Diffusion Particle Flow SMC-PHD Filtering for Multi-Speaker Tracking.
IEEE Trans. Multim., 2024
CoRR, 2024
LibriheavyMix: A 20,000-Hour Dataset for Single-Channel Reverberant Multi-Talker Speech Separation, ASR and Speaker Diarization.
CoRR, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the 32nd European Signal Processing Conference, 2024
2023
CoRR, 2023
Zoneformer: On-device Neural Beamformer For In-car Multi-zone Speech Separation, Enhancement and Echo Cancellation.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Neuralecho: Hybrid of Full-Band and Sub-Band Recurrent Neural Network For Acoustic Echo Cancellation and Speech Enhancement.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023
2022
CoRR, 2022
NeuralEcho: A Self-Attentive Recurrent Neural Network For Unified Acoustic Echo Suppression And Speech Enhancement.
CoRR, 2022
EEND-SS: Joint End-to-End Neural Speaker Diarization and Speech Separation for Flexible Number of Speakers.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the 30th European Signal Processing Conference, 2022
2021
IEEE ACM Trans. Audio Speech Lang. Process., 2021
IEEE ACM Trans. Audio Speech Lang. Process., 2021
CoRR, 2021
WPD++: An Improved Neural Beamformer for Simultaneous Speech Separation and Dereverberation.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021
Neural Mask based Multi-channel Convolutional Beamforming for Joint Dereverberation, Echo Cancellation and Denoising.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
TeCANet: Temporal-Contextual Attention Network for Environment-Aware Speech Dereverberation.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Directional ASR: A New Paradigm for E2E Multi-Speaker Speech Recognition with Source Localization.
Proceedings of the IEEE International Conference on Acoustics, 2021
2020
Sound Event Detection of Weakly Labelled Data With CNN-Transformer and Automatic Threshold Optimization.
IEEE ACM Trans. Audio Speech Lang. Process., 2020
Audio-Visual Speech Separation and Dereverberation With a Two-Stage Multimodal Network.
IEEE J. Sel. Top. Signal Process., 2020
IEEE J. Sel. Top. Signal Process., 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Far-Field Location Guided Target Speech Extraction Using End-to-End Speech Recognition Objectives.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
2019
IEEE ACM Trans. Audio Speech Lang. Process., 2019
IEEE ACM Trans. Audio Speech Lang. Process., 2019
Cross-task learning for audio tagging, sound event detection and spatial localization: DCASE 2019 baseline systems.
CoRR, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Neural Spatial Filter: Target Speaker Speech Separation Assisted with Directional Information.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Single-Channel Signal Separation and Deconvolution with Generative Adversarial Networks.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019
Joint Training of Complex Ratio Mask Based Beamformer and Acoustic Model for Noise Robust Asr.
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019
2018
Auxiliary Features from Laser-Doppler Vibrometer Sensor for Deep Neural Network Based Robust Speech Recognition.
J. Signal Process. Syst., 2018
Iterative Deep Neural Networks for Speaker-Independent Binaural Blind Speech Separation.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
A Joint Separation-Classification Model for Sound Event Detection of Weakly Labelled Data.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Large-Scale Weakly Supervised Audio Classification Using Gated Convolutional Neural Network.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Improving Reverberant Speech Separation with Binaural Cues Using Temporal Context and Convolutional Neural Networks.
Proceedings of the Latent Variable Analysis and Signal Separation, 2018
Supporting Audiography: Design of a System for Sentimental Sound Recording, Classification and Playback.
Proceedings of the HCI International 2018, 2018
Proceedings of the 26th European Signal Processing Conference, 2018
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2018
2017
IEEE ACM Trans. Audio Speech Lang. Process., 2017
Pattern Recognit., 2017
Binaural and log-power spectra features with deep neural networks for speech-noise separation.
Proceedings of the 19th IEEE International Workshop on Multimedia Signal Processing, 2017
Attention and Localization Based on a Deep Convolutional Recurrent Model for Weakly Supervised Audio Tagging.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Convolutional gated recurrent neural network incorporating spatial features for audio tagging.
Proceedings of the 2017 International Joint Conference on Neural Networks, 2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Joint detection and classification convolutional neural network on weakly labelled bird audio detection.
Proceedings of the 25th European Signal Processing Conference, 2017
2016
Joint training of DNNs by incorporating an explicit dereverberation structure for distant speech recognition.
EURASIP J. Adv. Signal Process., 2016
Fully Deep Neural Networks Incorporating Unsupervised Feature Learning for Audio Tagging.
CoRR, 2016
Deep neural network for robust speech recognition with auxiliary features from laser-Doppler vibrometer sensor.
Proceedings of the 10th International Symposium on Chinese Spoken Language Processing, 2016
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2016
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2016
2015
IEEE ACM Trans. Audio Speech Lang. Process., 2015
Multi-objective learning and mask-based post-processing for deep neural network based speech enhancement.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
DNN-based speech bandwidth expansion and its application to adding high-frequency missing features for automatic speech recognition of narrowband speech.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the Latent Variable Analysis and Signal Separation, 2015
2014
IEEE Signal Process. Lett., 2014
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014
Speech separation based on improved deep neural networks with dual outputs of speech features for both target and interfering speakers.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Global variance equalization for improving deep neural network based speech enhancement.
Proceedings of the IEEE China Summit & International Conference on Signal and Information Processing, 2014
2012
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012