Xiao-Lei Zhang
Orcid: 0000-0001-7694-193XAffiliations:
- Northwestern Polytechnical University, Center for Intelligent Acoustics and Immersive Communications, CIAIC, School of Marine Science and Technology, China
- Tsinghua University, Department of Electronic Engineering, Beijing, China
- Ohio State University, Department of Computer Science and Engineering, Columbus, OH, USA (2013-2014)
- Tsinghua University, Department of Information and Communication Engineering, Beijing, China (PhD 2012)
According to our database1,
Xiao-Lei Zhang
authored at least 87 papers
between 2010 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2024
Multi-Resolution Convolutional Residual Neural Networks for Monaural Speech Dereverberation.
IEEE ACM Trans. Audio Speech Lang. Process., 2024
IEEE ACM Trans. Audio Speech Lang. Process., 2024
Learning Multi-Dimensional Speaker Localization: Axis Partitioning, Unbiased Label Distribution, and Data Augmentation.
IEEE ACM Trans. Audio Speech Lang. Process., 2024
IEEE Signal Process. Lett., 2024
Exploiting A Quantum Multiple Kernel Learning Approach For Low-Resource Spoken Command Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2024
2023
End-to-End Multi-Modal Speech Recognition on an Air and Bone Conducted Speech Corpus.
IEEE ACM Trans. Audio Speech Lang. Process., 2023
LMD: A Learnable Mask Network to Detect Adversarial Examples for Speaker Verification.
IEEE ACM Trans. Audio Speech Lang. Process., 2023
IEEE Signal Process. Lett., 2023
Spatial-temporal Graph Based Multi-channel Speaker Verification With Ad-hoc Microphone Arrays.
CoRR, 2023
Branch-ECAPA-TDNN: A Parallel Branch Architecture to Capture Local and Global Features for Speaker Verification.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Optimizing Quantum Federated Learning Based on Federated Quantum Natural Gradient Descent.
Proceedings of the IEEE International Conference on Acoustics, 2023
Fast-U2++: Fast and Accurate End-to-End Speech Recognition in Joint CTC/Attention Frames.
Proceedings of the IEEE International Conference on Acoustics, 2023
2022
End-to-End Speaker Verification via Curriculum Bipartite Ranking Weighted Binary Cross-Entropy.
IEEE ACM Trans. Audio Speech Lang. Process., 2022
Deep ad-hoc beamforming based on speaker extraction for target-dependent speech separation.
Speech Commun., 2022
EURASIP J. Audio Speech Music. Process., 2022
Deep Learning Based Two-dimensional Speaker Localization With Large Ad-hoc Microphone Arrays.
CoRR, 2022
CoRR, 2022
Comput. Biol. Medicine, 2022
Multi-class AUC Optimization for Robust Small-footprint Keyword Spotting with Limited Training Data.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Multi-Channel Far-Field Speaker Verification with Large-Scale Ad-hoc Microphone Arrays.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
2021
Minimum-Volume Multichannel Nonnegative Matrix Factorization for Blind Audio Source Separation.
IEEE ACM Trans. Audio Speech Lang. Process., 2021
Frame-level multi-channel speaker verification with large-scale ad-hoc microphone arrays.
CoRR, 2021
AUC Optimization for Robust Small-footprint Keyword Spotting with Limited Training Data.
CoRR, 2021
Minimum-volume Multichannel Nonnegative matrix factorization for blind source separation.
CoRR, 2021
Scaling Sparsemax Based Channel Selection for Speech Recognition with ad-hoc Microphone Arrays.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Transformer-Based End-to-End Speech Recognition with Local Dense Synthesizer Attention.
Proceedings of the IEEE International Conference on Acoustics, 2021
Speech Enhancement Aided End-To-End Multi-Task Learning for Voice Activity Detection.
Proceedings of the IEEE International Conference on Acoustics, 2021
A comparison of handcrafted, parameterized, and learnable features for speech separation.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021
2020
Speaker Verification by Partial AUC Optimization With Mahalanobis Distance Metric Learning.
IEEE ACM Trans. Audio Speech Lang. Process., 2020
Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020
Depthwise Separable Convolutional ResNet with Squeeze-and-Excitation Blocks for Small-Footprint Keyword Spotting.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 25th International Conference on Pattern Recognition, 2020
Partial AUC Optimization Based Deep Speaker Embeddings with Class-Center Learning for Text-Independent Speaker Verification.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Multi-channel Speech Separation Using Deep Embedding With Multilayer Bootstrap Networks.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020
2019
IEEE ACM Trans. Audio Speech Lang. Process., 2019
Multi-channel Speech Separation Using Deep Embedding Model with Multilayer Bootstrap Networks.
CoRR, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Boosting Spatial Information for Deep Learning Based Multichannel Speaker-Independent Speech Separation In Reverberant Environments.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019
Domain Adaptation Neural Network for Acoustic Scene Classification in Mismatched Conditions.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019
2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018
2017
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017
2016
IEEE ACM Trans. Audio Speech Lang. Process., 2016
Boosting Contextual Information for Deep Neural Network Based Voice Activity Detection.
IEEE ACM Trans. Audio Speech Lang. Process., 2016
Universal Background Sparse Coding and Multilayer Bootstrap Network for Speaker Clustering.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
2015
Heuristic Ternary Error-Correcting Output Codes Via Weight Optimization and Layered Clustering-Based Approach.
IEEE Trans. Cybern., 2015
IEEE Trans. Pattern Anal. Mach. Intell., 2015
Universal Background Sparse Coding and Multilayer Bootstrap Network for Speaker Recognition.
CoRR, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
2014
Boosted deep neural networks and multi-resolution cochleagram features for voice activity detection.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Unsupervised domain adaptation for deep neural network based voice activity detection.
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the Sixth Asian Conference on Machine Learning, 2014
2013
IEEE Trans. Speech Audio Process., 2013
Heuristic Ternary Error-Correcting Output Codes Via Weight Optimization and Layered Clustering-Based Approach
CoRR, 2013
Transfer Learning for Voice Activity Detection: A Denoising Deep Neural Network Perspective
CoRR, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
2012
IEEE Trans. Syst. Man Cybern. Part B, 2012
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
2011
IEEE Signal Process. Lett., 2011
Maximum Margin Clustering Based Statistical VAD With Multiple Observation Compound Feature.
IEEE Signal Process. Lett., 2011
An efficient voice activity detection algorithm by combining statistical model and energy detection.
EURASIP J. Adv. Signal Process., 2011
2010
A new VAD framework using statistical model and human knowledge based empirical rule.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010