Xiaodong Li

Orcid: 0000-0002-4170-0076

Affiliations:
  • Chinese Academy of Science, Institute of Acoustics, Beijing, China (PhD 1995)


According to our database1, Xiaodong Li authored at least 85 papers between 2006 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
TaBE: Decoupling spatial and spectral processing with Taylor's unfolding method in the beamspace domain for multi-channel speech enhancement.
Inf. Fusion, January, 2024

Deep Kronecker Product Beamforming for Large-Scale Microphone Arrays.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Geometry Calibration for Deformable Linear Microphone Arrays With Bézier Curve Fitting.
IEEE Signal Process. Lett., 2024

X-TF-GridNet: A time-frequency domain target speaker extraction network with adaptive speaker embedding fusion.
Inf. Fusion, 2024

Frame-wise speech extraction with recursive expectation maximization for partially deformable microphone arrays.
Digit. Signal Process., 2024

Array2BR: An End-to-End Noise-immune Binaural Audio Synthesis from Microphone-array Signals.
CoRR, 2024

All Neural Kronecker Product Beamforming for Speech Extraction with Large-Scale Microphone Arrays.
Proceedings of the IEEE International Conference on Acoustics, 2024

Renet: A Time-Frequency Domain General Speech Restoration Network for Icassp 2024 Speech Signal Improvement Challenge.
Proceedings of the IEEE International Conference on Acoustics, 2024

A Time-Frequency Band-Split Neural Network For Real-Time Full-Band Packet Loss Concealment.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Parallel processing of distributed beamforming and multichannel linear prediction for speech denoising and deverberation in wireless acoustic sensor networks.
EURASIP J. Audio Speech Music. Process., December, 2023

End-to-end neural speaker diarization with an iterative adaptive attractor estimation.
Neural Networks, September, 2023

Modelling individual head-related transfer function (HRTF) based on anthropometric parameters and generic HRTF amplitudes.
CAAI Trans. Intell. Technol., June, 2023

A General Unfolding Speech Enhancement Method Motivated by Taylor's Theorem.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Low-complexity Broadband Beampattern Synthesis using Array Response Control.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

TaylorBeamixer: Learning Taylor-Inspired All-Neural Multi-Channel Speech Enhancement from Beam-Space Dictionary Perspective.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

2022
Filtering and Refining: A Collaborative-Style Framework for Single-Channel Speech Enhancement.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

A Neural Beamspace-Domain Filter for Real-Time Multi-Channel Speech Enhancement.
Symmetry, 2022

Analysis of trade-offs between magnitude and phase estimation in loss functions for speech denoising and dereverberation.
Speech Commun., 2022

Multitarget Tracking Based on Dynamic Bayesian Network With Reparameterized Approximate Variational Inference.
IEEE Internet Things J., 2022

Wideband Multitarget Tracking Based on Dynamic Bayesian Network Learning in an Acoustic Sensor Array Network.
IEEE Internet Things J., 2022

A separation and interaction framework for causal multi-channel speech enhancement.
Digit. Signal Process., 2022

A General Deep Learning Speech Enhancement Framework Motivated by Taylor's Theorem.
CoRR, 2022

TaylorBeamixer: Learning Taylor-Inspired All-Neural Multi-Channel Speech Enhancement from Beam-Space Dictionary Perspective.
CoRR, 2022

MDNet: Learning Monaural Speech Enhancement from Deep Prior Gradient.
CoRR, 2022

Low-latency Monaural Speech Enhancement with Deep Filter-bank Equalizer.
CoRR, 2022

A Neural Beam Filter for Real-time Multi-channel Speech Enhancement.
CoRR, 2022

A deep complex network with multi-frame filtering for stereophonic acoustic echo cancellation.
CoRR, 2022

Fully Automatic Balance between Directivity Factor and White Noise Gain for Large-scale Microphone Arrays in Diffuse Noise Fields.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Bifurcation and Reunion: A Loss-Guided Two-Stage Approach for Monaural Speech Dereverberation.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

TaylorBeamformer: Learning All-Neural Beamformer for Multi-Channel Speech Enhancement from Taylor's Approximation Theory.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

A deep complex multi-frame filtering network for stereophonic acoustic echo cancellation.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Taylor, Can You Hear Me Now? A Taylor-Unfolding Framework for Monaural Speech Enhancement.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Embedding and Beamforming: All-Neural Causal Beamformer for Multichannel Speech Enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Two Heads are Better Than One: A Two-Stage Complex Spectral Mapping Approach for Monaural Speech Enhancement.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Corrigendum to 'Finite data performance analysis of one-bit MVDR and phase-only MVDR' [Signal Processing 183 (2021) Article 108018].
Signal Process., 2021

Finite data performance analysis of one-bit MVDR and phase-only MVDR.
Signal Process., 2021

Distributed node-specific block-diagonal LCMV beamforming in wireless acoustic sensor networks.
Signal Process., 2021

U<sup>2</sup>-VC: one-shot voice conversion using two-level nested U-structure.
EURASIP J. Audio Speech Music. Process., 2021

Low-complexity artificial noise suppression methods for deep learning-based speech enhancement algorithms.
EURASIP J. Audio Speech Music. Process., 2021

EmotionBox: a music-element-driven emotional music generation system using Recurrent Neural Network.
CoRR, 2021

Noise-robust blind reverberation time estimation using noise-aware time-frequency masking.
CoRR, 2021

Glance and Gaze: A Collaborative Learning Framework for Single-channel Speech Enhancement.
CoRR, 2021

Acoustic Echo Cancellation Using Deep Complex Neural Network with Nonlinear Magnitude Compression and Phase Information.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Know Your Enemy, Know Yourself: A Unified Two-Stage Framework for Speech Enhancement.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

A Simultaneous Denoising and Dereverberation Framework with Target Decoupling.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

ICASSP 2021 Acoustic Echo Cancellation Challenge: Integrated Adaptive Echo Cancellation with Time Alignment and Deep Learning-Based Residual Echo Plus Noise Suppression.
Proceedings of the IEEE International Conference on Acoustics, 2021

ICASSP 2021 Deep Noise Suppression Challenge: Decoupling Magnitude and Phase Optimization with a Two-Stage Deep Network.
Proceedings of the IEEE International Conference on Acoustics, 2021

Learning to Inference with Early Exit in the Progressive Speech Enhancement.
Proceedings of the 29th European Signal Processing Conference, 2021

A Robust Maximum Likelihood Distortionless Response Beamformer based on a Complex Generalized Gaussian Distribution.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

2020
Wideband sparse Bayesian learning for off-grid binaural sound source localization.
Signal Process., 2020

Two Heads Are Better Than One: A Two-Stage Approach for Monaural Noise Reduction in the Complex Domain.
CoRR, 2020

Distributed Node-Specific Block-Diagonal LCMV Beamforming in Wireless Acoustic Sensor Networks.
CoRR, 2020

Dynamic Attention Based Generative Adversarial Network with Phase Post-Processing for Speech Enhancement.
CoRR, 2020

The IOA System for Deep Noise Suppression Challenge using a Framework Combining Dynamic Attention and Recursive Learning.
CoRR, 2020

Generalization of Machine Learning for Problem Reduction: A Case Study on Travelling Salesman Problems.
CoRR, 2020

A Time-domain Monaural Speech Enhancement with Recursive Learning.
CoRR, 2020

A Recursive Network with Dynamic Attention for Monaural Speech Enhancement.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

A Time-domain Monaural Speech Enhancement with Feedback Learning.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

2019
A Supervised Speech enhancement Approach with Residual Noise Control for Voice Communication.
CoRR, 2019

Convolutional Recurrent Neural Network Based Progressive Learning for Monaural Speech Enhancement.
CoRR, 2019

2018
Quantized Kalman Filter Tracking in Directional Sensor Networks.
IEEE Trans. Mob. Comput., 2018

Statistical Analysis of the Multichannel Wiener Filter Using a Bivariate Normal Distribution for Sample Covariance Matrices.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

A perceptually motivated LP residual estimator in noisy and reverberant environments.
Speech Commun., 2018

Long-range speech acquirement and enhancement with dual-point laser Doppler vibrometers.
Proceedings of the 23rd IEEE International Conference on Digital Signal Processing, 2018

An efficient and robust speech dereverberation method using spherical microphone array.
Proceedings of the 23rd IEEE International Conference on Digital Signal Processing, 2018

2017
Robust Adaptive Beamforming Using Noise Reduction Preprocessing-Based Fully Automatic Diagonal Loading and Steering Vector Estimation.
IEEE Access, 2017

2016
Analysis of Additional Stable Gain by Frequency Shifting for Acoustic Feedback Suppression using Statistical Room Acoustics.
IEEE Signal Process. Lett., 2016

Statistical analysis and improvement of coherent-to-diffuse power ratio estimators for dereverberation.
Proceedings of the IEEE International Workshop on Acoustic Signal Enhancement, 2016

2015
Binaural coherent-to-diffuse-ratio estimation for dereverberation using an ITD model.
Proceedings of the 23rd European Signal Processing Conference, 2015

An improved wavelet based shock wave detector.
Proceedings of the IEEE China Summit and International Conference on Signal and Information Processing, 2015

Bandwidth extension for speech acquired by laser Doppler vibrometer with an auxiliary microphone.
Proceedings of the 10th International Conference on Information, 2015

2014
On Generalized Auto-Spectral Coherence Function and Its Applications to Signal Detection.
IEEE Signal Process. Lett., 2014

A Constrained MMSE LP Residual Estimator for Speech Dereverberation in Noisy Environments.
IEEE Signal Process. Lett., 2014

Statistical analysis of temporal coherence function and its application in howling detection.
Proceedings of the 19th International Conference on Digital Signal Processing, 2014

2013
A Statistical Analysis of Two-Channel Post-Filter Estimators in Isotropic Noise Fields.
IEEE Trans. Speech Audio Process., 2013

Wideband DOA estimation of frequency sparse sources with one receiver.
Proceedings of the IEEE 10th International Conference on Mobile Ad-Hoc and Sensor Systems, 2013

Wideband DOA estimation based on block FOCUSS with limited samples.
Proceedings of the IEEE Global Conference on Signal and Information Processing, 2013

A modified power-level-difference-based noise reduction for dual-microphone headsets.
Proceedings of the 9th International Conference on Information, 2013

2011
A novel wideband DOA estimator based on Khatri-Rao subspace approach.
Signal Process., 2011

Two-channel post-filtering based on adaptive smoothing and noise properties.
Proceedings of the IEEE International Conference on Acoustics, 2011

Robustness analysis of time-domain and frequency-domain adaptive null-forming schemes.
Proceedings of the 8th International Conference on Information, 2011

2010
Speech enhancement based on the structure of noise power spectral density.
Proceedings of the 18th European Signal Processing Conference, 2010

2009
Acoustical Vehicle Detection Based on Bispectral Entropy.
IEEE Signal Process. Lett., 2009

2008
On the relationship of non-parametric methods for coherence function estimation.
Signal Process., 2008

2006
Feature Extraction Using Histogram Entropies of Euclidean Distances for Vehicle Classification.
Proceedings of the Computational Intelligence and Security, International Conference, 2006


  Loading...