Xiaofei Li

Orcid: 0000-0003-0393-9905

Affiliations:
  • Westlake University, Hangzhou, Zhejiang, China
  • Inria Grenoble Rhône-Alpes, Montbonnot-Saint-Martin, France (former)


According to our database1, Xiaofei Li authored at least 50 papers between 2010 and 2024.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Self-Supervised Learning of Spatial Acoustic Representation With Cross-Channel Signal Reconstruction and Multi-Channel Conformer.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

SpatialNet: Extensively Learning Spatial Information for Multichannel Joint Speech Separation, Denoising and Dereverberation.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Self-Supervised Audio Teacher-Student Transformer for Both Clip-Level and Frame-Level Tasks.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Multichannel Long-Term Streaming Neural Speech Enhancement for Static and Moving Speakers.
IEEE Signal Process. Lett., 2024

Diffusion-Based Adversarial Purification for Speaker Verification.
IEEE Signal Process. Lett., 2024

2023
Vision-audio fusion SLAM in dynamic environments.
CAAI Trans. Intell. Technol., December, 2023

2022
Enhancing direct-path relative transfer function using deep neural network for robust sound source localization.
CAAI Trans. Intell. Technol., 2022

SRP-DNN: Learning Direct-Path Phase Difference for Multiple Moving Sound Source Localization.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Learning Deep Direct-Path Relative Transfer Function for Binaural Sound Source Localization.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

AcousticFusion: Fusing Sound Source Localization to Visual SLAM in Dynamic Environments.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021

Supervised Direct-Path Relative Transfer Function Learning for Binaural Sound Source Localization.
Proceedings of the IEEE International Conference on Acoustics, 2021

Fullsubnet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Online Monaural Speech Enhancement Using Delayed Subband LSTM.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

2019
Multiple Sound Source Counting and Localization Based on TF-Wise Spatial Spectrum Clustering.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

Multichannel Online Dereverberation Based on Spectral Magnitude Inverse Filtering.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

Multichannel Speech Separation and Enhancement Using the Convolutive Transfer Function.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

Audio-Noise Power Spectral Density Estimation Using Long Short-Term Memory.
IEEE Signal Process. Lett., 2019

Online Localization and Tracking of Multiple Moving Speakers in Reverberant Environments.
IEEE J. Sel. Top. Signal Process., 2019

Narrow-band Deep Filtering for Multichannel Speech Enhancement.
CoRR, 2019

Expectation-Maximization for Speech Source Separation Using Convolutive Transfer Function.
CoRR, 2019

Expectation-maximisation for speech source separation using convolutive transfer function.
CAAI Trans. Intell. Technol., 2019

Multitask Learning of Time-Frequency CNN for Sound Source Localization.
IEEE Access, 2019

Multichannel Speech Enhancement Based On Time-Frequency Masking Using Subband Long Short-Term Memory.
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019

Audio-Visual Variational Fusion for Multi-Person Tracking with Robots.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

2018
Multichannel Identification and Nonnegative Equalization for Dereverberation and Noise Reduction Based on Convolutive Transfer Function.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

Audio-Visual Speaker Diarization Based on Spatiotemporal Bayesian Fusion.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

A cascaded multiple-speaker localization and tracking system.
CoRR, 2018

Online Localization of Multiple Moving Speakers in Reverberant Environments.
Proceedings of the 10th IEEE Sensor Array and Multichannel Signal Processing Workshop, 2018

Multisource Mint Using Convolutive Transfer Function.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Accounting for Room Acoustics in Audio-Visual Multi-Speaker Tracking.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Binaural Sound Localization Based on Reverberation Weighting and Generalized Parametric Mapping.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Multiple-Speaker Localization Based on Direct-Path Features and Likelihood Maximization With Spatial Sparsity Regularization.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Multichannel Source Separation and Speech Enhancement Using the Convolutive Transfer Function.
CoRR, 2017

Blind MultiChannel Identification and Equalization for Dereverberation and Noise Reduction based on Convolutive Transfer Function.
CoRR, 2017

An em algorithm for audio source separation based on the convolutive transfer function.
Proceedings of the 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2017

Audio source separation based on convolutive transfer function and frequency-domain lasso optimization.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016
A Novel Lip Descriptor for Audio-Visual Keyword Spotting Based on Adaptive Decision Fusion.
IEEE Trans. Multim., 2016

Estimation of the Direct-Path Relative Transfer Function for Supervised Sound-Source Localization.
IEEE ACM Trans. Audio Speech Lang. Process., 2016

Voice activity detection based on statistical likelihood ratio with adaptive thresholding.
Proceedings of the IEEE International Workshop on Acoustic Signal Enhancement, 2016

Reverberant sound localization with a robot head based on direct-path relative transfer function.
Proceedings of the 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2016

Non-stationary noise power spectral density estimation based on regional statistics.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Binaural Sound Source Localization based on Direct-Path Relative Transfer Function.
CoRR, 2015

A Distributed Architecture for Interacting with NAO.
Proceedings of the 2015 ACM on International Conference on Multimodal Interaction, Seattle, WA, USA, November 09, 2015

Estimation of relative transfer function in the presence of stationary noise based on segmental power spectral density matrix subtraction.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Local relative transfer function for sound source localization.
Proceedings of the 23rd European Signal Processing Conference, 2015

2013
Sound Source Localization for HRI Using FOC-Based Time Difference Feature and Spatial Grid Matching.
IEEE Trans. Cybern., 2013

A two-layer probabilistic model based on time-delay compensation for binaural sound localization.
Proceedings of the 2013 IEEE International Conference on Robotics and Automation, 2013

2012
Time Delay Estimation for Speech Signal Based on FOC-Spectrum.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

2011
Sound source localization for mobile robot based on time difference feature and space grid matching.
Proceedings of the 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2011

2010
A selection method of speech vocabulary for human-robot speech interaction.
Proceedings of the IEEE International Conference on Systems, 2010


  Loading...