Xiaofei Li

IEEE ACM Trans. Audio Speech Lang. Process., 2024

IPDnet: A Universal Direct-Path IPD Estimation Network for Sound Source Localization.

Yabo Wang

IEEE ACM Trans. Audio Speech Lang. Process., 2024

SpatialNet: Extensively Learning Spatial Information for Multichannel Joint Speech Separation, Denoising and Dereverberation.

Changsheng Quan

IEEE ACM Trans. Audio Speech Lang. Process., 2024

Self-Supervised Audio Teacher-Student Transformer for Both Clip-Level and Frame-Level Tasks.

Xian Li

Nian Shao

IEEE ACM Trans. Audio Speech Lang. Process., 2024

Multichannel Long-Term Streaming Neural Speech Enhancement for Static and Moving Speakers.

Changsheng Quan

IEEE Signal Process. Lett., 2024

Diffusion-Based Adversarial Purification for Speaker Verification.

Yibo Bai

Xuelong Li

IEEE Signal Process. Lett., 2024

2023

Vision-audio fusion SLAM in dynamic environments.

Tianwei Zhang

Huayan Zhang

CAAI Trans. Intell. Technol., December, 2023

2022

Enhancing direct-path relative transfer function using deep neural network for robust sound source localization.

CAAI Trans. Intell. Technol., 2022

SRP-DNN: Learning Direct-Path Phase Difference for Multiple Moving Sound Source Localization.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2022

2021

Learning Deep Direct-Path Relative Transfer Function for Binaural Sound Source Localization.

IEEE ACM Trans. Audio Speech Lang. Process., 2021

AcousticFusion: Fusing Sound Source Localization to Visual SLAM in Dynamic Environments.

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021

Supervised Direct-Path Relative Transfer Function Learning for Binaural Sound Source Localization.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2021

FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2021

2020

Online Monaural Speech Enhancement Using Delayed Subband LSTM.

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

2019

Multiple Sound Source Counting and Localization Based on TF-Wise Spatial Spectrum Clustering.

IEEE ACM Trans. Audio Speech Lang. Process., 2019

Multichannel Online Dereverberation Based on Spectral Magnitude Inverse Filtering.

IEEE ACM Trans. Audio Speech Lang. Process., 2019

Multichannel Speech Separation and Enhancement Using the Convolutive Transfer Function.

IEEE ACM Trans. Audio Speech Lang. Process., 2019

Audio-Noise Power Spectral Density Estimation Using Long Short-Term Memory.

IEEE Signal Process. Lett., 2019

Online Localization and Tracking of Multiple Moving Speakers in Reverberant Environments.

Yutong Ban

IEEE J. Sel. Top. Signal Process., 2019

Narrow-band Deep Filtering for Multichannel Speech Enhancement.

CoRR, 2019

Expectation-Maximization for Speech Source Separation Using Convolutive Transfer Function.

CoRR, 2019

Expectation-maximisation for speech source separation using convolutive transfer function.

CAAI Trans. Intell. Technol., 2019

Multitask Learning of Time-Frequency CNN for Sound Source Localization.

Cheng Pang

IEEE Access, 2019

Multichannel Speech Enhancement Based On Time-Frequency Masking Using Subband Long Short-Term Memory.

Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019

Audio-Visual Variational Fusion for Multi-Person Tracking with Robots.

Proceedings of the 27th ACM International Conference on Multimedia, 2019

2018

Multichannel Identification and Nonnegative Equalization for Dereverberation and Noise Reduction Based on Convolutive Transfer Function.

IEEE ACM Trans. Audio Speech Lang. Process., 2018

Audio-Visual Speaker Diarization Based on Spatiotemporal Bayesian Fusion.

IEEE Trans. Pattern Anal. Mach. Intell., 2018

A cascaded multiple-speaker localization and tracking system.

Yutong Ban

CoRR, 2018

Online Localization of Multiple Moving Speakers in Reverberant Environments.

Proceedings of the 10th IEEE Sensor Array and Multichannel Signal Processing Workshop, 2018

Multisource MINT Using Convolutive Transfer Function.

Proceedings of the 2018 IEEE International Conference on Acoustics, Speech and Signal Processing, 2018

Accounting for Room Acoustics in Audio-Visual Multi-Speaker Tracking.

Yutong Ban

Proceedings of the 2018 IEEE International Conference on Acoustics, Speech and Signal Processing, 2018

2017

Binaural Sound Localization Based on Reverberation Weighting and Generalized Parametric Mapping.

IEEE ACM Trans. Audio Speech Lang. Process., 2017

Multiple-Speaker Localization Based on Direct-Path Features and Likelihood Maximization With Spatial Sparsity Regularization.

IEEE ACM Trans. Audio Speech Lang. Process., 2017

Multichannel Source Separation and Speech Enhancement Using the Convolutive Transfer Function.

CoRR, 2017

Blind MultiChannel Identification and Equalization for Dereverberation and Noise Reduction based on Convolutive Transfer Function.

Sharon Gannot

CoRR, 2017

An EM algorithm for audio source separation based on the convolutive transfer function.

Proceedings of the 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2017

Audio source separation based on convolutive transfer function and frequency-domain lasso optimization.

Proceedings of the 2017 IEEE International Conference on Acoustics, Speech and Signal Processing, 2017

2016

A Novel Lip Descriptor for Audio-Visual Keyword Spotting Based on Adaptive Decision Fusion.

IEEE Trans. Multim., 2016

Estimation of the Direct-Path Relative Transfer Function for Supervised Sound-Source Localization.

IEEE ACM Trans. Audio Speech Lang. Process., 2016

Voice activity detection based on statistical likelihood ratio with adaptive thresholding.

Proceedings of the IEEE International Workshop on Acoustic Signal Enhancement, 2016

Reverberant sound localization with a robot head based on direct-path relative transfer function.

Proceedings of the 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2016

Non-stationary noise power spectral density estimation based on regional statistics.

Proceedings of the 2016 IEEE International Conference on Acoustics, Speech and Signal Processing, 2016

2015

Binaural Sound Source Localization based on Direct-Path Relative Transfer Function.

CoRR, 2015

A Distributed Architecture for Interacting with NAO.

Georgios D. Evangelidis

Proceedings of the 2015 ACM on International Conference on Multimodal Interaction, Seattle, WA, USA, November 09, 2015

Estimation of relative transfer function in the presence of stationary noise based on segmental power spectral density matrix subtraction.

Proceedings of the 2015 IEEE International Conference on Acoustics, Speech and Signal Processing, 2015

Local relative transfer function for sound source localization.

Proceedings of the 23rd European Signal Processing Conference, 2015

2013

Sound Source Localization for HRI Using FOC-Based Time Difference Feature and Spatial Grid Matching.

IEEE Trans. Cybern., 2013

A two-layer probabilistic model based on time-delay compensation for binaural sound localization.

Zhuo Fu

Proceedings of the 2013 IEEE International Conference on Robotics and Automation, 2013

2012

Time Delay Estimation for Speech Signal Based on FOC-Spectrum.

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

2011

Sound source localization for mobile robot based on time difference feature and space grid matching.

Xuesong Yang

Proceedings of the 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2011

2010

A selection method of speech vocabulary for human-robot speech interaction.
