Real-Time Stereo Speech Enhancement with Spatial-Cue Preservation Based on Dual-Path Structure.
Proceedings of the IEEE International Conference on Acoustics, 2024
Semi-supervised Time Domain Target Speaker Extraction with Attention.
CoRR, 2022
To Dereverb Or Not to Dereverb? Perceptual Studies On Real-Time Dereverberation Targets.
CoRR, 2022
End-to-end LPCNet: A Neural Vocoder With Fully-Differentiable LPC Estimation.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Improved Singing Voice Separation with Chromagram-Based Pitch-Aware Remixing.
Proceedings of the IEEE International Conference on Acoustics, 2022
Neural Speech Synthesis on a Shoestring: Improving the Efficiency of Lpcnet.
Proceedings of the IEEE International Conference on Acoustics, 2022
Personalized PercepNet: Real-Time, Low-Complexity Target Voice Separation and Enhancement.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Semi-Supervised Singing Voice Separation With Noisy Self-Training.
Proceedings of the IEEE International Conference on Acoustics, 2021
Low-Complexity, Real-Time Joint Neural Echo Control and Speech Enhancement Based On Percepnet.
Proceedings of the IEEE International Conference on Acoustics, 2021
Enhancing into the Codec: Noise Robust Speech Coding with Vector-Quantized Autoencoders.
Proceedings of the IEEE International Conference on Acoustics, 2021
From Speech-to-Speech Translation to Automatic Dubbing.
CoRR, 2020
From Speech-to-Speech Translation to Automatic Dubbing.
Proceedings of the 17th International Conference on Spoken Language Translation, 2020
Generating Music with a Self-Correcting Non-Chronological Autoregressive Model.
Proceedings of the 21th International Society for Music Information Retrieval Conference, 2020
A Perceptually-Motivated Approach for Low-Complexity, Real-Time Enhancement of Fullband Speech.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
PoCoNet: Better Speech Enhancement with Frequency-Positional Embeddings, Semi-Supervised Conversational Data, and Biased Loss.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Channel-Attention Dense U-Net for Multichannel Speech Enhancement.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Efficient Trainable Front-Ends for Neural Speech Enhancement.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Self-Supervised Classification for Detecting Anomalous Sounds.
Proceedings of 5th the Workshop on Detection and Classification of Acoustic Scenes and Events 2020 (DCASE 2020), 2020
Group Masked Autoencoder Based Density Estimator for Audio Anomaly Detection.
Proceedings of 5th the Workshop on Detection and Classification of Acoustic Scenes and Events 2020 (DCASE 2020), 2020
Attention Wave-U-Net for Speech Enhancement.
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019