Anurag Kumar
Affiliations:- Facebook Research, Facebook Reality Labs, Redmond, WA USA
- Indian Institute of Technology, Kanpur, India (former)
According to our database1,
Anurag Kumar
authored at least 74 papers
between 2012 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2024
Neural-Network-Based Direction-of-Arrival Estimation for Reverberant Speech - The Importance of Energetic, Temporal, and Spatial Information.
IEEE ACM Trans. Audio Speech Lang. Process., 2024
Improved direction of arrival estimations with a wearable microphone array for dynamic environments by reliability weighting.
CoRR, 2024
High Fidelity Text-Guided Music Generation and Editing via Single-Stage Flow Matching.
CoRR, 2024
AV-CrossNet: an Audiovisual Complex Spectral Mapping Network for Speech Separation By Leveraging Narrow- and Cross-Band Modeling.
CoRR, 2024
URGENT Challenge: Universality, Robustness, and Generalizability For Speech Enhancement.
CoRR, 2024
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
A Closer Look at Wav2vec2 Embeddings for On-Device Single-Channel Speech Enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2024
Audiovisual Speaker Separation with Full- and Sub-Band Modeling in the Time-Frequency Domain.
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
2023
TorchAudio 2.1: Advancing speech recognition, self-supervised learning, and audio processing components for PyTorch.
CoRR, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Torchaudio-Squim: Reference-Less Speech Quality and Intelligibility Measures in Torchaudio.
Proceedings of the IEEE International Conference on Acoustics, 2023
Leveraging Heteroscedastic Uncertainty in Learning Complex Spectral Mapping for Single-Channel Speech Enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2023
TorchAudio 2.1: Advancing Speech Recognition, Self-Supervised Learning, and Audio Processing Components for Pytorch.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023
2022
RemixIT: Continual Self-Training of Speech Enhancement Models via Bootstrapped Remixing.
IEEE J. Sel. Top. Signal Process., 2022
Direction Of Arrival Estimation For Reverberant Speech Based On Neural Networks And The Direct-Path Dominance Test.
Proceedings of the 17th International Workshop on Acoustic Signal Enhancement, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
TPARN: Triple-Path Attentive Recurrent Network for Time-Domain Multichannel Speech Enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Audio Signal Processing for Telepresence Based on Wearable Array in Noisy and Dynamic Scenes.
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
2021
SAGRNN: Self-Attentive Gated RNN For Binaural Speaker Separation With Interaural Cue Preservation.
IEEE Signal Process. Lett., 2021
CoRR, 2021
TADRN: Triple-Attentive Dual-Recurrent Network for Ad-hoc Array Multichannel Speech Enhancement.
CoRR, 2021
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Do Sound Event Representations Generalize to Other Audio Tasks? A Case Study in Audio Transfer Learning.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Incorporating Real-World Noisy Speech in Neural-Network-Based Speech Enhancement Systems.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021
2020
A Sequential Self Teaching Approach for Improving Generalization in Sound Event Recognition.
Proceedings of the 37th International Conference on Machine Learning, 2020
SeCoST: : Sequential Co-Supervision for Large Scale Weakly Labeled Audio Event Detection.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
2019
CoRR, 2019
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019
2018
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Knowledge Transfer from Weakly Labeled Audio Using Convolutional Neural Network for Sound Events and Scenes.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
2017
CoRR, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Audio event and scene recognition: A unified approach using strongly and weakly labeled data.
Proceedings of the 2017 International Joint Conference on Neural Networks, 2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
Proceedings of the 25th European Signal Processing Conference, 2017
2016
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the IEEE International Conference on Multimedia and Expo, 2016
Experiments on the DCASE Challenge 2016: Acoustic Scene Classification and Sound Event Detection in Real Life Recording.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2016
2015
Proceedings of the 2015 TREC Video Retrieval Evaluation, 2015
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
2014
Proceedings of the 2014 TREC Video Retrieval Evaluation, 2014
Proceedings of the Twentieth National Conference on Communications, 2014
Proceedings of the 22nd European Signal Processing Conference, 2014
2013
Event detection in short duration audio using Gaussian Mixture Model and Random Forest Classifier.
Proceedings of the 21st European Signal Processing Conference, 2013
2012
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012