Rohan Kumar Das
Orcid: 0000-0002-1332-3357
According to our database, Rohan Kumar Das authored at least 96 papers between 2012 and 2024.
Bibliography
2024
CoRR, 2024
CoRR, 2024
WildDESED: An LLM-Powered Dataset for Wild Domestic Environment Sound Event Detection System.
CoRR, 2024
FMSG-JLESS Submission for DCASE 2024 Task4 on Sound Event Detection with Heterogeneous Training Dataset and Potentially Missing Labels.
CoRR, 2024
Face-voice Association in Multilingual Environments (FAME) Challenge 2024 Evaluation Plan.
CoRR, 2024
Device Feature based on Graph Fourier Transformation with Logarithmic Processing For Detection of Replay Speech Attacks.
Proceedings of the Odyssey 2024: The Speaker and Language Recognition Workshop, 2024
A Synopsis of FAME 2024 Challenge: Associating Faces with Voices in Multilingual Environments.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
Proceedings of the IEEE International Conference on Acoustics, 2024
2023
IEEE ACM Trans. Audio Speech Lang. Process., 2023
Leveraging Audio-Tagging Assisted Sound Event Detection using Weakified Strong Labels and Frequency Dynamic Convolutions.
Proceedings of the IEEE Statistical Signal Processing Workshop, 2023
A Multi-Task Learning Framework for Sound Event Detection using High-level Acoustic Characteristics of Sounds.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
2022
Neural Acoustic-Phonetic Approach for Speaker Verification With Phonetic Attention Mask.
IEEE Signal Process. Lett., 2022
A Novel Feature Based on Graph Signal Processing for Detection of Physical Access Attacks.
Proceedings of the Odyssey 2022: The Speaker and Language Recognition Workshop, 28 June, 2022
On the Use of Absolute Threshold of Hearing-based Loss for Full-band Speech Enhancement.
Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022
Dynamic Thresholding on FixMatch with Weak and Strong Data Augmentations for Sound Event Detection.
Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
MFA: TDNN with Multi-Scale Frequency-Channel Attention for Text-Independent Speaker Verification with Short Utterances.
Proceedings of the IEEE International Conference on Acoustics, 2022
A Device Classification-Aided Multi-Task Framework for Low-Complexity Acoustic Scene Classification.
Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, 2022
2021
IEEE ACM Trans. Audio Speech Lang. Process., 2021
IEEE Signal Process. Lett., 2021
Enhancing the Intelligibility of Cleft Lip and Palate Speech Using Cycle-Consistent Adversarial Networks.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021
Is Someone Speaking?: Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021
Proceedings of the 12th International Symposium on Chinese Spoken Language Processing, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Significance of Data Augmentation for Improving Cleft Lip and Palate Speech Recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021
2020
IEEE Trans. Inf. Forensics Secur., 2020
Digit. Signal Process., 2020
CoRR, 2020
Black-box Attacks on Automatic Speaker Verification using Feedback-controlled Voice Conversion.
Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020
Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020
Light Convolutional Neural Network with Feature Genuinization for Detection of Synthetic Speech Attacks.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
On the Importance of Vocal Tract Constriction for Speaker Characterization: The Whispered Speech Study.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the International Conference on Asian Language Processing, 2020
Predictions of Subjective Ratings and Spoofing Assessments of Voice Conversion Challenge 2020 Submissions.
Proceedings of the Joint Workshop for the Blizzard Challenge and Voice Conversion Challenge 2020, 2020
Voice Conversion Challenge 2020 -- Intra-lingual semi-parallel and cross-lingual voice conversion --.
Proceedings of the Joint Workshop for the Blizzard Challenge and Voice Conversion Challenge 2020, 2020
Emotion Invariant Speaker Embeddings for Speaker Identification with Emotional Speech.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020
2019
IEEE ACM Trans. Audio Speech Lang. Process., 2019
Low frequency frame-wise normalization over constant-Q transform for playback speech detection.
Digit. Signal Process., 2019
Investigating Text-Independent Speaker Verification Systems Under Varied Data Conditions.
Circuits Syst. Signal Process., 2019
Exploring Text-Constraint Models and Source Information for Long-Enrollment with Short-Test Speaker Verification.
Circuits Syst. Signal Process., 2019
CoRR, 2019
Proceedings of the 22nd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
On the Importance of Audio-Source Separation for Singer Identification in Polyphonic Music.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Multi-Level Adaptive Speech Activity Detector for Speech in Naturalistic Environments.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Cross-lingual Voice Conversion with Bilingual Phonetic Posteriorgram and Average Modeling.
Proceedings of the IEEE International Conference on Acoustics, 2019
A Modularized Neural Network with Language-Specific Output Layers for Cross-Lingual Voice Conversion.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019
Many-to-many Cross-lingual Voice Conversion with a Jointly Trained Speaker Embedding Network.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019
Speaker Clustering with Penalty Distance for Speaker Verification with Multi-Speaker Speech.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019
2018
Significance of duration modification for speaker verification under mismatch speech tempo condition.
Int. J. Speech Technol., 2018
Int. J. Speech Technol., 2018
Proceedings of the 2018 Workshop on Speech, Music and Mind, 2018
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018
Investigating Text-independent Speaker Verification from Practically Realizable System Perspective.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018
2017
J. Signal Process. Syst., 2017
Speech Commun., 2017
Exploring kernel discriminant analysis for speaker verification with limited test data.
Pattern Recognit. Lett., 2017
Proceedings of the Twenty-third National Conference on Communications, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
2016
Proceedings of the 2016 International Conference on Signal Processing and Communications (SPCOM), 2016
Significance of constraining text in limited data text-independent speaker verification.
Proceedings of the 2016 International Conference on Signal Processing and Communications (SPCOM), 2016
Exploring Session Variability and Template Aging in Speaker Verification for Fixed Phrase Short Utterances.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
2015
Proceedings of the Twenty First National Conference on Communications, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
2014
Proceedings of the Twentieth National Conference on Communications, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
2013
Development and evaluation of online text-independent speaker verification system for remote person authentication.
Int. J. Speech Technol., 2013
2012
Int. J. Speech Technol., 2012