Rohan Kumar Das

Orcid: 0000-0002-1332-3357

According to our database, Rohan Kumar Das authored at least 96 papers between 2012 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Exploring Text-Queried Sound Event Detection with Audio Source Separation.
CoRR, 2024

TF-Mamba: A Time-Frequency Network for Sound Source Localization.
CoRR, 2024

Configurable DOA Estimation using Incremental Learning.
CoRR, 2024

UCIL: An Unsupervised Class Incremental Learning Approach for Sound Event Detection.
CoRR, 2024

WildDESED: An LLM-Powered Dataset for Wild Domestic Environment Sound Event Detection System.
CoRR, 2024

FMSG-JLESS Submission for DCASE 2024 Task4 on Sound Event Detection with Heterogeneous Training Dataset and Potentially Missing Labels.
CoRR, 2024

How Do Neural Spoofing Countermeasures Detect Partially Spoofed Audio?
CoRR, 2024

Face-voice Association in Multilingual Environments (FAME) Challenge 2024 Evaluation Plan.
CoRR, 2024

Device Feature based on Graph Fourier Transformation with Logarithmic Processing For Detection of Replay Speech Attacks.
Proceedings of the Odyssey 2024: The Speaker and Language Recognition Workshop, 2024

A Synopsis of FAME 2024 Challenge: Associating Faces with Voices in Multilingual Environments.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Adaptive-Avg-Pooling Based Attention Vision Transformer for Face Anti-Spoofing.
Proceedings of the IEEE International Conference on Acoustics, 2024

Dual Knowledge Distillation for Efficient Sound Event Detection.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Self-Supervised Training of Speaker Encoder With Multi-Modal Diverse Positive Pairs.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Leveraging Audio-Tagging Assisted Sound Event Detection using Weakified Strong Labels and Frequency Dynamic Convolutions.
Proceedings of the IEEE Statistical Signal Processing Workshop, 2023

A Multi-Task Learning Framework for Sound Event Detection using High-level Acoustic Characteristics of Sounds.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

2022
Neural Acoustic-Phonetic Approach for Speaker Verification With Phonetic Attention Mask.
IEEE Signal Process. Lett., 2022

I4U System Description for NIST SRE'20 CTS Challenge.
CoRR, 2022

A Novel Feature Based on Graph Signal Processing for Detection of Physical Access Attacks.
Proceedings of the Odyssey 2022: The Speaker and Language Recognition Workshop, 28 June, 2022

On the Use of Absolute Threshold of Hearing-based Loss for Full-band Speech Enhancement.
Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022

Dynamic Thresholding on FixMatch with Weak and Strong Data Augmentations for Sound Event Detection.
Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022

Self-Supervised Speaker Recognition with Loss-Gated Learning.
Proceedings of the IEEE International Conference on Acoustics, 2022

MFA: TDNN with Multi-Scale Frequency-Channel Attention for Text-Independent Speaker Verification with Short Utterances.
Proceedings of the IEEE International Conference on Acoustics, 2022

A Device Classification-Aided Multi-Task Framework for Low-Complexity Acoustic Scene Classification.
Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, 2022

2021
Modified Magnitude-Phase Spectrum Information for Spoofing Detection.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Graph Fourier Transform Based Audio Zero-Watermarking.
IEEE Signal Process. Lett., 2021

Enhancing the Intelligibility of Cleft Lip and Palate Speech Using Cycle-Consistent Adversarial Networks.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Is Someone Speaking?: Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Capsule Network based End-to-end System for Detection of Replay Attacks.
Proceedings of the 12th International Symposium on Chinese Spoken Language Processing, 2021

Diagnosis of COVID-19 Using Auditory Acoustic Cues.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Data Augmentation with Signal Companding for Detection of Logical Access Attacks.
Proceedings of the IEEE International Conference on Acoustics, 2021

Significance of Data Augmentation for Improving Cleft Lip and Palate Speech Recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

2020
Significance of Subband Features for Synthetic Speech Detection.
IEEE Trans. Inf. Forensics Secur., 2020

Long-term high frequency features for synthetic speech detection.
Digit. Signal Process., 2020

HLT-NUS Submission for NIST 2019 Multimedia Speaker Recognition Evaluation.
CoRR, 2020

The FFSVC 2020 Evaluation Plan.
CoRR, 2020

Black-box Attacks on Automatic Speaker Verification using Feedback-controlled Voice Conversion.
Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020

Personalized Singing Voice Generation Using WaveRNN.
Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020

Light Convolutional Neural Network with Feature Genuinization for Detection of Synthetic Speech Attacks.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Audio-Visual Speaker Recognition with a Cross-Modal Discriminative Network.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

The INTERSPEECH 2020 Far-Field Speaker Verification Challenge.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

The Attacker's Perspective on Automatic Speaker Verification: An Overview.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Speaker-Utterance Dual Attention for Speaker and Utterance Verification.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

End-to-End Code-Switching TTS with Cross-Lingual Language Model.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Assessing the Scope of Generalized Countermeasures for Anti-Spoofing.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

On the Importance of Vocal Tract Constriction for Speaker Characterization: The Whispered Speech Study.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Transformer-based Arabic Dialect Identification.
Proceedings of the International Conference on Asian Language Processing, 2020

Predictions of Subjective Ratings and Spoofing Assessments of Voice Conversion Challenge 2020 Submissions.
Proceedings of the Joint Workshop for the Blizzard Challenge and Voice Conversion Challenge 2020, 2020

Voice Conversion Challenge 2020 -- Intra-lingual semi-parallel and cross-lingual voice conversion --.
Proceedings of the Joint Workshop for the Blizzard Challenge and Voice Conversion Challenge 2020, 2020

Emotion Invariant Speaker Embeddings for Speaker Identification with Emotional Speech.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

HLT-NUS Submission for 2019 NIST Multimedia Speaker Recognition Evaluation.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

Classification of Speech with and without Face Mask using Acoustic Features.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

2019
Extraction of Octave Spectra Information for Spoofing Attack Detection.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

Low frequency frame-wise normalization over constant-Q transform for playback speech detection.
Digit. Signal Process., 2019

Investigating Text-Independent Speaker Verification Systems Under Varied Data Conditions.
Circuits Syst. Signal Process., 2019

Exploring Text-Constraint Models and Source Information for Long-Enrollment with Short-Test Speaker Verification.
Circuits Syst. Signal Process., 2019

I4U Submission to NIST SRE 2018: Leveraging from a Decade of Shared Experiences.
CoRR, 2019

RSL2019: A Realistic Speech Localization Corpus.
Proceedings of the 22nd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2019

Robust Sound Recognition: A Neuromorphic Approach.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

On the Importance of Audio-Source Separation for Singer Identification in Polyphonic Music.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Multi-Level Adaptive Speech Activity Detector for Speech in Naturalistic Environments.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

A Unified Framework for Speaker and Utterance Verification.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

I4U Submission to NIST SRE 2018: Leveraging from a Decade of Shared Experiences.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

SpeechMarker: A Voice Based Multi-Level Attendance Application.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Long Range Acoustic Features for Spoofed Speech Detection.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Instantaneous Phase and Long-Term Acoustic Cues for Orca Activity Detection.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Cross-lingual Voice Conversion with Bilingual Phonetic Posteriorgram and Average Modeling.
Proceedings of the IEEE International Conference on Acoustics, 2019

A Modularized Neural Network with Language-Specific Output Layers for Cross-Lingual Voice Conversion.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

Long Range Acoustic and Deep Features Perspective on ASVspoof 2019.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

Many-to-many Cross-lingual Voice Conversion with a Jointly Trained Speaker Embedding Network.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

Multi-band Spectral Entropy Information for Detection of Replay Attacks.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

Speaker-independent Spectral Mapping for Speech-to-Singing Conversion.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

Speaker Clustering with Penalty Distance for Speaker Verification with Multi-Speaker Speech.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

2018
Significance of duration modification for speaker verification under mismatch speech tempo condition.
Int. J. Speech Technol., 2018

Multi-style speaker recognition database in practical conditions.
Int. J. Speech Technol., 2018

Analysis of Speech Emotions in Realistic Environments.
Proceedings of the 2018 Workshop on Speech, Music and Mind, 2018

Generative X-Vectors for Text-Independent Speaker Verification.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Combining Phase-based Features for Replay Spoof Detection System.
Proceedings of the 11th International Symposium on Chinese Spoken Language Processing, 2018

Extended Constant-Q Cepstral Coefficients for Detection of Spoofing Attacks.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

Investigating Text-independent Speaker Verification from Practically Realizable System Perspective.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

Compensating Utterance Information in Fixed Phrase Speaker Verification.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

Instantaneous Phase and Excitation Source Features for Detection of Replay Attacks.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

2017
Development of Multi-Level Speech based Person Authentication System.
J. Signal Process. Syst., 2017

Analysis of the Intrinsic Mode Functions for Speaker Information.
Speech Commun., 2017

Exploring kernel discriminant analysis for speaker verification with limited test data.
Pattern Recognit. Lett., 2017

Role of voice activity detection methods for the speakers in the wild challenge.
Proceedings of the Twenty-third National Conference on Communications, 2017

IITG-Indigo System for NIST 2016 SRE Challenge.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Spoof Detection Using Source, Instantaneous Frequency and Cepstral Features.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

2016
Countermeasure to handle replay attacks in practical speaker verification systems.
Proceedings of the 2016 International Conference on Signal Processing and Communications (SPCOM), 2016

Significance of constraining text in limited data text-independent speaker verification.
Proceedings of the 2016 International Conference on Signal Processing and Communications (SPCOM), 2016

Exploring Session Variability and Template Aging in Speaker Verification for Fixed Phrase Short Utterances.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

2015
Different aspects of source information for limited data speaker verification.
Proceedings of the Twenty First National Conference on Communications, 2015

Speaker verification using Gaussian posteriorgrams on fixed phrase short utterances.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

2014
Speech biometric based attendance system.
Proceedings of the Twentieth National Conference on Communications, 2014

Combining source and system information for limited data speaker verification.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

2013
Development and evaluation of online text-independent speaker verification system for remote person authentication.
Int. J. Speech Technol., 2013

2012
Multivariability speaker recognition database in Indian scenario.
Int. J. Speech Technol., 2012

