Rohit Sinha
Orcid: 0000-0002-0419-6501Affiliations:
- Indian Institute of Technology Guwahati, India
- Cambridge University, Engineering Department, UK (former)
According to our database1,
Rohit Sinha
authored at least 96 papers
between 2002 and 2025.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2025
Deep multi-task learning based detection of correlated mental disorders using audio modality.
Comput. Speech Lang., 2025
2024
IEEE Signal Process. Lett., 2024
Exploring the Task-agnostic Trait of Self-supervised Learning in the Context of Detecting Mental Disorders.
CoRR, 2024
Effects of Rate of Articulation in Rhythm Formant Analysis-based Dialect Classification.
Proceedings of the International Conference on Asian Language Processing, 2024
2023
Analyzing the Effect of Data Impurity on the Detection Performances of Mental Disorders.
CoRR, 2023
An Investigation on the Audio-Video Data Based Estimation of Emotion Regulation Difficulties and Their Association With Mental Disorders.
IEEE Access, 2023
Decoding Asian Elephant Vocalisations: Unravelling Call Types, Context-Specific Behaviors, and Individual Identities.
Proceedings of the Speech and Computer - 25th International Conference, 2023
Investigating the Effect of Data Impurity on the Detection Performances of Mental Disorders Through Spoken Dialogues.
Proceedings of the Speech and Computer - 25th International Conference, 2023
Proceedings of the 26th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2023
2022
Wavelet filterbank-based EEG rhythm-specific spatial features for covert speech classification.
IET Signal Process., 2022
Circuits Syst. Signal Process., 2022
Exploring the Role of Emotion Regulation Difficulties in the Assessment of Mental Disorders.
CoRR, 2022
Classifying Mahout and Social Interactions of Asian Elephants Based on Trumpet Calls.
Proceedings of the Speech and Computer - 24th International Conference, 2022
Influence of Accented Speech in Automatic Speech Recognition: A Case Study on Assamese L1 Speakers Speaking Code Switched Hindi-English.
Proceedings of the Speech and Computer - 24th International Conference, 2022
Sensitivity Analysis of MaskCycleGAN based Voice Conversion for Enhancing Cleft Lip and Palate Speech Recognition.
Proceedings of the IEEE International Conference on Signal Processing and Communications, 2022
2021
Proceedings of the IEEE Region 10 Conference, 2021
Enhancing the Intelligibility of Cleft Lip and Palate Speech Using Cycle-Consistent Adversarial Networks.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021
Significance of Data Augmentation for Improving Cleft Lip and Palate Speech Recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021
2020
Novel textual features for language modeling of intra-sentential code-switching data.
Comput. Speech Lang., 2020
Exploration of End-to-End Framework for Code-Switching Speech Recognition Task: Challenges and Enhancements.
IEEE Access, 2020
Genetic Algorithm Optimized Structured Dictionary for Discriminative Block Sparse Representation.
IEEE Access, 2020
Joint Language Identification of Code-Switching Speech using Attention-based E2E Network.
Proceedings of the International Conference on Signal Processing and Communications, 2020
Proceedings of the 23rd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2020
IITG- Indigo Submissions for NIST 2018 Speaker Recognition Evaluation and Post-Challenge Improvements.
Proceedings of the 2020 National Conference on Communications, 2020
Investigating Target Set Reduction for End-to-End Speech Recognition of Hindi-English Code-Switching Data.
Proceedings of the 2020 National Conference on Communications, 2020
2019
IITG-HingCoS corpus: A Hinglish code-switching database for automatic speech recognition.
Speech Commun., 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
2018
Cooperative Spectrum Sensing Using Quantized Energy Statistics in the Absence of Dedicated Reporting Channel.
IEEE Trans. Veh. Technol., 2018
Improved Structured Dictionary Learning via Correlation and Class Based Block Formation.
IEEE Trans. Signal Process., 2018
Speech Commun., 2018
Pattern Recognit. Lett., 2018
Sparse coding of i-vector/JFA latent vector over ensemble dictionaries for language identification systems.
Int. J. Speech Technol., 2018
Studying the role of pitch-adaptive spectral estimation and speaking-rate normalization in automatic speech recognition.
Digit. Signal Process., 2018
A Fast Adaptation Approach for Enhanced Automatic Recognition of Children's Speech with Mismatched Acoustic Models.
Circuits Syst. Signal Process., 2018
Assessment of pitch-adaptive front-end signal processing for children's speech recognition.
Comput. Speech Lang., 2018
Proceedings of the 2018 International Conference on Signal Processing and Communications (SPCOM), 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
A Novel Approach for Effective Recognition of the Code-Switched Data on Monolingual Language Model.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
AGROASSAM: A Web Based Assamese Speech Recognition Application for Retrieving Agricultural Commodity Price and Weather Information.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 16th International Conference on Frontiers in Handwriting Recognition, 2018
2017
Improvements in IITG Assamese Spoken Query System: Background Noise Suppression and Alternate Acoustic Modeling.
J. Signal Process. Syst., 2017
IEEE Signal Process. Lett., 2017
IEEE Signal Process. Lett., 2017
Sparse coding over redundant dictionaries for fast adaptation of speech recognition system.
Comput. Speech Lang., 2017
Correlation and Class Based Block Formation for Improved Structured Dictionary Learning.
CoRR, 2017
Improving children speech recognition in acoustically mismatched condition using eigenvoices and feature projections.
Proceedings of the Twenty-third National Conference on Communications, 2017
Proceedings of the Twenty-third National Conference on Communications, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
2016
CoRR, 2016
Semi-Coupled Dictionary Based Automatic Bandwidth Extension Approach for Enhancing Children's ASR.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
2015
Low Complexity On-Line Adaptation Techniques in Context of Assamese Spoken Query System.
J. Signal Process. Syst., 2015
IEEE Trans. Inf. Forensics Secur., 2015
Speech Commun., 2015
Int. J. Speech Technol., 2015
Electrocardiogram signal denoising using non-local wavelet transform domain filtering.
IET Signal Process., 2015
A Gaussian Scale Space Approach For Exudates Detection, Classification And Severity Prediction.
CoRR, 2015
Sparse coding based spectrum sensing in presence of multiple frequency hopping primary users.
Proceedings of the Twenty First National Conference on Communications, 2015
Low-memory fast on-line adaptation for acoustically mismatched children's speech recognition.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
2014
Improved Bases Selection in Acoustic Model Interpolation for Fast On-Line Adaptation.
IEEE Signal Process. Lett., 2014
Exploring Data-Independent Dimensionality Reduction in Sparse Representation-Based Speaker Identification.
Circuits Syst. Signal Process., 2014
Proceedings of the Twentieth National Conference on Communications, 2014
A low complexity cluster model interpolation based on-line adaptation technique for spoken query systems.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014
A low complexity model adaptation approach involving sparse coding over multiple dictionaries.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
2013
Proceedings of the IEEE International Conference on Acoustics, 2013
2012
Int. J. Speech Technol., 2012
Int. J. Speech Technol., 2012
On exploring the similarity and fusion of i-vector and sparse representation based speaker verification systems.
Proceedings of the Odyssey 2012: The Speaker and Language Recognition Workshop, 2012
Sparse representation over learned and discriminatively learned dictionaries for speaker verification.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
2011
A Study on the Effect of Pitch on LPCC and PLPC Features for Children's ASR in Comparison to MFCC.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
2010
Exploring the Effect of Differences in the Acoustic Correlates of Adults' and Children's Speech in the Context of Automatic Speech Recognition.
EURASIP J. Audio Speech Music. Process., 2010
Enhancing children's speech recognition under mismatched condition by explicit acoustic normalization.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Exploring the role of spectral smoothing in context of children's speech recognition.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
2008
A shift-based approach to speaker normalization using non-linear frequency-scaling model.
Speech Commun., 2008
Energy and entropy based switching algorithm for speech endpoint detection in varying SNR conditions.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
2007
A Study of Filter Bank Smoothing in MFCC Features for Recognition of Children's Speech.
IEEE Trans. Speech Audio Process., 2007
Proceedings of the IEEE International Conference on Acoustics, 2007
Proceedings of the IEEE International Conference on Acoustics, 2007
2006
IEEE Trans. Speech Audio Process., 2006
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
2005
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
2004
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004
2003
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003
2002
Proceedings of the IEEE International Conference on Acoustics, 2002
Proceedings of the IEEE International Conference on Acoustics, 2002