Rohit Sinha

Orcid: 0000-0002-0419-6501

Affiliations:
  • Indian Institute of Technology Guwahati, India
  • Cambridge University, Engineering Department, UK (former)


According to our database, Rohit Sinha authored at least 99 papers between 2002 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
Deep multi-task learning based detection of correlated mental disorders using audio modality.
Comput. Speech Lang., 2025

2024
Spectro-Temporally Compressed Source Features for Replay Attack Detection.
IEEE Signal Process. Lett., 2024

Exploring the Task-agnostic Trait of Self-supervised Learning in the Context of Detecting Mental Disorders.
CoRR, 2024

Effects of Rate of Articulation in Rhythm Formant Analysis-based Dialect Classification.
Proceedings of the International Conference on Asian Language Processing, 2024

2023
Analyzing the Effect of Data Impurity on the Detection Performances of Mental Disorders.
CoRR, 2023

An Investigation on the Audio-Video Data Based Estimation of Emotion Regulation Difficulties and Their Association With Mental Disorders.
IEEE Access, 2023

Decoding Asian Elephant Vocalisations: Unravelling Call Types, Context-Specific Behaviors, and Individual Identities.
Proceedings of the Speech and Computer - 25th International Conference, 2023

Investigating the Effect of Data Impurity on the Detection Performances of Mental Disorders Through Spoken Dialogues.
Proceedings of the Speech and Computer - 25th International Conference, 2023

ASHI: A Database of Assamese Accented Hindi.
Proceedings of the 26th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2023

2022
Wavelet filterbank-based EEG rhythm-specific spatial features for covert speech classification.
IET Signal Process., 2022

A New Framework for Artificial Bandwidth Extension Using H∞ Filtering.
Circuits Syst. Signal Process., 2022

Exploring the Role of Emotion Regulation Difficulties in the Assessment of Mental Disorders.
CoRR, 2022

Classifying Mahout and Social Interactions of Asian Elephants Based on Trumpet Calls.
Proceedings of the Speech and Computer - 24th International Conference, 2022

Influence of Accented Speech in Automatic Speech Recognition: A Case Study on Assamese L1 Speakers Speaking Code Switched Hindi-English.
Proceedings of the Speech and Computer - 24th International Conference, 2022

Sensitivity Analysis of MaskCycleGAN based Voice Conversion for Enhancing Cleft Lip and Palate Speech Recognition.
Proceedings of the IEEE International Conference on Signal Processing and Communications, 2022

2021
On the Estimation of Difficulty in Emotion Regulation using Spoken Dialogue.
Proceedings of the IEEE Region 10 Conference, 2021

Enhancing the Intelligibility of Cleft Lip and Palate Speech Using Cycle-Consistent Adversarial Networks.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Processing Phoneme Specific Segments for Cleft Lip and Palate Speech Enhancement.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

Significance of Data Augmentation for Improving Cleft Lip and Palate Speech Recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021

2020
Novel textual features for language modeling of intra-sentential code-switching data.
Comput. Speech Lang., 2020

Exploration of End-to-End Framework for Code-Switching Speech Recognition Task: Challenges and Enhancements.
IEEE Access, 2020

Genetic Algorithm Optimized Structured Dictionary for Discriminative Block Sparse Representation.
IEEE Access, 2020

Joint Language Identification of Code-Switching Speech using Attention-based E2E Network.
Proceedings of the International Conference on Signal Processing and Communications, 2020

Mizo Spoken Query System Enhanced with Prosodic Information.
Proceedings of the 23rd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2020

IITG-Indigo Submissions for NIST 2018 Speaker Recognition Evaluation and Post-Challenge Improvements.
Proceedings of the 2020 National Conference on Communications, 2020

Investigating Target Set Reduction for End-to-End Speech Recognition of Hindi-English Code-Switching Data.
Proceedings of the 2020 National Conference on Communications, 2020

2019
IITG-HingCoS corpus: A Hinglish code-switching database for automatic speech recognition.
Speech Commun., 2019

SpeechMarker: A Voice Based Multi-Level Attendance Application.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

2018
Cooperative Spectrum Sensing Using Quantized Energy Statistics in the Absence of Dedicated Reporting Channel.
IEEE Trans. Veh. Technol., 2018

Improved Structured Dictionary Learning via Correlation and Class Based Block Formation.
IEEE Trans. Signal Process., 2018

Improving children's mismatched ASR using structured low-rank feature projection.
Speech Commun., 2018

Enrollee-constrained sparse coding of test data for speaker verification.
Pattern Recognit. Lett., 2018

Sparse coding of i-vector/JFA latent vector over ensemble dictionaries for language identification systems.
Int. J. Speech Technol., 2018

Studying the role of pitch-adaptive spectral estimation and speaking-rate normalization in automatic speech recognition.
Digit. Signal Process., 2018

A Fast Adaptation Approach for Enhanced Automatic Recognition of Children's Speech with Mismatched Acoustic Models.
Circuits Syst. Signal Process., 2018

Assessment of pitch-adaptive front-end signal processing for children's speech recognition.
Comput. Speech Lang., 2018

Hindi-English Code-Switching Speech Corpus.
CoRR, 2018

Some Experiments on Context Mismatched Speech Recognition.
Proceedings of the 2018 International Conference on Signal Processing and Communications (SPCOM), 2018

Exploration of Compressed ILPR Features for Replay Attack Detection.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

A Novel Approach for Effective Recognition of the Code-Switched Data on Monolingual Language Model.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Robust Mizo Continuous Speech Recognition.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

AGROASSAM: A Web Based Assamese Speech Recognition Application for Retrieving Agricultural Commodity Price and Weather Information.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Exploring Sparse Representation for Improved Online Handwriting Recognition.
Proceedings of the 16th International Conference on Frontiers in Handwriting Recognition, 2018

2017
Improvements in IITG Assamese Spoken Query System: Background Noise Suppression and Alternate Acoustic Modeling.
J. Signal Process. Syst., 2017

Pitch-Normalized Acoustic Features for Robust Children's Speech Recognition.
IEEE Signal Process. Lett., 2017

Incorporating Primary User Interference for Enhanced Spectrum Sensing.
IEEE Signal Process. Lett., 2017

Sparse coding over redundant dictionaries for fast adaptation of speech recognition system.
Comput. Speech Lang., 2017

Language Modeling for Code-Switched Data: Challenges and Approaches.
CoRR, 2017

Correlation and Class Based Block Formation for Improved Structured Dictionary Learning.
CoRR, 2017

Improving children speech recognition in acoustically mismatched condition using eigenvoices and feature projections.
Proceedings of the Twenty-third National Conference on Communications, 2017

Role of voice activity detection methods for the speakers in the wild challenge.
Proceedings of the Twenty-third National Conference on Communications, 2017

IITG-Indigo System for NIST 2016 SRE Challenge.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Spoof Detection Using Source, Instantaneous Frequency and Cepstral Features.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Enhancing noise and pitch robustness of children's ASR.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016
An Unsupervised Method for Detection and Validation of The Optic Disc and The Fovea.
CoRR, 2016

Low complexity language recognition exploiting ensemble of random subspace.
Proceedings of the 2016 International Conference on Signal Processing and Communications (SPCOM), 2016

Countermeasure to handle replay attacks in practical speaker verification systems.
Proceedings of the 2016 International Conference on Signal Processing and Communications (SPCOM), 2016

Exploring the role of pitch-adaptive cepstral features in context of children's mismatched ASR.
Proceedings of the 2016 International Conference on Signal Processing and Communications (SPCOM), 2016

Semi-Coupled Dictionary Based Automatic Bandwidth Extension Approach for Enhancing Children's ASR.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Pitch-Adaptive Front-End Features for Robust Children's ASR.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

2015
Low Complexity On-Line Adaptation Techniques in Context of Assamese Spoken Query System.
J. Signal Process. Syst., 2015

Robust Speaker Verification With Joint Sparse Coding Over Learned Dictionaries.
IEEE Trans. Inf. Forensics Secur., 2015

An Efficient SVD Shrinkage for Rank Estimation.
IEEE Signal Process. Lett., 2015

Low-complexity speaker verification with decimated supervector representations.
Speech Commun., 2015

Pitch adaptive MFCC features for improving children's mismatched ASR.
Int. J. Speech Technol., 2015

Electrocardiogram signal denoising using non-local wavelet transform domain filtering.
IET Signal Process., 2015

A Gaussian Scale Space Approach For Exudates Detection, Classification And Severity Prediction.
CoRR, 2015

Sparse coding based spectrum sensing in presence of multiple frequency hopping primary users.
Proceedings of the Twenty First National Conference on Communications, 2015

Low-memory fast on-line adaptation for acoustically mismatched children's speech recognition.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Speaker verification using Gaussian posteriorgrams on fixed phrase short utterances.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

2014
Improved Bases Selection in Acoustic Model Interpolation for Fast On-Line Adaptation.
IEEE Signal Process. Lett., 2014

Exploring Data-Independent Dimensionality Reduction in Sparse Representation-Based Speaker Identification.
Circuits Syst. Signal Process., 2014

Speech biometric based attendance system.
Proceedings of the Twentieth National Conference on Communications, 2014

A low complexity cluster model interpolation based on-line adaptation technique for spoken query systems.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

A low complexity model adaptation approach involving sparse coding over multiple dictionaries.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

2013
The IITG speaker verification systems for NIST SRE 2012.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
Speaker verification in sensor and acoustic environment mismatch conditions.
Int. J. Speech Technol., 2012

Multivariability speaker recognition database in Indian scenario.
Int. J. Speech Technol., 2012

On exploring the similarity and fusion of i-vector and sparse representation based speaker verification systems.
Proceedings of the Odyssey 2012: The Speaker and Language Recognition Workshop, 2012

Sparse representation over learned and discriminatively learned dictionaries for speaker verification.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
A Study on the Effect of Pitch on LPCC and PLPC Features for Children's ASR in Comparison to MFCC.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

2010
Exploring the Effect of Differences in the Acoustic Correlates of Adults' and Children's Speech in the Context of Automatic Speech Recognition.
EURASIP J. Audio Speech Music. Process., 2010

Enhancing children's speech recognition under mismatched condition by explicit acoustic normalization.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

2009
On the use of pitch normalization for improving children's speech recognition.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Exploring the role of spectral smoothing in context of children's speech recognition.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

2008
A shift-based approach to speaker normalization using non-linear frequency-scaling model.
Speech Commun., 2008

Energy and entropy based switching algorithm for speech endpoint detection in varying SNR conditions.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

2007
A Study of Filter Bank Smoothing in MFCC Features for Recognition of Children's Speech.
IEEE Trans. Speech Audio Process., 2007

Improving Speech Transcription for Mandarin-English Translation.
Proceedings of the IEEE International Conference on Acoustics, 2007

Speech Recognition System Combination for Machine Translation.
Proceedings of the IEEE International Conference on Acoustics, 2007

2006
Progress in the CU-HTK broadcast news transcription system.
IEEE Trans. Speech Audio Process., 2006

The CU-HTK Mandarin Broadcast News Transcription System.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Study Of Non-Linear Frequency Warping Functions For Speaker Normalization.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005
The Cambridge University March 2005 speaker diarisation system.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

2004
An investigation into front-end signal processing for speaker normalization.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Non-uniform speaker normalization using affine-transformation.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003
A method for compensation of Jacobian in speaker normalization.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2002
A simple approach to non-uniform vowel normalization.
Proceedings of the IEEE International Conference on Acoustics, 2002

Non-uniform scaling based speaker normalization.
Proceedings of the IEEE International Conference on Acoustics, 2002

