K. Sreenivasa Rao
Orcid: 0000-0001-6112-6887Affiliations:
- Indian Institute of Technology Kharagpur, West Bengal, India
According to our database1,
K. Sreenivasa Rao
authored at least 194 papers
between 2002 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2024
Circuits Syst. Signal Process., August, 2024
Speech emotion recognition with transfer learning and multi-condition training for noisy environments.
Int. J. Speech Technol., June, 2024
A multi-modal lecture video indexing and retrieval framework with multi-scale residual attention network and multi-similarity computation.
Signal Image Video Process., April, 2024
Automatic classification of neurological voice disorders using wavelet scattering features.
Speech Commun., 2024
Hierarchical emotion recognition from speech using source, power spectral and prosodic features.
Multim. Tools Appl., 2024
NeuralMultiling: A Novel Neural Architecture Search for Smartphone based Multilingual Speaker Verification.
CoRR, 2024
Straight Through Gumbel Softmax Estimator based Bimodal Neural Architecture Search for Audio-Visual Deepfake Detection.
CoRR, 2024
MLSD-GAN - Generating Strong High Quality Face Morphing Attacks using Latent Semantic Disentanglement.
CoRR, 2024
2023
Int. J. Speech Technol., September, 2023
A Novel Zero-Resource Spoken Term Detection Using Affinity Kernel Propagation with Acoustic Feature Map.
SN Comput. Sci., May, 2023
Multim. Tools Appl., 2023
Proceedings of the Pattern Recognition and Machine Intelligence, 2023
Relation Predictions in Comorbid Disease Centric Knowledge Graph Using Heterogeneous GNN Models.
Proceedings of the Bioinformatics and Biomedical Engineering, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Similarity-based Multi-Modal Lecture Video Indexing and Retrieval with Deep Learning.
Proceedings of the 14th International Conference on Computing Communication and Networking Technologies, 2023
ExtSwap: Leveraging Extended Latent Mapper for Generating High Quality Face Swapping.
Proceedings of the Computer Vision and Image Processing - 8th International Conference, 2023
2022
VOP detection for read and conversation speech using CWT coefficients and phone boundaries.
J. Ambient Intell. Humaniz. Comput., 2022
Dysarthric speech detection from telephone quality speech using epoch-based pitch perturbation features.
Int. J. Speech Technol., 2022
Correction to: CycleGAN-Based Speech Mode Transformation Model for Robust Multilingual ASR.
Circuits Syst. Signal Process., 2022
Circuits Syst. Signal Process., 2022
Phoneme Segmentation-Based Unsupervised Pattern Discovery and Clustering of Speech Signals.
Circuits Syst. Signal Process., 2022
A novel approach to unsupervised pattern discovery in speech using Convolutional Neural Network.
Comput. Speech Lang., 2022
CoRR, 2022
Proceedings of the 14th Annual Meeting of the Forum for Information Retrieval Evaluation, 2022
2021
IEEE ACM Trans. Comput. Biol. Bioinform., 2021
Approaches for Multilingual Phone Recognition in Code-switched and Non-code-switched Scenarios Using Indian Languages.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2021
Utterance partitioning for speaker recognition: an experimental review and analysis with new findings under GMM-SVM framework.
Int. J. Speech Technol., 2021
Int. J. Pervasive Comput. Commun., 2021
Circuits Syst. Signal Process., 2021
Circuits Syst. Signal Process., 2021
Audio-Visual Biometric Recognition and Presentation Attack Detection: A Comprehensive Survey.
IEEE Access, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
2020
ACM Trans. Intell. Syst. Technol., 2020
Children's Story Classification in Indian Languages Using Linguistic and Keyword-based Features.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2020
Speech Commun., 2020
Robust <i>f</i><sub>0</sub> extraction from monophonic signals using adaptive sub-band filtering.
Speech Commun., 2020
Neural Process. Lett., 2020
Excitation modelling using epoch features for statistical parametric speech synthesis.
Comput. Speech Lang., 2020
IEEE Access, 2020
Proceedings of the 2020 National Conference on Communications, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
2019
IEEE Signal Process. Lett., 2019
Development and analysis of multilingual phone recognition systems using Indian languages.
Int. J. Speech Technol., 2019
Circuits Syst. Signal Process., 2019
Autom. Control. Comput. Sci., 2019
Glottal Closure Instants Detection from Speech Signal by Deep Features Extracted from Raw Speech and Linear Prediction Residual.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Source Modeling Techniques for Quality Enhancement in Statistical Parametric Speech Synthesis - Studies in Speech Signal Processing, Natural Language Understanding, and Machine Learning
Springer, ISBN: 978-3-030-02758-2, 2019
2018
Speech Commun., 2018
Pattern Recognit. Lett., 2018
Neural network and GMM based feature mappings for consonant-vowel recognition in emotional environment.
Int. J. Speech Technol., 2018
Int. J. Speech Technol., 2018
Int. J. Speech Technol., 2018
IET Signal Process., 2018
Circuits Syst. Signal Process., 2018
Predominant Melody Extraction from Vocal Polyphonic Music Signal by Time-Domain Adaptive Filtering-Based Method.
Circuits Syst. Signal Process., 2018
Glottal Closure Instants Detection From Pathological Acoustic Speech Signal Using Deep Learning.
CoRR, 2018
Beam Search Decoding using Manner of Articulation Detection Knowledge Derived from Connectionist Temporal Classification.
CoRR, 2018
Manner of Articulation Detection using Connectionist Temporal Classification to Improve Automatic Speech Recognition Performance.
CoRR, 2018
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Indian Languages ASR: A Multilingual Phone Recognition Framework with IPA Based Common Phone-set, Predicted Articulatory Features and Feature fusion.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Harmonic-Percussive Source Separation of Polyphonic Music by Suppressing Impulsive Noise Events.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Modifying LSTM Posteriors with Manner of Articulation Knowledge to Improve Speech Recognition Performance.
Proceedings of the 17th IEEE International Conference on Machine Learning and Applications, 2018
Robust Detection of Glottal Activity Using Unwrapped Phase Electroglottographic Signal.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the 2018 International Conference on Advances in Computing, 2018
Proceedings of the 2018 International Conference on Advances in Computing, 2018
Proceedings of the 2018 International Conference on Advances in Computing, 2018
2017
IEEE Signal Process. Lett., 2017
Supervector-based approaches in a discriminative framework for speaker verification in noisy environments.
Int. J. Speech Technol., 2017
Modification of energy spectra, epoch parameters and prosody for emotion conversion in speech.
Int. J. Speech Technol., 2017
Parameterization of Excitation Signal for Improving the Quality of HMM-Based Speech Synthesis System.
Circuits Syst. Signal Process., 2017
Comput. Speech Lang., 2017
Parametric representation of excitation source information for language identification.
Comput. Speech Lang., 2017
Comput. Speech Lang., 2017
Biomed. Signal Process. Control., 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
2016
Time-domain deterministic plus noise model based hybrid source modeling for statistical parametric speech synthesis.
Speech Commun., 2016
Speech Commun., 2016
Articulatory and excitation source features for speech recognition in read, extempore and conversation modes.
Int. J. Speech Technol., 2016
Prosody modeling for syllable based text-to-speech synthesis using feedforward neural networks.
Neurocomputing, 2016
Circuits Syst. Signal Process., 2016
A Robust Non-Parametric and Filtering Based Approach for Glottal Closure Instant Detection.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Enhanced Harmonic Content and Vocal Note Based Predominant Melody Extraction from Vocal Polyphonic Music Signals.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 13th International Conference on Natural Language Processing, 2016
Predominant melody extraction from vocal polyphonic music signal by combined spectro-temporal method.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
A deterministic plus noise model of excitation signal using principal component analysis for parametric speech synthesis.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Proceedings of the 2016 International Conference on Advances in Computing, 2016
Proceedings of the 2016 International Conference on Advances in Computing, 2016
Proceedings of the Ninth International Conference on Contemporary Computing, 2016
2015
Signal Image Video Process., 2015
Int. J. Speech Technol., 2015
Circuits Syst. Signal Process., 2015
Proceedings of the Twenty First National Conference on Communications, 2015
Hybrid Source Modeling Method Utilizing Optimal Residual Frames for HMM-based Speech Synthesis.
Proceedings of the Mining Intelligence and Knowledge Exploration, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the Eighth International Conference on Advances in Pattern Recognition, 2015
Proceedings of the Eighth International Conference on Advances in Pattern Recognition, 2015
Proceedings of the Eighth International Conference on Advances in Pattern Recognition, 2015
Proceedings of the 2015 International Conference on Advances in Computing, 2015
Proceedings of the 2015 International Conference on Advances in Computing, 2015
Analysis and modeling pauses for synthesis of storytelling speech based on discourse modes.
Proceedings of the Eighth International Conference on Contemporary Computing, 2015
Proceedings of the Eighth International Conference on Contemporary Computing, 2015
Proceedings of the Eighth International Conference on Contemporary Computing, 2015
Proceedings of the Eighth International Conference on Contemporary Computing, 2015
Proceedings of the Eighth International Conference on Contemporary Computing, 2015
Proceedings of the Eighth International Conference on Contemporary Computing, 2015
2014
Springer Briefs in Electrical and Computer Engineering, Springer, ISBN: 978-3-319-03116-3, 2014
Segmentation, indexing and retrieval of TV broadcast news bulletins using Gaussian mixture models and vector quantization codebooks.
Int. J. Speech Technol., 2014
Int. J. Speech Technol., 2014
Stochastic feature compensation methods for speaker verification in noisy environments.
Appl. Soft Comput., 2014
Automatic Phonetic Transcription for read, extempore and conversation speech for an Indian language: Bengali.
Proceedings of the Twentieth National Conference on Communications, 2014
A novel boosting algorithm for improved i-vector based speaker verification in noisy environments.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the 11th International Conference on Natural Language Processing, 2014
Designing prosody rule-set for converting neutral TTS speech to storytelling style speech for Indian languages: Bengali, Hindi and Telugu.
Proceedings of the Seventh International Conference on Contemporary Computing, 2014
Proceedings of the Seventh International Conference on Contemporary Computing, 2014
2013
Springer Briefs in Electrical and Computer Engineering, Springer, ISBN: 978-1-4614-5143-3, 2013
Springer Briefs in Electrical and Computer Engineering, Springer, ISBN: 978-1-4614-6360-3, 2013
Non-uniform time scale modification using instants of significant excitation and vowel onset points.
Speech Commun., 2013
J. Intell. Syst., 2013
Vowel onset point detection for noisy speech using spectral energy at formant frequencies.
Int. J. Speech Technol., 2013
Int. J. Speech Technol., 2013
Pitch synchronous and glottal closure based speech analysis for language recognition.
Int. J. Speech Technol., 2013
Int. J. Speech Technol., 2013
Characterization and recognition of emotions from speech using excitation source information.
Int. J. Speech Technol., 2013
Two-stage intonation modeling using feedforward neural networks for syllable based text-to-speech synthesis.
Comput. Speech Lang., 2013
Optimal weight tuning method for unit selection cost functions in syllable based text-to-speech synthesis.
Appl. Soft Comput., 2013
Proceedings of the Pattern Recognition and Machine Intelligence, 2013
Proceedings of the Pattern Recognition and Machine Intelligence, 2013
Significance of utterance partitioning in GMM-SVM based speaker verification in varying background environment.
Proceedings of the 2013 International Conference Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2013
Proceedings of the 2013 International Conference Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2013
Language identification using Hilbert envelope and phase information of linear prediction residual.
Proceedings of the 2013 International Conference Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2013
Proceedings of the 2013 International Conference Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2013
Phonetic and Prosodically Rich Transcribed speech corpus in Indian languages: Bengali and Odia.
Proceedings of the 2013 International Conference Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2013
Importance of Utterance Partitioning in SVM Classifier with GMM Supervectors for Text-Independent Speaker Verification.
Proceedings of the Mining Intelligence and Knowledge Exploration, 2013
High quality text-to-speech synthesis system with efficient duration models developed using coding schemes based on vowel production characteristics.
Proceedings of the 13th International Conference on Intellient Systems Design and Applications, 2013
Proceedings of the Sixth International Conference on Contemporary Computing, 2013
2012
Springer Briefs in Electrical and Computer Engineering, Springer, ISBN: 978-1-4614-1338-7, 2012
ACM Trans. Speech Lang. Process., 2012
IEEE Trans. Speech Audio Process., 2012
Neural network based feature transformation for emotion independent speaker identification.
Int. J. Speech Technol., 2012
A pitch synchronous approach to design voice conversion system using source-filter correlation.
Int. J. Speech Technol., 2012
Emotion recognition from speech using sub-syllabic and pitch synchronous spectral features.
Int. J. Speech Technol., 2012
Int. J. Speech Technol., 2012
Spotting and Recognition of Consonant-Vowel Units from Continuous Speech Using Accurate Detection of Vowel Onset Points.
Circuits Syst. Signal Process., 2012
Circuits Syst. Signal Process., 2012
Better human computer interaction by enhancing the quality of text-to-speech synthesis.
Proceedings of the 4th International Conference on Intelligent Human Computer Interaction, 2012
Proceedings of the Contemporary Computing - 5th International Conference, 2012
Proceedings of the Contemporary Computing - 5th International Conference, 2012
Emotion Recognition from Semi Natural Speech Using Artificial Neural Networks and Excitation Source Features.
Proceedings of the Contemporary Computing - 5th International Conference, 2012
Proceedings of the Contemporary Computing - 5th International Conference, 2012
Proceedings of the Contemporary Computing - 5th International Conference, 2012
Speaker recognition in the case of emotional environment using transformation of speech features.
Proceedings of the CUBE International IT Conference & Exhibition, 2012
Proceedings of the CUBE International IT Conference & Exhibition, 2012
2011
Recognition of consonant-vowel (CV) units under background noise using combined temporal and spectral preprocessing.
Int. J. Speech Technol., 2011
Int. J. Speech Technol., 2011
Int. J. Speech Technol., 2011
Expert Syst. Appl., 2011
Proceedings of the Contemporary Computing - 4th International Conference, 2011
Proceedings of the Contemporary Computing - 4th International Conference, 2011
Proceedings of the Contemporary Computing - 4th International Conference, 2011
Proceedings of the Contemporary Computing - 4th International Conference, 2011
2010
J. Softw. Eng. Appl., 2010
Voice conversion by mapping the speaker-specific features using pitch synchronous approach.
Comput. Speech Lang., 2010
Proceedings of the Contemporary Computing - Third International Conference, 2010
Proceedings of the Contemporary Computing - Third International Conference, 2010
2009
Speech Commun., 2009
Unit Selection Using Linguistic, Prosodic and Spectral Distance for Developing Text-to-Speech System in Hindi.
Proceedings of the Pattern Recognition and Machine Intelligence, 2009
Proceedings of the Pattern Recognition and Machine Intelligence, 2009
Significance of Word and Syllable Level Information for Expressive Speech Processing.
Proceedings of the Seventh International Conference on Advances in Pattern Recognition, 2009
Proceedings of the Contemporary Computing - Second International Conference, 2009
2008
Proceedings of the Speech, 2008
2007
Determination of Instants of Significant Excitation in Speech Using Hilbert Envelope and Group Delay Function.
IEEE Signal Process. Lett., 2007
Proceedings of the Pattern Recognition and Machine Intelligence, 2007
2006
IEEE Trans. Speech Audio Process., 2006
Proceedings of the 9th International Conference in Information Technology, 2006
2004
Proceedings of the Neural Information Processing, 11th International Conference, 2004
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004
2003
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003
2002
Proceedings of the IEEE International Conference on Acoustics, 2002