S. R. Mahadeva Prasanna
ORCID: 0000-0002-8135-7938
According to our database, S. R. Mahadeva Prasanna authored at least 244 papers between 2002 and 2024.
Bibliography
2024
ACM Trans. Asian Low Resour. Lang. Inf. Process., October, 2024
ACM Trans. Multim. Comput. Commun. Appl., August, 2024
IEEE J. Sel. Top. Signal Process., May, 2024
IEEE ACM Trans. Audio Speech Lang. Process., 2024
IEEE Signal Process. Lett., 2024
Digit. Signal Process., 2024
Representation Loss Minimization with Randomized Selection Strategy for Efficient Environmental Fake Audio Detection.
CoRR, 2024
CoRR, 2024
Strong Alone, Stronger Together: Synergizing Modality-Binding Foundation Models with Optimal Transport for Non-Verbal Emotion Recognition.
CoRR, 2024
Are Music Foundation Models Better at Singing Voice Deepfake Detection? Far-Better Fuse them with Speech Foundation Models.
CoRR, 2024
The Second DISPLACE Challenge: DIarization of SPeaker and LAnguage in Conversational Environments.
CoRR, 2024
Depression Classification Using Token Merging-Based Speech Spectrotemporal Transformer.
Proceedings of the Speech and Computer - 26th International Conference, 2024
Proceedings of the International Conference on Signal Processing and Communications, 2024
Depression Classification Using Log-Mel Spectrograms: A Comparative Analysis of Window Size-Based Data Augmentation and Deep Learning Models.
Proceedings of the 27th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2024
Proceedings of the National Conference on Communications, 2024
Evaluating the Efficacy of Large Acoustic Model for Documenting Non-Orthographic Tribal Languages in India.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
2023
Biomed. Signal Process. Control., May, 2023
Clean vs. Overlapped Speech-Music Detection Using Harmonic-Percussive Features and Multi-Task Learning.
IEEE ACM Trans. Audio Speech Lang. Process., 2023
Proceedings of the Speech and Computer - 25th International Conference, 2023
Proceedings of the Speech and Computer - 25th International Conference, 2023
Proceedings of the Speech and Computer - 25th International Conference, 2023
Significance of Indic Self-supervised Speech Representations for Indic Under-Resourced ASR.
Proceedings of the Speech and Computer - 25th International Conference, 2023
Proceedings of the Speech and Computer - 25th International Conference, 2023
Proceedings of the Speech and Computer - 25th International Conference, 2023
Proceedings of the Speech and Computer - 25th International Conference, 2023
Preliminary Analysis of Lambani Vowels and Vowel Classification Using Acoustic Features.
Proceedings of the Speech and Computer - 25th International Conference, 2023
Bridging the Gap: Towards Linguistic Resource Development for the Low-Resource Lambani Language.
Proceedings of the Speech and Computer - 25th International Conference, 2023
Proceedings of the Speech and Computer - 25th International Conference, 2023
Post-processing of Translated Speech by Pole Modification and Residual Enhancement to Improve Perceptual Quality.
Proceedings of the Speech and Computer - 25th International Conference, 2023
Proceedings of the Speech and Computer - 25th International Conference, 2023
Optimizing Direct Speech-to-Text Translation for un-orthographic low-resource tribal languages using source transliterations.
Proceedings of the 26th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2023
Leveraging Cross Lingual Speech Representations To Build ASR For Under-resourced Languages.
Proceedings of the 26th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2023
Comparative Analysis of Direct Speech-to-Speech Translation and Voice Conversion Using Bi-LSTM.
Proceedings of the 26th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2023
Proceedings of the 28th National Conference on Communications, 2023
Investigation Of Data Augmentation Techniques For Bi-LSTM Based Direct Speech To Speech Translation.
Proceedings of the 28th National Conference on Communications, 2023
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023
Classification of Cleft Lip and Palate Speech Using Fine-Tuned Transformer Pretrained Models.
Proceedings of the Intelligent Human Computer Interaction - 15th International Conference, 2023
2022
Speech Commun., 2022
Biomed. Signal Process. Control., 2022
Importance of Supra-Segmental Information and Self-Supervised Framework for Spoken Language Diarization Task.
Proceedings of the Speech and Computer - 24th International Conference, 2022
Proceedings of the Speech and Computer - 24th International Conference, 2022
Proceedings of the Speech and Computer - 24th International Conference, 2022
Proceedings of the Speech and Computer - 24th International Conference, 2022
Proceedings of the Speech and Computer - 24th International Conference, 2022
Proceedings of the Speech and Computer - 24th International Conference, 2022
Issues in Sub-Utterance Level Language Identification in a Code Switched Bilingual Scenario.
Proceedings of the IEEE International Conference on Signal Processing and Communications, 2022
Foreground-Background Audio Separation using Spectral Peaks based Time-Frequency Masks.
Proceedings of the IEEE International Conference on Signal Processing and Communications, 2022
Proceedings of the IEEE International Conference on Signal Processing and Communications, 2022
Proceedings of the IEEE International Conference on Signal Processing and Communications, 2022
Proceedings of the 25th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2022
Analysis of Layer-Wise Training in Direct Speech to Speech Translation Using BI-LSTM.
Proceedings of the 25th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2022
Analyzing RMFCC Feature for Dialect Identification in Ao, an Under-Resourced Language.
Proceedings of the 27th National Conference on Communications, 2022
Low-Resource Dialect Identification in Ao Using Noise Robust Mean Hilbert Envelope Coefficients.
Proceedings of the 27th National Conference on Communications, 2022
Importance of excitation source and sequence learning towards spoken language identification task.
Proceedings of the 27th National Conference on Communications, 2022
Significance of Prosody Modification in Privacy Preservation on speaker verification.
Proceedings of the 27th National Conference on Communications, 2022
Machine Translation for a Very Low-Resource Language - Layer Freezing Approach on Transfer Learning.
Proceedings of the Fifth Workshop on Technologies for Machine Translation of Low-Resource Languages, 2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
2021
Circuits Syst. Signal Process., 2021
Exploration of Visual Features and their weighted-additive fusion for Video Captioning.
CoRR, 2021
Biomed. Signal Process. Control., 2021
Audio-Visual Biometric Recognition and Presentation Attack Detection: A Comprehensive Survey.
IEEE Access, 2021
Proceedings of the Speech and Computer - 23rd International Conference, 2021
Enhancing the Intelligibility of Cleft Lip and Palate Speech Using Cycle-Consistent Adversarial Networks.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021
Excitation Source Feature Based Dialect Identification in Ao - A Low Resource Language.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Alzheimer's Dementia Recognition Using Multimodal Fusion of Speech and Text Embeddings.
Proceedings of the Intelligent Human Computer Interaction, 2021
Exploring Multimodal Features and Fusion for Time-Continuous Prediction of Emotional Valence and Arousal.
Proceedings of the Intelligent Human Computer Interaction, 2021
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021
Significance of Data Augmentation for Improving Cleft Lip and Palate Speech Recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021
2020
Vowel Onset Point Based Screening of Misarticulated Stops in Cleft Lip and Palate Speech.
IEEE ACM Trans. Audio Speech Lang. Process., 2020
IEEE ACM Trans. Audio Speech Lang. Process., 2020
Speech Commun., 2020
Sinusoidal model-based hypernasality detection in cleft palate speech using CVCV sequence.
Speech Commun., 2020
Proceedings of the International Conference on Signal Processing and Communications, 2020
Overlapped/Non-Overlapped Speech Transition Point Detection Using Bag-of-Audio-Words.
Proceedings of the International Conference on Signal Processing and Communications, 2020
Proceedings of the 2020 National Conference on Communications, 2020
Analysis of Excitation Source Characteristics for Shouted and Normal Speech Classification.
Proceedings of the 2020 National Conference on Communications, 2020
Proceedings of The 12th Language Resources and Evaluation Conference, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Spectral Moment and Duration of Burst of Plosives in Speech of Children with Hearing Impairment and Typically Developing Children - A Comparative Study.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
2019
Detection of Nasalized Voiced Stops in Cleft Palate Speech Using Epoch-Synchronous Features.
IEEE ACM Trans. Audio Speech Lang. Process., 2019
Pattern Recognit. Lett., 2019
Pattern Recognit., 2019
Int. J. Speech Technol., 2019
An improved discriminative region selection methodology for online handwriting recognition.
Int. J. Document Anal. Recognit., 2019
Exploiting forced alignment of time-reversed data for improving HMM-based handwriting segmentation.
Expert Syst. Appl., 2019
Speech Enhancement Using Source Information for Phoneme Recognition of Speech with Background Music.
Circuits Syst. Signal Process., 2019
Investigating Text-Independent Speaker Verification Systems Under Varied Data Conditions.
Circuits Syst. Signal Process., 2019
Exploring Text-Constraint Models and Source Information for Long-Enrollment with Short-Test Speaker Verification.
Circuits Syst. Signal Process., 2019
Circuits Syst. Signal Process., 2019
Proceedings of the TENCON 2019, 2019
Proceedings of the TENCON 2019, 2019
Proceedings of the Pattern Recognition and Machine Intelligence, 2019
Proceedings of the 22nd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2019
Proceedings of the National Conference on Communications, 2019
Proceedings of the National Conference on Communications, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019
Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
2018
Significance of sonority information for voiced/unvoiced decision in speech synthesis.
Speech Commun., 2018
Speech Commun., 2018
Automatic syllabification of speech signal using short time energy and vowel onset points.
Int. J. Speech Technol., 2018
Significance of duration modification for speaker verification under mismatch speech tempo condition.
Int. J. Speech Technol., 2018
Int. J. Speech Technol., 2018
Expert Syst. Appl., 2018
Circuits Syst. Signal Process., 2018
End Point Detection Using Speech-Specific Knowledge for Text-Dependent Speaker Verification.
Circuits Syst. Signal Process., 2018
Proceedings of the TENCON 2018, 2018
Proceedings of the 2018 International Conference on Signal Processing and Communications (SPCOM), 2018
Proceedings of the 2018 International Conference on Signal Processing and Communications (SPCOM), 2018
Proceedings of the 6th Intl. Workshop on Spoken Language Technologies for Under-Resourced Languages, 2018
Processing Transition Regions of Glottal Stop Substituted /S/ for Intelligibility Enhancement of Cleft Palate Speech.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Detection of Glottal Activity Errors in Production of Stop Consonants in Children with Cleft Lip and Palate.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Epoch Extraction from Pathological Children Speech Using Single Pole Filtering Approach.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Self-similarity Matrix Based Intelligibility Assessment of Cleft Lip and Palate Speech.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
AGROASSAM: A Web Based Assamese Speech Recognition Application for Retrieving Agricultural Commodity Price and Weather Information.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Glotto Vibrato Graph: A Device and Method for Recording, Analysis and Visualization of Glottal Activity.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 24th International Conference on Pattern Recognition, 2018
Proceedings of the 16th International Conference on Frontiers in Handwriting Recognition, 2018
Proceedings of the 16th International Conference on Frontiers in Handwriting Recognition, 2018
Investigating Text-independent Speaker Verification from Practically Realizable System Perspective.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018
2017
Improvements in IITG Assamese Spoken Query System: Background Noise Suppression and Alternate Acoustic Modeling.
J. Signal Process. Syst., 2017
J. Signal Process. Syst., 2017
IEEE ACM Trans. Audio Speech Lang. Process., 2017
IEEE ACM Trans. Audio Speech Lang. Process., 2017
Speech Commun., 2017
Speech Commun., 2017
Consonant-vowel unit recognition using dominant aperiodic and transition region detection.
Speech Commun., 2017
Exploring kernel discriminant analysis for speaker verification with limited test data.
Pattern Recognit. Lett., 2017
Int. J. Speech Technol., 2017
Int. J. Speech Technol., 2017
Int. J. Speech Technol., 2017
Improved voicing decision using glottal activity features for statistical parametric speech synthesis.
Digit. Signal Process., 2017
Vowel onset point based characterization of velopharyngeal activity using imaging techniques.
Proceedings of the Twenty-third National Conference on Communications, 2017
Proceedings of the Twenty-third National Conference on Communications, 2017
Proceedings of the Twenty-third National Conference on Communications, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Hypernasality Severity Analysis in Cleft Lip and Palate Speech Using Vowel Space Area.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Phase Modeling Using Integrated Linear Prediction Residual for Statistical Parametric Speech Synthesis.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
2016
Foreground Speech Segmentation and Enhancement Using Glottal Closure Instants and Mel Cepstral Coefficients.
IEEE ACM Trans. Audio Speech Lang. Process., 2016
A better decomposition of speech obtained using modified Empirical Mode Decomposition.
Digit. Signal Process., 2016
Digit. Signal Process., 2016
Circuits Syst. Signal Process., 2016
Proceedings of the 2016 International Conference on Signal Processing and Communications (SPCOM), 2016
Frequency count based two stage classification for online handwritten character recognition.
Proceedings of the 2016 International Conference on Signal Processing and Communications (SPCOM), 2016
Significance of constraining text in limited data text-independent speaker verification.
Proceedings of the 2016 International Conference on Signal Processing and Communications (SPCOM), 2016
Speech Synthesis in Noisy Environment by Enhancing Strength of Excitation and Formant Prominence.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Exploring Session Variability and Template Aging in Speaker Verification for Fixed Phrase Short Utterances.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
2015
Low Complexity On-Line Adaptation Techniques in Context of Assamese Spoken Query System.
J. Signal Process. Syst., 2015
IEEE Signal Process. Lett., 2015
Processing of linear prediction residual in spectral and cepstral domains for speaker information.
Int. J. Speech Technol., 2015
Circuits Syst. Signal Process., 2015
Proceedings of the Twenty First National Conference on Communications, 2015
Proceedings of the Twenty First National Conference on Communications, 2015
Curvature point based HMM state prediction for online handwritten assamese strokes recognition.
Proceedings of the Twenty First National Conference on Communications, 2015
Proceedings of the Twenty First National Conference on Communications, 2015
Proceedings of the Twenty First National Conference on Communications, 2015
Comparison of assamese character recognizer using stroke level and character level engines.
Proceedings of the Twenty First National Conference on Communications, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
2014
IEEE Signal Process. Lett., 2014
Online Stroke and Akshara Recognition GUI in Assamese Language Using Hidden Markov Model.
CoRR, 2014
Proceedings of the Twentieth National Conference on Communications, 2014
Epochs based compression of LP residual for source modeling in text-to-speech synthesis.
Proceedings of the Twentieth National Conference on Communications, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the Blizzard Challenge 2014, Singapore, Singapore, September 19, 2014, 2014
2013
IEEE Trans. Speech Audio Process., 2013
Int. J. Speech Technol., 2013
Development and evaluation of online text-independent speaker verification system for remote person authentication.
Int. J. Speech Technol., 2013
Proceedings of the 2013 International Conference Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
2012
Int. J. Speech Technol., 2012
Int. J. Speech Technol., 2012
Int. J. Speech Technol., 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the Workshop on Document Analysis and Recognition, 2012
2011
Significance of Vowel-Like Regions for Speaker Verification Under Degraded Conditions.
IEEE ACM Trans. Audio Speech Lang. Process., 2011
Speech Commun., 2011
Recognition of consonant-vowel (CV) units under background noise using combined temporal and spectral preprocessing.
Int. J. Speech Technol., 2011
Int. J. Speech Technol., 2011
Int. J. Speech Technol., 2011
Subsegmental, segmental and suprasegmental processing of linear prediction residual for speaker information.
Int. J. Speech Technol., 2011
Expert Syst. Appl., 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Study of robustness of zero frequency resonator method for extraction of fundamental frequency.
Proceedings of the IEEE International Conference on Acoustics, 2011
Chain Code Histogram Based Facial Image Feature Extraction under Degraded Conditions.
Proceedings of the Advances in Computing and Communications, 2011
2010
Int. J. Speech Technol., 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Analysis of instantaneous F0 contours from two speakers mixed signal using zero frequency filtering.
Proceedings of the IEEE International Conference on Acoustics, 2010
2009
Vowel Onset Point Detection Using Source, Spectral Peaks, and Modulation Spectrum Energies.
IEEE Trans. Speech Audio Process., 2009
Speaker Recognition under Limited Data Condition using LVQ and GMM-UBM.
Proceedings of the 4th Indian International Conference on Artificial Intelligence, 2009
Significance of Word and Syllable Level Information for Expressive Speech Processing.
Proceedings of the Seventh International Conference on Advances in Pattern Recognition, 2009
2007
Determination of Instants of Significant Excitation in Speech Using Hilbert Envelope and Group Delay Function.
IEEE Signal Process. Lett., 2007
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Significance of Multimodal Biometric Systems.
Proceedings of the 3rd Indian International Conference on Artificial Intelligence, 2007
Speaker Recognition in Limited Data Conditions using Self-Organizing Map.
Proceedings of the 3rd Indian International Conference on Artificial Intelligence, 2007
Proceedings of the Frontiers in the Convergence of Bioscience and Information Technologies 2007, 2007
2006
Extraction of speaker-specific excitation information from linear prediction residual of speech.
Speech Commun., 2006
Proceedings of the Information Systems Security, Second International Conference, 2006
2005
Combining evidence from source, suprasegmental and spectral features for a fixed-text speaker verification system.
IEEE Trans. Speech Audio Process., 2005
IEEE Trans. Speech Audio Process., 2005
IEEE Trans. Speech Audio Process., 2005
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Text-Dependent Writer Identification using Word Length Analysis.
Proceedings of the 2nd Indian International Conference on Artificial Intelligence, 2005
2004
Proceedings of the Odyssey 2004: The Speaker and Language Recognition Workshop, Toledo, Spain, May 31, 2004
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Proceedings of the Neural Information Processing, 11th International Conference, 2004
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004
2003
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
2002
Proceedings of the IEEE International Conference on Acoustics, 2002
Proceedings of the IEEE International Conference on Acoustics, 2002
Proceedings of the IEEE International Conference on Acoustics, 2002