Suryakanth V. Gangashetty

Orcid: 0000-0001-6745-4363

According to our database, Suryakanth V. Gangashetty authored at least 95 papers between 2002 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Variational mode decomposition based features for detection of hypernasality in cleft palate speech.
Biomed. Signal Process. Control., 2024

Significance of Variational Mode Decomposition for Epoch Based Prosody Modification of Speech With Clipping Distortions.
IEEE Access, 2024

2023
Epoch Extraction from Telephonic Speech Signal using Stockwell Transform.
Circuits Syst. Signal Process., July, 2023

SPRING-INX: A Multilingual Indian Language Speech Corpus by SPRING Lab, IIT Madras.
CoRR, 2023

Significance of Duration Modification in Reducing Listening Effort of Slurred Speech from Patients with Traumatic Brain Injury.
Proceedings of the Speech and Computer - 25th International Conference, 2023

Improved Epoch Based Prosody Modification by Zero Frequency Filtering of Gabor Filtered Telephonic Speech.
Proceedings of the 28th National Conference on Communications, 2023

2022
Resources and Benchmarks for Keyword Search in Spoken Audio From Low-Resource Indian Languages.
IEEE Access, 2022

Significance of Dimensionality Reduction in CNN-Based Vowel Classification from Imagined Speech Using Electroencephalogram Signals.
Proceedings of the Speech and Computer - 24th International Conference, 2022

2021
NMF-weighted SRP for multi-speaker direction of arrival estimation: robustness to spatial aliasing while exploiting sparsity in the atom-time domain.
EURASIP J. Audio Speech Music. Process., 2021

A novel stacking technique for prediction of diabetes.
Comput. Biol. Medicine, 2021

Analysis of Glottal Activity of High Arousal and Falsetto Voice.
Proceedings of the 2021 Workshop on Speech, Music and Mind, 2021

Contribution of F0 Contour Level, F0 Contour Shape and Durations Towards Perception of Lombard Speech.
Proceedings of the 2021 Workshop on Speech, Music and Mind, 2021

Raga Classification in Carnatic Music Using Audio Thumbnailing.
Proceedings of the Pattern Recognition and Machine Intelligence, 2021

A Generative Adversarial Network based Training Framework for Robust TTS in Noisy Environment.
Proceedings of the IC3 2021: Thirteenth International Conference on Contemporary Computing, Noida, India, August 5, 2021

2020
Learning Document Embeddings Along With Their Uncertainties.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Excitation Features of Speech for Emotion Recognition Using Neutral Speech as Reference.
Circuits Syst. Signal Process., 2020

Mel-Weighted Single Frequency Filtering Spectrogram for Dialect Identification.
IEEE Access, 2020

Zero-Time Windowing Cepstral Coefficients for Dialect Classification.
Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020

Spectral Features derived from Single Frequency Filter for Multispeaker Localization.
Proceedings of the 2020 National Conference on Communications, 2020

IIIT-H TEMD Semi-Natural Emotional Speech Database from Professional Actors and Non-Actors.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Acoustic Scene Classification using Single Frequency Filtering Cepstral Coefficients and DNN.
Proceedings of the 2020 International Joint Conference on Neural Networks, 2020

Learning Filterbanks from Raw Waveform for Accent Classification.
Proceedings of the 2020 International Joint Conference on Neural Networks, 2020

Speech Based Access of Kisan Information System in Telugu Language.
Proceedings of the Intelligent Human Computer Interaction, 2020

Study of Closed Phase Resonance Bandwidths for Oral and Nasal Tracts Using Zero Time Windowing.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Excitation Source and Vocal Tract System Based Acoustic Features for Detection of Nasals in Continuous Speech.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

An Approach to Cross-Lingual Voice Conversion.
Proceedings of the International Joint Conference on Neural Networks, 2019

Epoch Extraction from Speech Signals Using Temporal and Spectral Cues by Exploiting Harmonic Structure of Impulse-like Excitations.
Proceedings of the IEEE International Conference on Acoustics, 2019

A New Weighted NMF Algorithm For Missing Data Interpolation And Its Application To Speech Enhancement.
Proceedings of the 27th European Signal Processing Conference, 2019

2018
Improved vowel region detection from a continuous speech using post processing of vowel onset points and vowel end-points.
Multim. Tools Appl., 2018

Combining evidences from excitation source and vocal tract system features for Indian language identification using deep neural networks.
Int. J. Speech Technol., 2018

Estimation of Vocal Tract Resonances Using Spectral Prominent Regions and Artificial Neural Networks.
Circuits Syst. Signal Process., 2018

Decision Level Fusion based Approach for Indian Languages Identification using Deep Neural Network.
Proceedings of the TENCON 2018, 2018

Time-frequency spectral error for analysis of high arousal speech.
Proceedings of the 2018 Workshop on Speech, Music and Mind, 2018

Development of IIITH Hindi-English Code Mixed Speech Database.
Proceedings of the 6th Intl. Workshop on Spoken Language Technologies for Under-Resourced Languages, 2018

Phonetically Balanced Code-Mixed Speech Corpus for Hindi-English Automatic Speech Recognition.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Discriminating Nasals and Approximants in English Language Using Zero Time Windowing.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

DNN based Acoustic Scene Classification using Score Fusion of MFCC and Inverse MFCC.
Proceedings of the 13th IEEE International Conference on Industrial and Information Systems, 2018

Input Fusion of MFCC and SCMC Features for Acoustic Scene Classification using DNN.
Proceedings of the 13th IEEE International Conference on Industrial and Information Systems, 2018

2017
Deep Elman recurrent neural networks for statistical parametric speech synthesis.
Speech Commun., 2017

Significance of DNN-AM for Multimodal Sentiment Analysis.
Proceedings of the Mining Intelligence and Knowledge Exploration, 2017

Locating Burst Onsets Using SFF Envelope and Phase Information.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Detection of Replay Attacks Using Single Frequency Filtering Cepstral Coefficients.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

SFF Anti-Spoofer: IIIT-H Submission for Automatic Speaker Verification Spoofing and Countermeasures Challenge 2017.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Towards developing a phonetically balanced code-mixed speech corpus for Hindi-English ASR.
Proceedings of the 14th International Conference on Natural Language Processing, 2017

Topic identification of spoken documents using unsupervised acoustic unit discovery.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Sentiment analysis using relative prosody features.
Proceedings of the Tenth International Conference on Contemporary Computing, 2017

Adapting monolingual resources for code-mixed hindi-english speech recognition.
Proceedings of the 2017 International Conference on Asian Language Processing, 2017

IIITH Submission for Blizzard Challenge 2017: A BLSTM based SPSS System using MatNN.
Proceedings of the Blizzard Challenge 2017, Stockholm, Sweden, August 25, 2017, 2017

Importance of non-uniform prosody modification for speech recognition in emotion conditions.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

Acoustic analysis of infant cry signal towards automatic detection of the cause of crying.
Proceedings of the Seventh International Conference on Affective Computing and Intelligent Interaction Workshops and Demos, 2017

2016
Developing a unit selection voice given audio without corresponding text.
EURASIP J. Audio Speech Music. Process., 2016

Statistical Parametric Speech Synthesis Using Bottleneck Representation From Sequence Auto-encoder.
CoRR, 2016

Contextual Representation using Recurrent Neural Network Hidden State for Statistical Parametric Speech Synthesis.
Proceedings of the 9th ISCA Speech Synthesis Workshop, 2016

Investigating Signal Correlation as Continuity Metric in a Syllable Based Unit Selection Synthesis System.
Proceedings of the Speech and Computer - 18th International Conference, 2016

A Study on Vowel Region Detection from a Continuous Speech.
Proceedings of the Mining Intelligence and Knowledge Exploration, 2016

A Study on Text-Independent Speaker Recognition Systems in Emotional Conditions Using Different Pattern Recognition Models.
Proceedings of the Mining Intelligence and Knowledge Exploration, 2016

Multimodal Sentiment Analysis Using Deep Neural Networks.
Proceedings of the Mining Intelligence and Knowledge Exploration, 2016

An Investigation of Recurrent Neural Network Architectures Using Word Embeddings for Phrase Break Prediction.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

An Investigation of Deep Neural Network Architectures for Language Recognition in Indian Languages.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Robust Vowel Landmark Detection Using Epoch-Based Features.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Use of Vowels in Discriminating Speech-Laugh from Laughter and Neutral Speech.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Analysis of sequence to sequence neural networks on grapheme to phoneme conversion task.
Proceedings of the 2016 International Joint Conference on Neural Networks, 2016

IIIT Hyderabad's entry to Blizzard Challenge 2016.
Proceedings of the Blizzard Challenge 2016, Cupertino, CA, USA, September 16, 2016, 2016

2015
Significance of Maximum Spectral Amplitude in Sub-bands for Spectral Envelope Estimation and Its Application to Statistical Parametric Speech Synthesis.
CoRR, 2015

Significance of Emotionally Significant Regions of Speech for Emotive to Neutral Conversion.
Proceedings of the Mining Intelligence and Knowledge Exploration, 2015

Improved Language Identification in Presence of Speech Coding.
Proceedings of the Mining Intelligence and Knowledge Exploration, 2015

Analysis of excitation source features of speech for emotion recognition.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Robust features for sonorant segmentation in continuous speech.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

An investigation of recurrent neural network architectures for statistical parametric speech synthesis.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

IIIT Hyderabad's submission to the Blizzard Challenge 2015.
Proceedings of the Blizzard Challenge 2015, 2015


Learning continuous representation of text for phone duration modeling in statistical parametric speech synthesis.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014
Excitation source features for discrimination of anger and happy emotions.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Analysis of laughter and speech-laugh signals using excitation source information.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
Analysis of Acoustic Events in Speech Signals Using Bessel Series Expansion.
Circuits Syst. Signal Process., 2013

2012
Nonlinear principal component analysis for seismic data compression.
Proceedings of the 1st International Conference on Recent Advances in Information Technology, 2012

Edge detection for facial images under noisy conditions.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

Integration of Multimodal Interaction as Assistance in Virtual Environments.
Proceedings of the 1st Workshop on Speech and Multimodal Interaction in Assistive Environments, 2012

2011
Exploring Bessel Features for Detection of Glottal Closure Instants.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Fourier-Bessel based Cepstral Coefficient Features for Text-Independent Speaker Identification.
Proceedings of the 5th Indian International Conference on Artificial Intelligence, 2011

Decomposition of speech signals for analysis of aperiodic components of excitation.
Proceedings of the IEEE International Conference on Acoustics, 2011

Acoustic-phonetic information from excitation source for refining manner hypotheses of a phone recognizer.
Proceedings of the IEEE International Conference on Acoustics, 2011

Games for Academic Vocabulary Learning through a Virtual Environment.
Proceedings of the International Conference on Asian Language Processing, 2011

Multilayer Feedforward Neural Network Models for Pattern Recognition Tasks in Earthquake Engineering.
Proceedings of the Advanced Computing, Networking and Security - International Conference, 2011

2010
AM-FM model based approach for detection of glottal closure instants.
Proceedings of the 10th International Conference on Information Sciences, 2010

Detection of voice onset time using FB expansion and AM-FM model.
Proceedings of the 10th International Conference on Information Sciences, 2010

Significance of pitch synchronous analysis for speaker recognition using AANN models.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

2009
Analysis of Lombard speech using excitation source information.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Vocal Tract System Characteristics for Studies on Vowels.
Proceedings of the 4th Indian International Conference on Artificial Intelligence, 2009

2007
Application of Euclidean Distance in Optimal Coupling used in Text to Speech Synthesis.
Proceedings of the 3rd Indian International Conference on Artificial Intelligence, 2007

2005
Spotting Multilingual Consonant-Vowel Units of Speech Using Neural Network Models.
Proceedings of the Nonlinear Analyses and Algorithms for Speech Processing, 2005

Support Vector Machines for Face Recognition.
Proceedings of the 2nd Indian International Conference on Artificial Intelligence, 2005

2004
Detection of vowel onset points in continuous speech using autoassociative neural network models.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

2003
Constraint satisfaction model for enhancement of evidence in recognition of consonant-vowel utterances.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2002
Linear and nonlinear compression of feature vectors for speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2002

