Hemant Kumar Kathania

Orcid: 0000-0002-6367-5203

According to our database1, Hemant Kumar Kathania authored at least 31 papers between 2017 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Spectral warping based data augmentation for low resource children's speaker verification.
Multim. Tools Appl., May, 2024

ResEmoteNet: Bridging Accuracy and Loss Reduction in Facial Emotion Recognition.
CoRR, 2024

Effect of Speech Modification on Wav2Vec2 Models for Children Speech Recognition.
Proceedings of the International Conference on Signal Processing and Communications, 2024

Role of Acoustics and Prosodic Features for Children's Age Classification.
Proceedings of the International Conference on Signal Processing and Communications, 2024

2023
Gammatone-Filterbank Based Pitch-Normalized Cepstral Coefficients for Zero-Resource Children's ASR.
Proceedings of the Speech and Computer - 25th International Conference, 2023

Effect of Linear Prediction Order to Modify Formant Locations for Children Speech Recognition.
Proceedings of the Speech and Computer - 25th International Conference, 2023

2022
Data Augmentation Using Spectral Warping for Low Resource Children ASR.
J. Signal Process. Syst., December, 2022

A formant modification method for improved ASR of children's speech.
Speech Commun., 2022

End-to-end Ensemble-based Feature Selection for Paralinguistics Tasks.
CoRR, 2022

2021
Synthesis Speech Based Data Augmentation for Low Resource Children ASR.
Proceedings of the Speech and Computer - 23rd International Conference, 2021

Spectral modification for recognition of children's speech undermismatched conditions.
Proceedings of the 23rd Nordic Conference on Computational Linguistics, 2021

Speaker Verification Experiments for Adults and Children Using Shared Embedding Spaces.
Proceedings of the 23rd Nordic Conference on Computational Linguistics, 2021

Vowel Non-Vowel Based Spectral Warping and Time Scale Modification for Improvement in Children's ASR.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Creating speaker independent ASR system through prosody modification based data augmentation.
Pattern Recognit. Lett., 2020

Aalto's End-to-End DNN systems for the INTERSPEECH 2020 Computational Paralinguistics Challenge.
CoRR, 2020

Data Augmentation Using Prosody and False Starts to Recognize Non-Native Children's Speech.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Study of Formant Modification for Children ASR.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Developing speaker independent ASR system using limited data through prosody modification based on fuzzy classification of spectral bins.
Digit. Signal Process., 2019

Role of Linear, Mel and Inverse-Mel Filterbanks in Automatic Recognition of Speech from High-Pitched Speakers.
Circuits Syst. Signal Process., 2019

Speaking-Rate Adaptation of Automatic Speech Recognition System through Fuzzy Classification based Time-Scale Modification.
Proceedings of the National Conference on Communications, 2019

On the Role of Linear, Mel and Inverse-Mel Filterbank in the Context of Automatic Speech Recognition.
Proceedings of the National Conference on Communications, 2019

2018
Improving children's mismatched ASR using structured low-rank feature projection.
Speech Commun., 2018

Studying the role of pitch-adaptive spectral estimation and speaking-rate normalization in automatic speech recognition.
Digit. Signal Process., 2018

An Experimental Study on the Significance of Variable Frame-Length and Overlap in the Context of Children's Speech Recognition.
Circuits Syst. Signal Process., 2018

Explicit Pitch Mapping for Improved Children's Speech Recognition.
Circuits Syst. Signal Process., 2018

Exploring the Role of Speaking-Rate Adaptation on Children's Speech Recognition.
Proceedings of the 2018 International Conference on Signal Processing and Communications (SPCOM), 2018

Improving Children's Speech Recognition Through Time Scale Modification Based Speaking Rate Adaptation.
Proceedings of the 2018 International Conference on Signal Processing and Communications (SPCOM), 2018

Role of Prosodic Features on Children's Speech Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Effect of Prosody Modification on Children's ASR.
IEEE Signal Process. Lett., 2017

Improving children speech recognition in acoustically mismatched condition using eigenvoices and feature projections.
Proceedings of the Twenty-third National Conference on Communications, 2017

Improving Children's Speech Recognition Through Explicit Pitch Scaling Based on Iterative Spectrogram Inversion.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017


  Loading...