Hemant Kumar Kathania

Virender Kadyan

Multim. Tools Appl., May, 2024

ResEmoteNet: Bridging Accuracy and Loss Reduction in Facial Emotion Recognition.

Arnab Kumar Roy

Adhitiya Sharma

Abhishek Dey

Md. Sarfaraj Alam Ansari

CoRR, 2024

Effect of Speech Modification on Wav2Vec2 Models for Children Speech Recognition.

Abhijit Sinha

Proceedings of the International Conference on Signal Processing and Communications, 2024

Role of Acoustics and Prosodic Features for Children's Age Classification.

Vishakha Kumari

Abhijit Sinha

Proceedings of the International Conference on Signal Processing and Communications, 2024

2023

Gammatone-Filterbank Based Pitch-Normalized Cepstral Coefficients for Zero-Resource Children's ASR.

Ankita

Avinash Kumar

Proceedings of the Speech and Computer - 25th International Conference, 2023

Effect of Linear Prediction Order to Modify Formant Locations for Children Speech Recognition.

Udara Laxman Kumar

Proceedings of the Speech and Computer - 25th International Conference, 2023

2022

Data Augmentation Using Spectral Warping for Low Resource Children ASR.

Virender Kadyan

J. Signal Process. Syst., December, 2022

A formant modification method for improved ASR of children's speech.

Paavo Alku

Speech Commun., 2022

End-to-end Ensemble-based Feature Selection for Paralinguistics Tasks.

Tamás Grósz

CoRR, 2022

2021

Synthesis Speech Based Data Augmentation for Low Resource Children ASR.

Virender Kadyan

Prajjval Govil

Proceedings of the Speech and Computer - 23rd International Conference, 2021

Spectral modification for recognition of children's speech under mismatched conditions.

Paavo Alku

Proceedings of the 23rd Nordic Conference on Computational Linguistics, 2021

Speaker Verification Experiments for Adults and Children Using Shared Embedding Spaces.

Tuomas Kaseva

Aku Rouhe

Proceedings of the 23rd Nordic Conference on Computational Linguistics, 2021

Vowel Non-Vowel Based Spectral Warping and Time Scale Modification for Improvement in Children's ASR.

Avinash Kumar

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2021

2020

Creating speaker independent ASR system through prosody modification based data augmentation.

B. Tarun Sai

Pattern Recognit. Lett., 2020

Aalto's End-to-End DNN systems for the INTERSPEECH 2020 Computational Paralinguistics Challenge.

Tamás Grósz

CoRR, 2020

Data Augmentation Using Prosody and False Starts to Recognize Non-Native Children's Speech.

Tamás Grósz

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Study of Formant Modification for Children ASR.

Paavo Alku

Proceedings of the 2020 IEEE International Conference on Acoustics, Speech and Signal Processing, 2020

2019

Developing speaker independent ASR system using limited data through prosody modification based on fuzzy classification of spectral bins.

Digit. Signal Process., 2019

Role of Linear, Mel and Inverse-Mel Filterbanks in Automatic Recognition of Speech from High-Pitched Speakers.

Circuits Syst. Signal Process., 2019

Speaking-Rate Adaptation of Automatic Speech Recognition System through Fuzzy Classification based Time-Scale Modification.

B. Tarun Sai

Proceedings of the National Conference on Communications, 2019

On the Role of Linear, Mel and Inverse-Mel Filterbank in the Context of Automatic Speech Recognition.

Proceedings of the National Conference on Communications, 2019

2018

Improving children's mismatched ASR using structured low-rank feature projection.

Abhishek Dey

Rohit Sinha

Speech Commun., 2018

Studying the role of pitch-adaptive spectral estimation and speaking-rate normalization in automatic speech recognition.

Rohit Sinha

Digit. Signal Process., 2018

An Experimental Study on the Significance of Variable Frame-Length and Overlap in the Context of Children's Speech Recognition.

Chaman Singh

Circuits Syst. Signal Process., 2018

Explicit Pitch Mapping for Improved Children's Speech Recognition.

Arun B. Samaddar

Circuits Syst. Signal Process., 2018

Exploring the Role of Speaking-Rate Adaptation on Children's Speech Recognition.

Chaman Singh

Proceedings of the 2018 International Conference on Signal Processing and Communications (SPCOM), 2018

Improving Children's Speech Recognition Through Time Scale Modification Based Speaking Rate Adaptation.

Proceedings of the 2018 International Conference on Signal Processing and Communications (SPCOM), 2018

Role of Prosodic Features on Children's Speech Recognition.

Proceedings of the 2018 IEEE International Conference on Acoustics, Speech and Signal Processing, 2018

2017

Effect of Prosody Modification on Children's ASR.

IEEE Signal Process. Lett., 2017

Improving children speech recognition in acoustically mismatched condition using eigenvoices and feature projections.

Rohit Sinha

Proceedings of the Twenty-third National Conference on Communications, 2017

Improving Children's Speech Recognition Through Explicit Pitch Scaling Based on Iterative Spectrogram Inversion.
