Maulik C. Madhavi

Orcid: 0000-0001-8743-6437

According to our database, Maulik C. Madhavi authored at least 42 papers between 2011 and 2021.

Bibliography

2021
Knowledge Distillation from BERT Transformer to Speech Transformer for Intent Classification.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Diagnosis of COVID-19 Using Auditory Acoustic Cues.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Leveraging Acoustic and Linguistic Embeddings from Pretrained Speech and Language Models for Intent Classification.
Proceedings of the IEEE International Conference on Acoustics, 2021

Multi-Target DoA Estimation with an Audio-Visual Fusion Mechanism.
Proceedings of the IEEE International Conference on Acoustics, 2021

Exploring Teacher-Student Learning Approach for Multi-Lingual Speech-to-Intent Classification.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020
Speaker-Utterance Dual Attention for Speaker and Utterance Verification.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Transformer-based Arabic Dialect Identification.
Proceedings of the International Conference on Asian Language Processing, 2020

Query-By-Example Spoken Term Detection Using Generative Adversarial Network.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

2019
Vocal Tract Length Normalization using a Gaussian mixture model framework for query-by-example spoken term detection.
Comput. Speech Lang., 2019

RSL2019: A Realistic Speech Localization Corpus.
Proceedings of the 22nd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2019

First Leap Towards Development of Dialogue System for Autonomous Bus.
Proceedings of the Increasing Naturalness and Flexibility in Spoken Dialogue Interaction, 2019

A Unified Framework for Speaker and Utterance Verification.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

2018
Combining evidences from magnitude and phase information using VTEO for person recognition using humming.
Comput. Speech Lang., 2018

Design of mixture of GMMs for Query-by-Example Spoken Term Detection.
Comput. Speech Lang., 2018

Unsupervised Vocal Tract Length Warped Posterior Features for Non-Parallel Voice Conversion.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Replay Spoof Detection using Power Function Based Features.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

Compensating Utterance Information in Fixed Phrase Speaker Verification.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

2017
Partial matching and search space reduction for QbE-STD.
Comput. Speech Lang., 2017

Spoken Keyword Retrieval Using Source and System Features.
Proceedings of the Pattern Recognition and Machine Intelligence, 2017

Two Stage Zero-resource Approaches for QbE-STD.
Proceedings of the Ninth International Conference on Advances in Pattern Recognition, 2017

VTLN-warped Gaussian posteriorgram for QbE-STD.
Proceedings of the 25th European Signal Processing Conference, 2017

Combining evidences from detection sources for query-by-example spoken term detection.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

2016
Modification in sequential dynamic time warping for fast computation of query-by-example spoken term detection task.
Proceedings of the 2016 International Conference on Signal Processing and Communications (SPCOM), 2016

Native Language Identification Using Spectral and Source-Based Features.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

2015
Vocal Tract Length Normalization Features for Audio Search.
Proceedings of the Text, Speech, and Dialogue - 18th International Conference, 2015

Significance of Phase-based Features for Person Recognition Using Humming.
Proceedings of the 2nd International Conference on Perception and Machine Intelligence, 2015

Spectral transition measure for detection of obstruents.
Proceedings of the 23rd European Signal Processing Conference, 2015

2014
Development of vocal tract length normalized phonetic engine for Gujarati and Marathi languages.
Proceedings of the 2014 17th Oriental Chapter of the International Committee for the Co-ordination and Standardization of Speech Databases and Assessment Techniques (COCOSDA), 2014

Exploiting speech source information for vowel landmark detection for low resource language.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Deterministic annealing EM algorithm for developing TTS system in Gujarati.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Exploiting Variable length Teager Energy Operator in melcepstral features for person recognition from humming.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

A spectral transition measure based MELCEPSTRAL features for obstruent detection.
Proceedings of the 2014 International Conference on Asian Language Processing, 2014

Vocal tract length normalization for vowel recognition in low resource languages.
Proceedings of the 2014 International Conference on Asian Language Processing, 2014

Development of language resources for speech application in Gujarati and Marathi.
Proceedings of the 2014 International Conference on Asian Language Processing, 2014

2013
Speaker Recognition Using Sparse Representation via Superimposed Features.
Proceedings of the Pattern Recognition and Machine Intelligence, 2013

Development of speech corpora in Gujarati and Marathi for phonetic transcription.
Proceedings of the 2013 International Conference Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2013

2012
Static and dynamic information derived from source and system features for person recognition from humming.
Int. J. Speech Technol., 2012

Combining Evidence from Temporal and Spectral Features for Person Recognition Using Humming.
Proceedings of the Perception and Machine Intelligence - First Indo-Japan Conference, 2012

Significance of magnitude and phase information via VTEO for humming based biometrics.
Proceedings of the 5th IAPR International Conference on Biometrics, 2012

Phonetic Transcription of Fricatives and Plosives for Gujarati and Marathi Languages.
Proceedings of the 2012 International Conference on Asian Language Processing, 2012

Person Recognition Using Humming, Singing and Speech.
Proceedings of the 2012 International Conference on Asian Language Processing, 2012

2011
Combining Evidence from Spectral and Source-Like Features for Person Recognition from Humming.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

