Hynek Boril

Mehdi Roopaei

Proceedings of the 14th IEEE Annual Computing and Communication Workshop and Conference, 2024

2018

On the issues of intra-speaker variability and realism in speech, speaker, and language recognition tasks.

Speech Commun., 2018

2017

Deep neural network training for whispered speech recognition using small databases and generative model sampling.

Int. J. Speech Technol., 2017

2016

Generative Modeling of Pseudo-Whisper for Robust Whispered Speech Recognition.

IEEE ACM Trans. Audio Speech Lang. Process., 2016

Robustness in Speech, Speaker, and Language Recognition: "You've Got to Know Your Limitations".

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

2015

A new front-end for classification of non-speech sounds: a study on human whistle.

Mahesh Kumar Nandwana

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Generative modeling of pseudo-target domain adaptation samples for whispered speech recognition.

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2014

Automatic assessment of language background in toddlers through phonotactic and pitch pattern modeling of short vocalizations.

Proceedings of the 4th Workshop on Child, Computer and Interaction, 2014

Model and feature based compensation for whispered speech recognition.

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

UT-Vocal Effort II: Analysis and constrained-lexicon recognition of whispered speech.

Proceedings of the IEEE International Conference on Acoustics, 2014

2013

Multi-modal highlight generation for sports videos using an information-theoretic excitability measure.

EURASIP J. Adv. Signal Process., 2013

I4U submission to NIST SRE 2012: a large-scale collaborative effort for noise-robust speaker verification.

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Automatic regularization of cross-entropy cost for speaker recognition fusion.

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

A preliminary study of child vocalization on a parallel corpus of US and Shanghainese toddlers.

Qian Zhang

Pongtep Angkititrakul

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Supervector pre-processing for PRSVM-based Chinese and Arabic dialect identification.

Qian Zhang

Proceedings of the IEEE International Conference on Acoustics, 2013

An investigation on back-end for speaker recognition in multi-session enrollment.

Proceedings of the IEEE International Conference on Acoustics, 2013

CRSS systems for 2012 NIST Speaker Recognition Evaluation.

Proceedings of the IEEE International Conference on Acoustics, 2013

2012

TEO-based speaker stress assessment using hybrid classification and tracking schemes.

Int. J. Speech Technol., 2012

Arabic Dialect Identification - 'Is the Secret in the Silence?' and Other Observations.

Abhijeet Sangwan

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

A comparison of front-end compensation strategies for robust LVCSR under room reverberation and increased vocal effort.

Seyed Omid Sadjadi

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

A multi-modal highlight extraction scheme for sports videos using an information-theoretic excitability measure.

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011

Front-End Compensation Methods for LVCSR Under Lombard Effect.

Frantisek Grézl

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

UT-Scope: Towards LVCSR under Lombard effect induced by varying types and levels of noisy background.

Proceedings of the IEEE International Conference on Acoustics, 2011

2010

Unsupervised Equalization of Lombard Effect for Speech Recognition in Noisy Adverse Environments.

IEEE Trans. Speech Audio Process., 2010

Analysis and detection of cognitive load and frustration in drivers' speech.

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Automatic excitement-level detection for sports highlights generation.

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Dialect distance assessment method based on comparison of pitch pattern statistical models.

Mahnoosh Mehrabani

Proceedings of the IEEE International Conference on Acoustics, 2010

Limited resource speech recognition for Nigerian English.

Proceedings of the IEEE International Conference on Acoustics, 2010

2009

Reduced complexity equalization of Lombard effect for speech recognition in noisy adverse environments.

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Assessment of speech dialog systems using multi-modal cognitive load analysis and driving performance metrics.

Proceedings of the IEEE International Conference on Vehicular Electronics and Safety, 2009

Unsupervised equalization of Lombard effect for speech recognition in noisy adverse environment.

Proceedings of the IEEE International Conference on Acoustics, 2009

2007

Two-stage system for robust neutral/Lombard speech recognition.

Petr Fousek

Harald Höge

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

2006

Methodology of Lombard Speech Database Acquisition: Experiences with CLSD.

Tomás Boril

Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006

Data-driven design of front-end filter bank for Lombard speech recognition.

Petr Fousek

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

2005

Design and collection of Czech Lombard speech database.

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

2004

Direct time domain fundamental frequency estimation of speech in noisy conditions.
