Hynek Boril

According to our database1, Hynek Boril authored at least 36 papers between 2004 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
GlitchOS: An Open Source Virtual Assistant.
Proceedings of the 14th IEEE Annual Computing and Communication Workshop and Conference, 2024

2018
On the issues of intra-speaker variability and realism in speech, speaker, and language recognition tasks.
Speech Commun., 2018

2017
Deep neural network training for whispered speech recognition using small databases and generative model sampling.
Int. J. Speech Technol., 2017

2016
Generative Modeling of Pseudo-Whisper for Robust Whispered Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2016

Robustness in Speech, Speaker, and Language Recognition: "You've Got to Know Your Limitations".
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

2015
A new front-end for classification of non-speech sounds: a study on human whistle.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Generative modeling of pseudo-target domain adaptation samples for whispered speech recognition.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2014
Automatic assessment of language background in toddlers through phonotactic and pitch pattern modeling of short vocalizations.
Proceedings of the 4st Workshop on Child, Computer and Interaction, 2014

Model and feature based compensation for whispered speech recognition.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

UT-Vocal Effort II: Analysis and constrained-lexicon recognition of whispered speech.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
Multi-modal highlight generation for sports videos using an information-theoretic excitability measure.
EURASIP J. Adv. Signal Process., 2013


Automatic regularization of cross-entropy cost for speaker recognition fusion.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

A preliminary study of child vocalization on a parallel corpus of US and shanghainese toddlers.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Supervector pre-processing for PRSVM-based Chinese and Arabic dialect identification.
Proceedings of the IEEE International Conference on Acoustics, 2013

An investigation on back-end for speaker recognition in multi-session enrollment.
Proceedings of the IEEE International Conference on Acoustics, 2013

CRSS systems for 2012 NIST Speaker Recognition Evaluation.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
TEO-based speaker stress assessment using hybrid classification and tracking schemes.
Int. J. Speech Technol., 2012

Arabic Dialect Identification - 'Is the Secret in the Silence?' and Other Observations.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

A comparison of front-end compensation strategies for robust LVCSR under room reverberation and increased vocal effort.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

A multi-modal highlight extraction scheme for sports videos using an information-theoretic excitability measure.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
Front-End Compensation Methods for LVCSR Under Lombard Effect.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

UT-Scope: Towards LVCSR under Lombard effect induced by varying types and levels of noisy background.
Proceedings of the IEEE International Conference on Acoustics, 2011

2010
Unsupervised Equalization of Lombard Effect for Speech Recognition in Noisy Adverse Environments.
IEEE Trans. Speech Audio Process., 2010

Analysis and detection of cognitive load and frustration in drivers' speech.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Automatic excitement-level detection for sports highlights generation.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Dialect distance assessment method based on comparison of pitch pattern statistical models.
Proceedings of the IEEE International Conference on Acoustics, 2010

Limited resource speech recognition for Nigerian English.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
Reduced complexity equalization of lombard effect for speech recognition in noisy adverse environments.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Assessment of speech dialog systems using multi-modal cognitive load analysis and driving performance metrics.
Proceedings of the IEEE International Conference on Vehicular Electronics and Safety, 2009

Unsupervised equalization of Lombard effect for speech recognition in noisy adverse environment.
Proceedings of the IEEE International Conference on Acoustics, 2009

2007
Two-stage system for robust neutral/lombard speech recognition.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

2006
Methodology of Lombard Speech Database Acquisition: Experiences with CLSD.
Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006

Data-driven design of front-end filter bank for Lombard speech recognition.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

2005
Design and collection of Czech Lombard speech database.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

2004
Direct time domain fundamental frequency estimation of speech in noisy conditions.
Proceedings of the 2004 12th European Signal Processing Conference, 2004


  Loading...