2023
BUT CHiME-7 system description.
CoRR, 2023
2022
Speech and Natural Language Processing Technologies for Pseudo-Pilot Simulator.
CoRR, 2022
ATCO2 corpus: A Large-Scale Dataset for Research on Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communications.
,
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2022
Call-Sign Recognition and Understanding for Noisy Air-Traffic Transcripts Using Surveillance Information.
Proceedings of the IEEE International Conference on Acoustics, 2022
BCN2BRNO: ASR System Fusion for Albayzin 2022 Speech to Text Challenge.
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the 6th International Conference, 2022
2021
Contextual Semi-Supervised Learning: An Approach to Leverage Air-Surveillance and Untranscribed ATC Data in ASR Systems.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Detecting English Speech in the Air Traffic Control Voice Communication.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Boosting of Contextual Information in ASR for Air-Traffic Call-Sign Recognition.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Analysis of X-Vectors for Low-Resource Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021
BCN2BRNO: ASR System Fusion for Albayzin 2020 Speech to Text Challenge.
Proceedings of the Fifth International Conference, 2021
2020
BUT Opensat 2019 Speech Recognition System.
CoRR, 2020
Automatic Speech Recognition Benchmark for Air-Traffic Communications.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
BUT Text-Dependent Speaker Verification System for SdSV Challenge 2020.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
SoapBox Labs Fluency Assessment Platform for Child Speech.
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Soapbox Labs Verification Platform for Child Speech.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
2018
"Semi-supervised" trénování hlubokých neuronových sítí pro rozpoznávání řeči ; Semi-Supervised Training of Deep Neural Networks for Speech Recognition.
PhD thesis, 2018
Residual Memory Networks: Feed-forward approach to learn long temporal dependencies.
CoRR, 2018
Lightly Supervised vs. Semi-supervised Training of Acoustic Model on Luxembourgish for Low-resource Automatic Speech Recognition.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
BUT OpenSAT 2017 Speech Recognition System.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
BUT System for DIHARD Speech Diarization Challenge 2018.
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Analysis of Multilingual Blstm Acoustic Model on Low and High Resource Languages.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
2017
Multilingually trained bottleneck features in spoken language recognition.
Comput. Speech Lang., 2017
Semi-Supervised DNN Training with Word Selection for ASR.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
2016 BUT Babel System: Multilingual BLSTM Acoustic Model with i-Vector Based Adaptation.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Deep Auto-Encoder Based Multi-Task Learning Using Probabilistic Transcriptions.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017
Residual memory networks: Feed-forward approach to learn long-term temporal dependencies.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017
MGB-3 but system: Low-resource ASR on Egyptian YouTube data.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017
Training Data Augmentation and Data Selection.
Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning., 2017
2016
Multilingual BLSTM and speaker-specific vector adaptation in 2016 but babel system.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016
Data Selection by Sequence Summarizing Neural Network in Mismatch Condition Training.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Sequence summarizing neural network for speaker adaptation.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
Multilingual region-dependent transforms.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
2015
DNN derived filters for processing of modulation spectrum of speech.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Autoencoder based multi-stream combination for noise robust speech recognition.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Towards machines that know when they do not know: Summary of work done at 2014 Frederick Jelinek Memorial Workshop.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
2014
But ASR system for BABEL Surprise evaluation 2014.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014
Progress in the BBN keyword search system for the DARPA RATS program.
,
,
,
,
,
,
,
,
,
,
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
BUT 2014 Babel system: analysis of adaptation in NN based systems.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Adaptation of multilingual stacked bottle-neck neural network structure for new language.
Proceedings of the IEEE International Conference on Acoustics, 2014
2013
Sequence-discriminative training of deep neural networks.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Improved feature processing for deep neural networks.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
BUT BABEL system for spontaneous Cantonese.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Manual and semi-automatic approaches to building a multilingual phoneme set.
Proceedings of the IEEE International Conference on Acoustics, 2013
Semi-supervised training of Deep Neural Networks.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013
Score normalization and system combination for improved keyword spotting.
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013
2012
The language-independent bottleneck features.
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012
BUT2012 Approaches for Spoken Web Search - MediaEval 2012.
Proceedings of the Working Notes Proceedings of the MediaEval 2012 Workshop, 2012
Developing a Speech Activity Detection System for the DARPA RATS Program.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Patrol Team Language Identification System for DARPA RATS P1 Evaluation.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Generating exact lattices in the WFST framework.
,
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012
2011
New system theory and its impact on control theory.
Int. J. Gen. Syst., 2011
Convolutive Bottleneck Network features for LVCSR.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011
2010
Parallel training of neural networks for speech recognition.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010