Igor Szöke

CoRR, 2022

Call-Sign Recognition and Understanding for Noisy Air-Traffic Transcripts Using Surveillance Information.

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

BCN2BRNO: ASR System Fusion for Albayzin 2022 Speech to Text Challenge.

[DOI]

Proceedings of the 6th International Conference, 2022

2021

Contextual Semi-Supervised Learning: An Approach to Leverage Air-Surveillance and Untranscribed ATC Data in ASR Systems.

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Detecting English Speech in the Air Traffic Control Voice Communication.

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Boosting of Contextual Information in ASR for Air-Traffic Call-Sign Recognition.

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Analysis of X-Vectors for Low-Resource Speech Recognition.

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

BCN2BRNO: ASR System Fusion for Albayzin 2020 Speech to Text Challenge.

[DOI]

Proceedings of the Fifth International Conference, 2021

2020

BUT OpenSAT 2019 Speech Recognition System.

[DOI]

CoRR, 2020

Automatic Speech Recognition Benchmark for Air-Traffic Communications.

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

BUT Text-Dependent Speaker Verification System for SdSV Challenge 2020.

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

SoapBox Labs Fluency Assessment Platform for Child Speech.

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

SoapBox Labs Verification Platform for Child Speech.

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

2018

"Semi-supervised" trénování hlubokých neuronových sítí pro rozpoznávání řeči ; Semi-Supervised Training of Deep Neural Networks for Speech Recognition.

[DOI]

PhD thesis, 2018

Residual Memory Networks: Feed-forward approach to learn long temporal dependencies.

[DOI]

CoRR, 2018

Lightly Supervised vs. Semi-supervised Training of Acoustic Model on Luxembourgish for Low-resource Automatic Speech Recognition.

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

BUT OpenSAT 2017 Speech Recognition System.

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

BUT System for DIHARD Speech Diarization Challenge 2018.

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Analysis of Multilingual BLSTM Acoustic Model on Low and High Resource Languages.

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017

Multilingually trained bottleneck features in spoken language recognition.

[DOI]

Comput. Speech Lang., 2017

Semi-Supervised DNN Training with Word Selection for ASR.

[DOI]

Lukáš Burget

Jan Černocký

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

2016 BUT Babel System: Multilingual BLSTM Acoustic Model with i-Vector Based Adaptation.

[DOI]

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Deep Auto-Encoder Based Multi-Task Learning Using Probabilistic Transcriptions.

[DOI]

Amit Das

Mark Hasegawa-Johnson

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Residual memory networks: Feed-forward approach to learn long-term temporal dependencies.

[DOI]

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

MGB-3 BUT system: Low-resource ASR on Egyptian YouTube data.

[DOI]

Mireia Díez

Karel Beneš

Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

Training Data Augmentation and Data Selection.

[DOI]

Proceedings of the New Era for Robust Speech Recognition, Exploiting Deep Learning, 2017

2016

Multilingual BLSTM and speaker-specific vector adaptation in 2016 BUT Babel system.

[DOI]

Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

Data Selection by Sequence Summarizing Neural Network in Mismatch Condition Training.

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Sequence summarizing neural network for speaker adaptation.

[DOI]

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Multilingual region-dependent transforms.

[DOI]

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015

DNN derived filters for processing of modulation spectrum of speech.

[DOI]

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Autoencoder based multi-stream combination for noise robust speech recognition.

[DOI]

Sri Harish Reddy Mallidi

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Towards machines that know when they do not know: Summary of work done at 2014 Frederick Jelinek Memorial Workshop.

[DOI]

Sri Harish Reddy Mallidi

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2014

BUT ASR system for BABEL Surprise evaluation 2014.

[DOI]

Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

Progress in the BBN keyword search system for the DARPA RATS program.

[DOI]

Sri Harish Reddy Mallidi

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

BUT 2014 Babel system: analysis of adaptation in NN based systems.

[DOI]

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Adaptation of multilingual stacked bottle-neck neural network structure for new language.

[DOI]

František Grézl

Proceedings of the IEEE International Conference on Acoustics, 2014

2013

Sequence-discriminative training of deep neural networks.

[DOI]

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Improved feature processing for deep neural networks.

[DOI]

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

BUT BABEL system for spontaneous Cantonese.

[DOI]

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Manual and semi-automatic approaches to building a multilingual phoneme set.

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2013

Semi-supervised training of Deep Neural Networks.

[DOI]

Mirko Hannemann

Lukáš Burget

Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

Score normalization and system combination for improved keyword spotting.

[DOI]

Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

2012

The language-independent bottleneck features.

[DOI]

Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

BUT2012 Approaches for Spoken Web Search - MediaEval 2012.

[DOI]

Igor Szöke

Michal Fapso

Working Notes Proceedings of the MediaEval 2012 Workshop, 2012

Developing a Speech Activity Detection System for the DARPA RATS Program.

[DOI]

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Patrol Team Language Identification System for DARPA RATS P1 Evaluation.

[DOI]

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Generating exact lattices in the WFST framework.

[DOI]

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011

New system theory and its impact on control theory.

[DOI]

Int. J. Gen. Syst., 2011

Convolutive Bottleneck Network features for LVCSR.

[DOI]

František Grézl

Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

2010

Parallel training of neural networks for speech recognition.

[DOI]