Ales Prazák

Orcid: 0000-0001-9453-0034

According to our database1, Ales Prazák authored at least 34 papers between 2006 and 2022.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of five.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2022
Exploring Capabilities of Monolingual Audio Transformers using Large Datasets in Automatic Speech Recognition of Czech.
CoRR, 2022

Exploring Capabilities of Monolingual Audio Transformers using Large Datasets in Automatic Speech Recognition of Czech.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

2021
CNN-TDNN-Based Architecture for Speech Recognition Using Grapheme Models in Bilingual Czech-Slovak Task.
Proceedings of the Text, Speech, and Dialogue - 24th International Conference, 2021

Recognition of Heavily Accented and Emotional Speech of English and Czech Holocaust Survivors Using Various DNN Architectures.
Proceedings of the Speech and Computer - 23rd International Conference, 2021

Various DNN-HMM Architectures Used in Acoustic Modeling with Single-Speaker and Single-Channel.
Proceedings of the Statistical Language and Speech Processing, 2021

Spoken Term Detection and Relevance Score Estimation Using Dot-Product of Pronunciation Embeddings.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Live TV Subtitling Through Respeaking.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

2020
Live TV subtitling through respeaking with remote cutting-edge technology.
Multim. Tools Appl., 2020

Complexity of the TDNN Acoustic Model with Respect to the HMM Topology.
Proceedings of the Text, Speech, and Dialogue, 2020

2018
Online LDA-Based Language Model Adaptation.
Proceedings of the Text, Speech, and Dialogue - 21st International Conference, 2018

First Insight into the Processing of the Language Consulting Center Data.
Proceedings of the Speech and Computer - 20th International Conference, 2018

Semi-Supervised Training of DNN-Based Acoustic Model for ATC Speech Recognition.
Proceedings of the Speech and Computer - 20th International Conference, 2018

Towards Processing of the Oral History Interviews and Related Printed Documents.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Multimodal Name Recognition in Live TV Subtitling.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

2014
General framework for mining, processing and storing large amounts of electronic texts for language modeling purposes.
Lang. Resour. Evaluation, 2014

Captioning of Live TV Commentaries from the Olympic Games in Sochi: Some Interesting Insights.
Proceedings of the Text, Speech and Dialogue - 17th International Conference, 2014

2013
Online Speaker Adaptation of an Acoustic Model Using Face Recognition.
Proceedings of the Text, Speech, and Dialogue - 16th International Conference, 2013

Towards Live Subtitling of TV Ice-hockey Commentary.
Proceedings of the SIGMAP and WINSYS 2013, 2013

2012
Neural Network Language Model with Cache.
Proceedings of the Text, Speech and Dialogue - 15th International Conference, 2012

Captioning of Live TV Programs through Speech Recognition and Re-speaking.
Proceedings of the Text, Speech and Dialogue - 15th International Conference, 2012

Novel Approach to Live Captioning Through Re-speaking: Tailoring Speech Recognition to Re-speaker's Needs.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

2011
System for fast lexical and phonetic spoken term detection in a Czech cultural heritage archive.
EURASIP J. Audio Speech Music. Process., 2011

Automatic Topic Identification for Large Scale Language Modeling Data Filtering.
Proceedings of the Text, Speech and Dialogue - 14th International Conference, 2011

Four-phase Re-speaker Training System.
Proceedings of the SIGMAP 2011, 2011

2010
Online TV Captioning of Czech Parliamentary Sessions.
Proceedings of the Text, Speech and Dialogue, 13th International Conference, 2010

Fast Phonetic/Lexical Searching in the Archives of the Czech Holocaust Testimonies: Advancing Towards the MALACH Project Visions.
Proceedings of the Text, Speech and Dialogue, 13th International Conference, 2010

2009
Discriminative Training of Gender-Dependent Acoustic Models.
Proceedings of the Text, Speech and Dialogue, 12th International Conference, 2009

Training of Speaker-clustered Acoustic Models for use in Real-time Recognizers.
Proceedings of the SIGMAP 2009, 2009

Fast Speaker Adaptation in Automatic Online Subtitling.
Proceedings of the SIGMAP 2009, 2009

2007
Searching for a Robust MFCC-Based Parameterization for ASR Application.
Proceedings of the SIGMAP 2007, 2007

Live TV Subtitling - Fast 2-pass LVCSR System for Online Subtitling.
Proceedings of the SIGMAP 2007, 2007

2006
Automatic Online Subtitling of the Czech Parliament Meetings.
Proceedings of the Text, Speech and Dialogue, 9th International Conference, 2006

Benefit of a Class-based Language Model for Real-time Closed-captioning of TV Ice-hockey Commentaries.
Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006

Adaptive language model in automatic online subtitling.
Proceedings of the Second IASTED International Conference on Computational Intelligence, 2006


  Loading...