Jan Svec

Orcid: 0000-0001-8362-5927

According to our database, Jan Svec authored at least 80 papers between 2005 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.








Asking Questions Framework for Oral History Archives.
Proceedings of the Advances in Information Retrieval, 2024

Asking Questions: an Innovative Way to Interact with Oral History Archives.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Transformer-based Speech Recognition Models for Oral History Archives in English, German, and Czech.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Ensemble of Deep Neural Network Models for MOS Prediction.
Proceedings of the IEEE International Conference on Acoustics, 2023

The System for Efficient Indexing and Search in the Large Archives of Scanned Historical Documents.
Proceedings of the Advances in Information Retrieval, 2023

Transformer-Based Encoder-Encoder Architecture for Spoken Term Detection.
Proceedings of the Pattern Recognition - 7th Asian Conference, 2023

T5G2P: Multilingual Grapheme-to-Phoneme Conversion with Text-to-Text Transfer Transformer.
Proceedings of the Pattern Recognition - 7th Asian Conference, 2023

Will XAI Provide Real Explanation or Just a Plausible Rationalization?
Proceedings of the Pattern Recognition - 7th Asian Conference, 2023

Voice-Interactive Learning Dialogue on a Low-Cost Device.
Proceedings of the Pattern Recognition - 7th Asian Conference, 2023

Exploring Capabilities of Monolingual Audio Transformers using Large Datasets in Automatic Speech Recognition of Czech.
CoRR, 2022

Evaluation of Wav2Vec Speech Recognition for Speakers with Cognitive Disorders.
Proceedings of the Text, Speech, and Dialogue - 25th International Conference, 2022

Transfer Learning of Transformers for Spoken Language Understanding.
Proceedings of the Text, Speech, and Dialogue - 25th International Conference, 2022

Automatic Grammar Correction of Commas in Czech Written Texts: Comparative Study.
Proceedings of the Text, Speech, and Dialogue - 25th International Conference, 2022

Analysis of Impact of Emotions on Target Speech Extraction and Speech Separation.
Proceedings of the 17th International Workshop on Acoustic Signal Enhancement, 2022

Deep LSTM Spoken Term Detection using Wav2Vec 2.0 Recognizer.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Exploring Capabilities of Monolingual Audio Transformers using Large Datasets in Automatic Speech Recognition of Czech.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Revisiting joint decoding based multi-talker speech recognition with DNN acoustic model.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

BCN2BRNO: ASR System Fusion for Albayzin 2022 Speech to Text Challenge.
Proceedings of the 6th International Conference, 2022

Transformer-Based Automatic Punctuation Prediction and Word Casing Reconstruction of the ASR Output.
Proceedings of the Text, Speech, and Dialogue - 24th International Conference, 2021

CNN-TDNN-Based Architecture for Speech Recognition Using Grapheme Models in Bilingual Czech-Slovak Task.
Proceedings of the Text, Speech, and Dialogue - 24th International Conference, 2021

OCR Improvements for Images of Multi-page Historical Documents.
Proceedings of the Speech and Computer - 23rd International Conference, 2021

Initial Experiments on Question Answering from the Intrinsic Structure of Oral History Archives.
Proceedings of the Speech and Computer - 23rd International Conference, 2021

Comparison of Czech Transformers on Text Classification Tasks.
Proceedings of the Statistical Language and Speech Processing, 2021

Spoken Term Detection and Relevance Score Estimation Using Dot-Product of Pronunciation Embeddings.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

T5G2P: Using Text-to-Text Transfer Transformer for Grapheme-to-Phoneme Conversion.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Live TV Subtitling Through Respeaking.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Automatic Correction of i/y Spelling in Czech ASR Output.
Proceedings of the Text, Speech, and Dialogue, 2020

Adjusting BERT's Pooling Layer for Large-Scale Multi-Label Text Classification.
Proceedings of the Text, Speech, and Dialogue, 2020

An Automated Pipeline for Robust Image Processing and Optical Character Recognition of Historical Documents.
Proceedings of the Speech and Computer - 22nd International Conference, 2020

BERT-Based Sentiment Analysis Using Distillation.
Proceedings of the Statistical Language and Speech Processing, 2020

Air traffic control communication (ATCC) speech corpora and their use for ASR and TTS development.
Lang. Resour. Evaluation, 2019

Question-Answering Dialog System for Large Audiovisual Archives.
Proceedings of the Text, Speech, and Dialogue - 22nd International Conference, 2019

On Using Stateful LSTM Networks for Key-Phrase Detection.
Proceedings of the Text, Speech, and Dialogue - 22nd International Conference, 2019

Increasing DER Hosting Capacity in LV Grids in the Czech Republic in Terms of European Project InterFlex.
Proceedings of the 2019 IEEE PES Innovative Smart Grid Technologies Europe, 2019

Analysis of Smart Technical Measures Impacts on DER and EV Hosting Capacity Increase in LV and MV Grids in the Czech Republic in Terms of European Project InterFlex.
Proceedings of the 2019 IEEE PES Innovative Smart Grid Technologies Europe, 2019

Multimodal Dialog with the MALACH Audiovisual Archive.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Learning to Interrupt the User at the Right Time in Incremental Dialogue Systems.
Proceedings of the Text, Speech, and Dialogue - 21st International Conference, 2018

Semi-Supervised Training of DNN-Based Acoustic Model for ATC Speech Recognition.
Proceedings of the Speech and Computer - 20th International Conference, 2018

Towards Network Simplification for Low-Cost Devices by Removing Synapses.
Proceedings of the Speech and Computer - 20th International Conference, 2018

Towards Processing of the Oral History Interviews and Related Printed Documents.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Design and Development of Speech Corpora for Air Traffic Control Training.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

On the Use of Grapheme Models for Searching in Large Spoken Archives.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

An Analysis of the RNN-Based Spoken Term Detection Training.
Proceedings of the Speech and Computer - 19th International Conference, 2017

A Relevance Score Estimation for Spoken Term Detection Based on RNN-Generated Pronunciation Embeddings.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Combining Textual and Speech Features in the NLI Task Using State-of-the-Art Machine Learning Techniques.
Proceedings of the 12th Workshop on Innovative Use of NLP for Building Educational Applications, 2017

Fast Subsequence Matching in Motion Capture Data.
Proceedings of the Advances in Databases and Information Systems, 2017

Building Corpora for Stylometric Research.
Proceedings of the Text, Speech, and Dialogue - 19th International Conference, 2016

An Automatic Training Tool for Air Traffic Control Training.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

An Engine for Online Video Search in Large Archives of the Holocaust Testimonies.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

A Multimodal Dialogue System for Air Traffic Control Trainees Based on Discrete-Event Simulation.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

A study of different weighting schemes for spoken language understanding based on convolutional neural networks.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Improving Multi-label Document Classification of Czech News Articles.
Proceedings of the Text, Speech, and Dialogue - 18th International Conference, 2015

Slavonic Corpus for Stylometry Research.
Proceedings of the 9th Workshop on Recent Advances in Slavonic Natural Languages Processing, 2015

Hierarchical discriminative model for spoken language understanding based on convolutional neural network.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Word-semantic lattices for spoken language understanding.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

General framework for mining, processing and storing large amounts of electronic texts for language modeling purposes.
Lang. Resour. Evaluation, 2014

WISE 2014 Challenge: Multi-label Classification of Print Media Articles to Topics.
Proceedings of the Web Information Systems Engineering - WISE 2014, 2014

Semi-supervised Learning Algorithm for Binary Relevance Multi-label Classification.
Proceedings of the Web Information Systems Engineering - WISE 2014 Workshops, 2014

Inter-Annotator Agreement on Spontaneous Czech Language - Limits of Automatic Speech Recognition Accuracy.
Proceedings of the Text, Speech and Dialogue - 17th International Conference, 2014

Two-Layer Semantic Entity Detection and Utterance Validation for Spoken Dialogue Systems.
Proceedings of the Text, Speech and Dialogue - 17th International Conference, 2014

Semantic Entity Detection in the Spoken Air Traffic Control Data.
Proceedings of the Speech and Computer - 16th International Conference, 2014

Phonetic Spoken Term Detection in Large Audio Archive Using the WFST Framework.
Proceedings of the Text, Speech, and Dialogue - 16th International Conference, 2013

On the Use of Phoneme Lattices in Spoken Language Understanding.
Proceedings of the Text, Speech, and Dialogue - 16th International Conference, 2013

Improving Speech Recognition by Detecting Foreign Inclusions and Generating Pronunciations.
Proceedings of the Text, Speech, and Dialogue - 16th International Conference, 2013

Hierarchical discriminative model for spoken language understanding.
Proceedings of the IEEE International Conference on Acoustics, 2013

Efficient algorithm for rational kernel evaluation in large lattice sets.
Proceedings of the IEEE International Conference on Acoustics, 2013

Semantic entity detection from multiple ASR hypotheses within the WFST framework.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

Spoken Dialogue System Design in 3 Weeks.
Proceedings of the Text, Speech and Dialogue - 15th International Conference, 2012

Unsupervised Synchronization of Hidden Subtitles with Audio Track Using Keyword Spotting Algorithm.
Proceedings of the Text, Speech and Dialogue - 15th International Conference, 2012

System for fast lexical and phonetic spoken term detection in a Czech cultural heritage archive.
EURASIP J. Audio Speech Music. Process., 2011

Web Text Data Mining for Building Large Scale Language Modelling Corpus.
Proceedings of the Text, Speech and Dialogue - 14th International Conference, 2011

Prototype of Czech Spoken Dialog System with Mixed Initiative for Railway Information Service.
Proceedings of the Text, Speech and Dialogue, 13th International Conference, 2010

Fast Phonetic/Lexical Searching in the Archives of the Czech Holocaust Testimonies: Advancing Towards the MALACH Project Visions.
Proceedings of the Text, Speech and Dialogue, 13th International Conference, 2010

Extended Hidden Vector State Parser.
Proceedings of the Text, Speech and Dialogue, 12th International Conference, 2009

The Czech Broadcast Conversation Corpus.
Proceedings of the Text, Speech and Dialogue, 12th International Conference, 2009

Structural Metadata Annotation of Speech Corpora: Comparing Broadcast News and Broadcast Conversations.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

Extension of HVS semantic parser by allowing left-right branching.
Proceedings of the IEEE International Conference on Acoustics, 2008

Parameterization of the Input in Training the HVS Semantic Parser.
Proceedings of the Text, Speech and Dialogue, 10th International Conference, 2007

Use of Negative Examples in Training the HVS Semantic Model.
Proceedings of the Text, Speech and Dialogue, 9th International Conference, 2006

Czech spontaneous speech corpus with structural metadata.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
