Ondrej Klejch

Orcid: 0000-0001-5495-967X

According to our database, Ondrej Klejch authored at least 40 papers between 2015 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Beyond Oversmoothing: Evaluating DDPM and MSE for Scalable Speech Synthesis in ASR.
CoRR, 2024

TTSDS - Text-to-Speech Distribution Score.
CoRR, 2024

Exploring Dominant Paths in CTC-Like ASR Models: Unraveling the Effectiveness of Viterbi Decoding.
Proceedings of the IEEE International Conference on Acoustics, 2024

Speech Collage: Code-Switched Audio Generation by Collaging Monolingual Corpora.
Proceedings of the IEEE International Conference on Acoustics, 2024

UnMute Toolkit: Speech Interactions Designed With Minoritised Language Speakers.
Proceedings of the ACM Conversational User Interfaces 2024, 2024

Cultivating Spoken Language Technologies for Unwritten Languages.
Proceedings of the CHI Conference on Human Factors in Computing Systems, 2024

2023
Acoustic Word Embeddings for Untranscribed Target Languages with Continued Pretraining and Learned Pooling.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Evaluating and reducing the distance between synthetic and real speech distributions.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

ASR and Emotional Speech: A Word-Level Investigation of the Mutual Impact of Speech and Emotion Recognition.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Comparing Self-Supervised Pre-Training and Semi-Supervised Training for Speech Recognition in Languages with Weak Language Models.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Towards Zero-Shot Code-Switched Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023

Efficient Intelligibility Evaluation Using Keyword Spotting: A Study on Audio-Visual Speech Enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2023

The Edinburgh International Accents of English Corpus: Towards the Democratization of English ASR.
Proceedings of the IEEE International Conference on Acoustics, 2023

Situating Automatic Speech Recognition Development within Communities of Under-heard Language Speakers.
Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems, 2023

2022
AVSE Challenge: Audio-Visual Speech Enhancement Challenge.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Deciphering Speech: a Zero-Resource Approach to Cross-Lingual Transfer in ASR.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Opportunities and Challenges of Automatic Speech Recognition Systems for Low-Resource Language Speakers.
Proceedings of the CHI '22: CHI Conference on Human Factors in Computing Systems, New Orleans, LA, USA, 29 April 2022, 2022

2021
Mask-combine Decoding and Classification Approach for Punctuation Prediction with real-time Inference Constraints.
CoRR, 2021

On the Learning Dynamics of Semi-Supervised Training for ASR.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

The CSTR System for Multilingual and Code-Switching ASR Challenges for Low Resource Indian Languages.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021


2020
Learning to adapt: meta-learning approaches for speaker adaptation.
PhD thesis, 2020

Adaptation Algorithms for Speech Recognition: An Overview.
CoRR, 2020


Ava Active Speaker: An Audio-Visual Dataset for Active Speaker Detection.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Lattice-Based Unsupervised Test-Time Adaptation of Neural Network Acoustic Models.
CoRR, 2019

AVA-ActiveSpeaker: An Audio-Visual Dataset for Active Speaker Detection.
CoRR, 2019

Lattice-Based Lightly-Supervised Acoustic Model Training.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Supplementary Material: AVA-ActiveSpeaker: An Audio-Visual Dataset for Active Speaker Detection.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

Speaker Adaptive Training Using Model Agnostic Meta-Learning.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

Acoustic Model Adaptation from Raw Waveforms with Sincnet.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

Experiments with Cross-Language Speech Retrieval for Lower-Resource Languages.
Proceedings of the Information Retrieval Technology, 2019

2018
Learning to Adapt: A Meta-learning Approach for Speaker Adaptation.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

2017
Sequence-to-sequence models for punctuated transcription combining lexical and acoustic features.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017


Hierarchical recurrent neural network for story segmentation using fusion of lexical and acoustic features.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

2016
Punctuated transcription of multi-genre broadcasts using acoustic and lexical approaches.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

Tools and Guidelines for Principled Machine Translation Development.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

2015
MT-ComparEval: Graphical evaluation interface for Machine Translation development.
Prague Bull. Math. Linguistics, 2015

CloudASR: Platform and Service.
Proceedings of the Text, Speech, and Dialogue - 18th International Conference, 2015

