Ondrej Klejch

CoRR, 2024

TTSDS - Text-to-Speech Distribution Score.

[BibT_eX]

[DOI]

CoRR, 2024

Exploring Dominant Paths in CTC-Like ASR Models: Unraveling the Effectiveness of Viterbi Decoding.

[BibT_eX]

[DOI]

Zeyu Zhao

Proceedings of the IEEE International Conference on Acoustics, 2024

Speech Collage: Code-Switched Audio Generation by Collaging Monolingual Corpora.

[BibT_eX]

[DOI]

Shammur Absar Chowdhury

Ahmed Ali

Shinji Watanabe

Sanjeev Khudanpur

Proceedings of the IEEE International Conference on Acoustics, 2024

UnMute Toolkit: Speech Interactions Designed With Minoritised Language Speakers.

[BibT_eX]

[DOI]

Thomas Reitmaier

Dani Kalarikalayil Raju

Proceedings of the ACM Conversational User Interfaces 2024, 2024

Cultivating Spoken Language Technologies for Unwritten Languages.

[BibT_eX]

[DOI]

Thomas Reitmaier

Dani Kalarikalayil Raju

Proceedings of the CHI Conference on Human Factors in Computing Systems, 2024

2023

Acoustic Word Embeddings for Untranscribed Target Languages with Continued Pretraining and Learned Pooling.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Evaluating and reducing the distance between synthetic and real speech distributions.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

ASR and Emotional Speech: A Word-Level Investigation of the Mutual Impact of Speech and Emotion Recognition.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Comparing Self-Supervised Pre-Training and Semi-Supervised Training for Speech Recognition in Languages with Weak Language Models.

[BibT_eX]

[DOI]

Léa-Marie Lam-Yee-Mui

Lucas Ondel Yang

Cassia Valentini-Botinhao

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Towards Zero-Shot Code-Switched Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Efficient Intelligibility Evaluation Using Keyword Spotting: A Study on Audio-Visual Speech Enhancement.

[BibT_eX]

[DOI]

Andrea Lorena Aldana Blanco

Andrea Lorena Aldana Blanco

Proceedings of the IEEE International Conference on Acoustics, 2023

The Edinburgh International Accents of English Corpus: Towards the Democratization of English ASR.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Situating Automatic Speech Recognition Development within Communities of Under-heard Language Speakers.

[BibT_eX]

[DOI]

Léa-Marie Lam-Yee-Mui

Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems, 2023

2022

AVSE Challenge: Audio-Visual Speech Enhancement Challenge.

[BibT_eX]

[DOI]

Cassia Valentini-Botinhao

Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Deciphering Speech: a Zero-Resource Approach to Cross-Lingual Transfer in ASR.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Opportunities and Challenges of Automatic Speech Recognition Systems for Low-Resource Language Speakers.

[BibT_eX]

[DOI]

Thomas Reitmaier

Dani Kalarikalayil Raju

Proceedings of the CHI '22: CHI Conference on Human Factors in Computing Systems, New Orleans, LA, USA, 29 April 2022, 2022

2021

Mask-combine Decoding and Classification Approach for Punctuation Prediction with real-time Inference Constraints.

[BibT_eX]

[DOI]

CoRR, 2021

On the Learning Dynamics of Semi-Supervised Training for ASR.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

The CSTR System for Multilingual and Code-Switching ASR Challenges for Low Resource Indian Languages.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

European Language Grid: A Joint Platform for the European Language Technology Community.

[BibT_eX]

[DOI]

Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: System Demonstrations, 2021

2020

Learning to adapt: meta-learning approaches for speaker adaptation

[BibT_eX]

[DOI]

PhD thesis, 2020

Adaptation Algorithms for Speech Recognition: An Overview.

[BibT_eX]

[DOI]

CoRR, 2020

European Language Grid: An Overview.

[BibT_eX]

[DOI]

Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Ava Active Speaker: An Audio-Visual Dataset for Active Speaker Detection.

[BibT_eX]

[DOI]

Arkadiusz Stopczynski

Cordelia Schmid

Zhonghua Xi

Caroline Pantofaru

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019

Lattice-Based Unsupervised Test-Time Adaptation of Neural Network Acoustic Models.

[BibT_eX]

[DOI]

CoRR, 2019

AVA-ActiveSpeaker: An Audio-Visual Dataset for Active Speaker Detection.

[BibT_eX]

[DOI]

Arkadiusz Stopczynski

Cordelia Schmid

Zhonghua Xi

Caroline Pantofaru

CoRR, 2019

Lattice-Based Lightly-Supervised Acoustic Model Training.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Supplementary Material: AVA-ActiveSpeaker: An Audio-Visual Dataset for Active Speaker Detection.

[BibT_eX]

[DOI]

Arkadiusz Stopczynski

Cordelia Schmid

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

Speaker Adaptive Training Using Model Agnostic Meta-Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

Acoustic Model Adaptation from Raw Waveforms with Sincnet.

[BibT_eX]

[DOI]

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

Experiments with Cross-Language Speech Retrieval for Lower-Resource Languages.

[BibT_eX]

[DOI]

Proceedings of the Information Retrieval Technology, 2019

2018

Learning to Adapt: A Meta-learning Approach for Speaker Adaptation.

[BibT_eX]

[DOI]

Joachim Fainberg

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

2017

Sequence-to-sequence models for punctuated transcription combining lexical and acoustic features.

[BibT_eX]

[DOI]

Steve Renals

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

The SUMMA Platform Prototype.

[BibT_eX]

[DOI]

Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

Hierarchical recurrent neural network for story segmentation using fusion of lexical and acoustic features.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

2016

Punctuated transcription of multi-genre broadcasts using acoustic and lexical approaches.

[BibT_eX]

[DOI]

Steve Renals

Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

Tools and Guidelines for Principled Machine Translation Development.

[BibT_eX]

[DOI]

Nora Aranberri

Eleftherios Avramidis

Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

2015

MT-ComparEval: Graphical evaluation interface for Machine Translation development.

[BibT_eX]

[DOI]