Daniel Tihelka

IEEE ACM Trans. Audio Speech Lang. Process., 2024

Sentences vs Phrases in Neural Speech Synthesis.

Proceedings of the Text, Speech, and Dialogue - 27th International Conference, 2024

Zero-Shot vs. Few-Shot Multi-speaker TTS Using Pre-trained Czech SpeechT5 Model.

Proceedings of the Text, Speech, and Dialogue - 27th International Conference, 2024

2023

VITS: Quality Vs. Speed Analysis.

Proceedings of the Text, Speech, and Dialogue - 26th International Conference, 2023

Ensemble of Deep Neural Network Models for MOS Prediction.

Proceedings of the IEEE International Conference on Acoustics, 2023

VITS, Tacotron or FastSpeech? Challenging Some of the Most Popular Synthesizers.

Alice Tihelková

Proceedings of the Pattern Recognition - 7th Asian Conference, 2023

2022

On Comparison of Phonetic Representations for Czech Neural Speech Synthesis.

Proceedings of the Text, Speech, and Dialogue - 25th International Conference, 2022

Sequence-to-Sequence CNN-BiLSTM Based Glottal Closure Instant Detection from Raw Speech.

Proceedings of the Artificial Neural Networks in Pattern Recognition, 2022

2021

How Much End-to-End is Tacotron 2 End-to-End TTS System.

Alice Tihelková

Proceedings of the Text, Speech, and Dialogue - 24th International Conference, 2021

Save Your Voice: Voice Banking and TTS for Anyone.

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

T5G2P: Using Text-to-Text Transfer Transformer for Grapheme-to-Phoneme Conversion.

Markéta Řezáčková

Jan Švec

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

A Comparison of Convolutional Neural Networks for Glottal Closure Instant Detection from Raw Speech.

Proceedings of the IEEE International Conference on Acoustics, 2021

2020

Dialogue act based expressive speech synthesis in limited domain for the Czech language.

Informatica (Slovenia), 2020

Speaker-Dependent BiLSTM-Based Phrasing.

Proceedings of the Text, Speech, and Dialogue, 2020

Grappling with Web Technologies: The Problems of Remote Speech Recording.

Proceedings of the Speech and Computer - 22nd International Conference, 2020

Uncertainty of Phone Voicing and Its Impact on Speech Synthesis.

Proceedings of the Speech and Computer - 22nd International Conference, 2020

2019

Air traffic control communication (ATCC) speech corpora and their use for ASR and TTS development.

Lang. Resour. Evaluation, 2019

LSTM-Based Speech Segmentation for TTS Synthesis.

Proceedings of the Text, Speech, and Dialogue - 22nd International Conference, 2019

Unified Language-Independent DNN-Based G2P Converter.

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Using Extreme Gradient Boosting to Detect Glottal Closure Instants in Speech Signal.

Proceedings of the IEEE International Conference on Acoustics, 2019

2018

Current State of Text-to-Speech System ARTIC: A Decade of Research on the Field of Speech Technologies.

Proceedings of the Text, Speech, and Dialogue - 21st International Conference, 2018

On the Extension of the Formal Prosody Model for TTS.

Jan Volín

Proceedings of the Text, Speech, and Dialogue - 21st International Conference, 2018

WaveNet-Based Speech Synthesis Applied to Czech - A Comparison with the Traditional Synthesis Methods.

Proceedings of the Text, Speech, and Dialogue - 21st International Conference, 2018

First Steps Towards Hybrid Speech Synthesis in Czech TTS System ARTIC.

Proceedings of the Speech and Computer - 20th International Conference, 2018

Design and Development of Speech Corpora for Air Traffic Control Training.

Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Glottal Closure Instant Detection from Speech Signal Using Voting Classifier and Recursive Feature Elimination.

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

2017

Anomaly-based annotation error detection in speech-synthesis corpora.

Comput. Speech Lang., 2017

Last Syllable Unit Penalization in Unit Selection TTS.

Radek Skarnitzl

Proceedings of the Text, Speech, and Dialogue - 20th International Conference, 2017

Annotation Error Detection: Anomaly Detection vs. Classification.

Proceedings of the Speech and Computer - 19th International Conference, 2017

Classification-Based Detection of Glottal Closure Instants from Speech Signals.

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Voice Conservation and TTS System for People Facing Total Laryngectomy.

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

WebSubDub - Experimental System for Creating High-Quality Alternative Audio Track for TV Broadcasting.

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

2016

On the Influence of the Number of Anomalous and Normal Examples in Anomaly-Based Annotation Errors Detection.

Proceedings of the Text, Speech, and Dialogue - 19th International Conference, 2016

Difficulties with Wh-Questions in Czech TTS System.

Proceedings of the Text, Speech, and Dialogue - 19th International Conference, 2016

Experiments with One-Class Classifier as a Predictor of Spectral Discontinuities in Unit Concatenation.

Martin Gruber

Proceedings of the Speech and Computer - 18th International Conference, 2016

Designing High-Coverage Multi-level Text Corpus for Non-professional-voice Conservation.

Proceedings of the Speech and Computer - 18th International Conference, 2016

Voting Detector: A Combination of Anomaly Detectors to Reveal Annotation Errors in TTS Corpora.

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

2015

Speech Corpus Preparation for Voice Banking of Laryngectomised Patients.

Proceedings of the Text, Speech, and Dialogue - 18th International Conference, 2015

Anomaly-based annotation errors detection in TTS corpora.

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

2014

Modelling F0 Dynamics in Unit Selection Based Speech Synthesis.

Proceedings of the Text, Speech and Dialogue - 17th International Conference, 2014

Tuning Limited Domain Speech Synthesis Using General Text-to-Speech System.

Proceedings of the Text, Speech and Dialogue - 17th International Conference, 2014

Minimum Text Corpus Selection for Limited Domain Speech Synthesis.

Proceedings of the Text, Speech and Dialogue - 17th International Conference, 2014

2013

Robust Methodology for TTS Enhancement Evaluation.

Martin Gruber

Proceedings of the Text, Speech, and Dialogue - 16th International Conference, 2013

SVM-Based Detection of Misannotated Words in Read Speech Corpora.

Proceedings of the Text, Speech, and Dialogue - 16th International Conference, 2013

Configuring TTS Evaluation Method Based on Unit Cost Outlier Detection.

Proceedings of the Text, Speech, and Dialogue - 16th International Conference, 2013

Experiments on Reducing Footprint of Unit Selection TTS System.

Proceedings of the Text, Speech, and Dialogue - 16th International Conference, 2013

Is unit selection aware of audible artifacts?

Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013

Annotation errors detection in TTS corpora.

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

2012

On the Impact of Annotation Errors on Unit-Selection Speech Synthesis.

Luboš Šmídl

Proceedings of the Text, Speech and Dialogue - 15th International Conference, 2012

On the impact of labialization contexts on unit selection speech synthesis.

Proceedings of the IEEE International Symposium on Signal Processing and Information Technology, 2012

2011

On the detection of pitch marks using a robust multi-phase algorithm.

Speech Commun., 2011

Generalized Non-uniform Time Scaling Distribution Method for Natural-Sounding Speech Rate Change.

Martin Méner

Proceedings of the Text, Speech and Dialogue - 14th International Conference, 2011

2010

Enhancements of Viterbi search for fast unit selection synthesis.

Jiří Kala

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

2009

First Experiments on Text-to-Speech System Personification.

Proceedings of the Text, Speech and Dialogue, 12th International Conference, 2009

Exploring automatic similarity measures for unit selection tuning.

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

2008

Building of a Speech Corpus Optimised for Unit Selection TTS Synthesis.

Proceedings of the International Conference on Language Resources and Evaluation, 2008

2007

Quality Deterioration Factors in Unit Selection Speech Synthesis.

Jiří Kala

Proceedings of the Text, Speech and Dialogue, 10th International Conference, 2007

Pitch Marks at Peaks or Valleys?

Proceedings of the Text, Speech and Dialogue, 10th International Conference, 2007

Evaluation of various unit types in the unit selection approach for the Czech language using the Festival system.

Martin Gruber

Proceedings of the Sixth ISCA Workshop on Speech Synthesis, 2007

A robust multi-phase pitch-mark detection algorithm.

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

2006

Diphones vs. Triphones in Czech Unit Selection TTS.

Proceedings of the Text, Speech and Dialogue, 9th International Conference, 2006

Current State of Czech Text-to-Speech System ARTIC.

Proceedings of the Text, Speech and Dialogue, 9th International Conference, 2006

Unit selection and its relation to symbolic prosody: a new approach.

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

2005

Symbolic prosody driven unit selection for highly natural synthetic speech.

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Hybrid syllable/triphone speech synthesis.

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

2004

Advanced Prosody Modelling.

Proceedings of the Text, Speech and Dialogue, 7th International Conference, 2004

Slovak Text-to-Speech Synthesis in ARTIC System.

Proceedings of the Text, Speech and Dialogue, 7th International Conference, 2004

The Design of Czech Language Formal Listening Tests for the Evaluation of TTS Systems.

Proceedings of the Fourth International Conference on Language Resources and Evaluation, 2004

Recent improvements on ARTIC: Czech text-to-speech system.

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

2003

Experiments with Automatic Segmentation for Czech Speech Synthesis.

Josef Psutka

Proceedings of the Text, Speech and Dialogue, 6th International Conference, 2003

Sentence boundary detection in Czech TTS system using neural networks.

Proceedings of the Seventh International Symposium on Signal Processing and Its Applications, 2003

Automatic segmentation for Czech concatenative speech synthesis using statistical approach with boundary-specific correction.
