Daniel Tihelka

Orcid: 0000-0002-3149-2330

According to our database1, Daniel Tihelka authored at least 73 papers between 2002 and 2024.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of five.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
T5G2P: Text-to-Text Transfer Transformer Based Grapheme-to-Phoneme Conversion.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Sentences vs Phrases in Neural Speech Synthesis.
Proceedings of the Text, Speech, and Dialogue - 27th International Conference, 2024

Zero-Shot vs. Few-Shot Multi-speaker TTS Using Pre-trained Czech SpeechT5 Model.
Proceedings of the Text, Speech, and Dialogue - 27th International Conference, 2024

2023
VITS: Quality Vs. Speed Analysis.
Proceedings of the Text, Speech, and Dialogue - 26th International Conference, 2023

Ensemble of Deep Neural Network Models for MOS Prediction.
Proceedings of the IEEE International Conference on Acoustics, 2023

VITS, Tacotron or FastSpeech? Challenging Some of the Most Popular Synthesizers.
Proceedings of the Pattern Recognition - 7th Asian Conference, 2023

2022
On Comparison of Phonetic Representations for Czech Neural Speech Synthesis.
Proceedings of the Text, Speech, and Dialogue - 25th International Conference, 2022

Sequence-to-Sequence CNN-BiLSTM Based Glottal Closure Instant Detection from Raw Speech.
Proceedings of the Artificial Neural Networks in Pattern Recognition, 2022

2021
How Much End-to-End is Tacotron 2 End-to-End TTS System.
Proceedings of the Text, Speech, and Dialogue - 24th International Conference, 2021

Save Your Voice: Voice Banking and TTS for Anyone.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

T5G2P: Using Text-to-Text Transfer Transformer for Grapheme-to-Phoneme Conversion.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

A Comparison of Convolutional Neural Networks for Glottal Closure Instant Detection from Raw Speech.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Dialogue act based expressive speech synthesis in limited domain for the Czech language.
Informatica (Slovenia), 2020

Speaker-Dependent BiLSTM-Based Phrasing.
Proceedings of the Text, Speech, and Dialogue, 2020

Grappling with Web Technologies: The Problems of Remote Speech Recording.
Proceedings of the Speech and Computer - 22nd International Conference, 2020

Uncertainty of Phone Voicing and Its Impact on Speech Synthesis.
Proceedings of the Speech and Computer - 22nd International Conference, 2020

2019
Air traffic control communication (ATCC) speech corpora and their use for ASR and TTS development.
Lang. Resour. Evaluation, 2019

LSTM-Based Speech Segmentation for TTS Synthesis.
Proceedings of the Text, Speech, and Dialogue - 22nd International Conference, 2019

Unified Language-Independent DNN-Based G2P Converter.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Using Extreme Gradient Boosting to Detect Glottal Closure Instants in Speech Signal.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Current State of Text-to-Speech System ARTIC: A Decade of Research on the Field of Speech Technologies.
Proceedings of the Text, Speech, and Dialogue - 21st International Conference, 2018

On the Extension of the Formal Prosody Model for TTS.
Proceedings of the Text, Speech, and Dialogue - 21st International Conference, 2018

WaveNet-Based Speech Synthesis Applied to Czech - A Comparison with the Traditional Synthesis Methods.
Proceedings of the Text, Speech, and Dialogue - 21st International Conference, 2018

First Steps Towards Hybrid Speech Synthesis in Czech TTS System ARTIC.
Proceedings of the Speech and Computer - 20th International Conference, 2018

Design and Development of Speech Corpora for Air Traffic Control Training.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Glottal Closure Instant Detection from Speech Signal Using Voting Classifier and Recursive Feature Elimination.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

2017
Anomaly-based annotation error detection in speech-synthesis corpora.
Comput. Speech Lang., 2017

Last Syllable Unit Penalization in Unit Selection TTS.
Proceedings of the Text, Speech, and Dialogue - 20th International Conference, 2017

Annotation Error Detection: Anomaly Detection vs. Classification.
Proceedings of the Speech and Computer - 19th International Conference, 2017

Classification-Based Detection of Glottal Closure Instants from Speech Signals.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Voice Conservation and TTS System for People Facing Total Laryngectomy.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

WebSubDub - Experimental System for Creating High-Quality Alternative Audio Track for TV Broadcasting.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

2016
On the Influence of the Number of Anomalous and Normal Examples in Anomaly-Based Annotation Errors Detection.
Proceedings of the Text, Speech, and Dialogue - 19th International Conference, 2016

Difficulties with Wh-Questions in Czech TTS System.
Proceedings of the Text, Speech, and Dialogue - 19th International Conference, 2016

Experiments with One-Class Classifier as a Predictor of Spectral Discontinuities in Unit Concatenation.
Proceedings of the Speech and Computer - 18th International Conference, 2016

Designing High-Coverage Multi-level Text Corpus for Non-professional-voice Conservation.
Proceedings of the Speech and Computer - 18th International Conference, 2016

Voting Detector: A Combination of Anomaly Detectors to Reveal Annotation Errors in TTS Corpora.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

2015
Speech Corpus Preparation for Voice Banking of Laryngectomised Patients.
Proceedings of the Text, Speech, and Dialogue - 18th International Conference, 2015

Anomaly-based annotation errors detection in TTS corpora.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

2014
Modelling F0 Dynamics in Unit Selection Based Speech Synthesis.
Proceedings of the Text, Speech and Dialogue - 17th International Conference, 2014

Tuning Limited Domain Speech Synthesis Using General Text-to-Speech System.
Proceedings of the Text, Speech and Dialogue - 17th International Conference, 2014

Minimum Text Corpus Selection for Limited Domain Speech Synthesis.
Proceedings of the Text, Speech and Dialogue - 17th International Conference, 2014

2013
Robust Methodology for TTS Enhancement Evaluation.
Proceedings of the Text, Speech, and Dialogue - 16th International Conference, 2013

SVM-Based Detection of Misannotated Words in Read Speech Corpora.
Proceedings of the Text, Speech, and Dialogue - 16th International Conference, 2013

Configuring TTS Evaluation Method Based on Unit Cost Outlier Detection.
Proceedings of the Text, Speech, and Dialogue - 16th International Conference, 2013

Experiments on Reducing Footprint of Unit Selection TTS System.
Proceedings of the Text, Speech, and Dialogue - 16th International Conference, 2013

Is unit selection aware of audible artifacts?
Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013

Annotation errors detection in TTS corpora.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

2012
On the Impact of Annotation Errors on Unit-Selection Speech Synthesis.
Proceedings of the Text, Speech and Dialogue - 15th International Conference, 2012

On the impact of labialization contexts on unit selection speech synthesis.
Proceedings of the IEEE International Symposium on Signal Processing and Information Technology, 2012

2011
On the detection of pitch marks using a robust multi-phase algorithm.
Speech Commun., 2011

Generalized Non-uniform Time Scaling Distribution Method for Natural-Sounding Speech Rate Change.
Proceedings of the Text, Speech and Dialogue - 14th International Conference, 2011

2010
Enhancements of viterbi search for fast unit selection synthesis.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

2009
First Experiments on Text-to-Speech System Personification.
Proceedings of the Text, Speech and Dialogue, 12th International Conference, 2009

Exploring automatic similarity measures for unit selection tuning.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

2008
Building of a Speech Corpus Optimised for Unit Selection TTS Synthesis.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

2007
Quality Deterioration Factors in Unit Selection Speech Synthesis.
Proceedings of the Text, Speech and Dialogue, 10th International Conference, 2007

Pitch Marks at Peaks or Valleys?
Proceedings of the Text, Speech and Dialogue, 10th International Conference, 2007

Evaluation of various unit types in the unit selection approach for the Czech language using the Festival system.
Proceedings of the Sixth ISCA Workshop on Speech Synthesis, 2007

A robust multi-phase pitch-mark detection algorithm.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

2006
Diphones vs. Triphones in Czech Unit Selection TTS.
Proceedings of the Text, Speech and Dialogue, 9th International Conference, 2006

Current State of Czech Text-to-Speech System ARTIC.
Proceedings of the Text, Speech and Dialogue, 9th International Conference, 2006

Unit selection and its relation to symbolic prosody: a new approach.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

2005
Symbolic prosody driven unit selection for highly natural synthetic speech.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Hybrid syllable/triphone speech synthesis.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

2004
Advanced Prosody Modelling.
Proceedings of the Text, Speech and Dialogue, 7th International Conference, 2004

Slovak Text-to-Speech Synthesis in ARTIC System.
Proceedings of the Text, Speech and Dialogue, 7th International Conference, 2004

The Design of Czech Language Formal Listening Tests for the Evaluation of TTS Systems.
Proceedings of the Fourth International Conference on Language Resources and Evaluation, 2004

Recent improvements on ARTIC: czech text-to-speech system.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

2003
Experiments with Automatic Segmentation for Czech Speech Synthesis.
Proceedings of the Text, Speech and Dialogue, 6th International Conference, 2003

Sentence boundary detection in Czech TTS system using neural networks.
Proceedings of the Seventh International Symposium on Signal Processing and Its Applications, 2003

Automatic segmentation for czech concatenative speech synthesis using statistical approach with boundary-specific correction.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

2002
German and Czech Speech Synthesis Using HMM-Based Speech Segment Database.
Proceedings of the Text, Speech and Dialogue, 5th International Conference, 2002


  Loading...