Jindrich Matousek

Orcid: 0000-0002-7408-7730

According to our database1, Jindrich Matousek authored at least 118 papers between 1999 and 2024.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of five.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Using LSTM neural networks for cross-lingual phonetic speech segmentation with an iterative correction procedure.
Comput. Intell., April, 2024

T5G2P: Text-to-Text Transfer Transformer Based Grapheme-to-Phoneme Conversion.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Effects of Training Strategies and the Amount of Speech Data on the Quality of Speech Synthesis.
Proceedings of the Text, Speech, and Dialogue - 27th International Conference, 2024

Sentences vs Phrases in Neural Speech Synthesis.
Proceedings of the Text, Speech, and Dialogue - 27th International Conference, 2024

Zero-Shot vs. Few-Shot Multi-speaker TTS Using Pre-trained Czech SpeechT5 Model.
Proceedings of the Text, Speech, and Dialogue - 27th International Conference, 2024

2023
VITS: Quality Vs. Speed Analysis.
Proceedings of the Text, Speech, and Dialogue - 26th International Conference, 2023

Neural Speech Synthesis with Enriched Phrase Boundaries.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Ensemble of Deep Neural Network Models for MOS Prediction.
Proceedings of the IEEE International Conference on Acoustics, 2023

T5G2P: Multilingual Grapheme-to-Phoneme Conversion with Text-to-Text Transfer Transformer.
Proceedings of the Pattern Recognition - 7th Asian Conference, 2023

VITS, Tacotron or FastSpeech? Challenging Some of the Most Popular Synthesizers.
Proceedings of the Pattern Recognition - 7th Asian Conference, 2023

2022
Text-to-Text Transfer Transformer Phrasing Model Using Enriched Text Input.
Proceedings of the Text, Speech, and Dialogue - 25th International Conference, 2022

On Comparison of Phonetic Representations for Czech Neural Speech Synthesis.
Proceedings of the Text, Speech, and Dialogue - 25th International Conference, 2022

Phonetic Speech Segmentation of Audiobooks by Using Adapted LSTM-Based Acoustic Models.
Proceedings of the Advances in Artificial Intelligence - IBERAMIA 2022, 2022

Sequence-to-Sequence CNN-BiLSTM Based Glottal Closure Instant Detection from Raw Speech.
Proceedings of the Artificial Neural Networks in Pattern Recognition, 2022

2021
On Comparison of XGBoost and Convolutional Neural Networks for Glottal Closure Instant Detection.
Proceedings of the Text, Speech, and Dialogue - 24th International Conference, 2021

How Much End-to-End is Tacotron 2 End-to-End TTS System.
Proceedings of the Text, Speech, and Dialogue - 24th International Conference, 2021

Human and Transformer-Based Prosodic Phrasing in Two Speech Genres.
Proceedings of the Speech and Computer - 23rd International Conference, 2021

Save Your Voice: Voice Banking and TTS for Anyone.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

A Comparison of Convolutional Neural Networks for Glottal Closure Instant Detection from Raw Speech.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Speech and web-based technology to enhance education for pupils with visual impairment.
J. Multimodal User Interfaces, 2020

Dialogue act based expressive speech synthesis in limited domain for the Czech language.
Informatica (Slovenia), 2020

Synthetic Speech Evaluation by 2D GMM Classification in Pleasure-Arousal Scale.
Proceedings of the 43rd International Conference on Telecommunications and Signal Processing, 2020

Context-Aware XGBoost for Glottal Closure Instant Detection in Speech Signal.
Proceedings of the Text, Speech, and Dialogue, 2020

Synthetic Speech Evaluation by Differential Maps in Pleasure-Arousal Space.
Proceedings of the Speech and Computer - 22nd International Conference, 2020

2019
Air traffic control communication (ATCC) speech corpora and their use for ASR and TTS development.
Lang. Resour. Evaluation, 2019

Artefact Determination by GMM-Based Continuous Detection of Emotional Changes in Synthetic Speech.
Proceedings of the 42nd International Conference on Telecommunications and Signal Processing, 2019

Czech Speech Synthesis with Generative Neural Vocoder.
Proceedings of the Text, Speech, and Dialogue - 22nd International Conference, 2019

Evaluation of Synthetic Speech by GMM-Based Continuous Detection of Emotional States.
Proceedings of the Text, Speech, and Dialogue - 22nd International Conference, 2019

Web-Based Speech Synthesis Editor.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Framework for Conducting Tasks Requiring Human Assessment.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Using Extreme Gradient Boosting to Detect Glottal Closure Instants in Speech Signal.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Evaluation of Synthetic Speech Quality by Statistical Analysis of Voiced and Unvoiced Part Durations.
Proceedings of the 41st International Conference on Telecommunications and Signal Processing, 2018

Current State of Text-to-Speech System ARTIC: A Decade of Research on the Field of Speech Technologies.
Proceedings of the Text, Speech, and Dialogue - 21st International Conference, 2018

Automatic Evaluation of Synthetic Speech Quality by a System Based on Statistical Analysis.
Proceedings of the Text, Speech, and Dialogue - 21st International Conference, 2018

First Steps Towards Hybrid Speech Synthesis in Czech TTS System ARTIC.
Proceedings of the Speech and Computer - 20th International Conference, 2018

On the Contribution of Articulatory Features to Speech Synthesis.
Proceedings of the Speech and Computer - 20th International Conference, 2018

Design and Development of Speech Corpora for Air Traffic Control Training.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Glottal Closure Instant Detection from Speech Signal Using Voting Classifier and Recursive Feature Elimination.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

On the Analysis of Training Data for Wavenet-Based Speech Synthesis.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Anomaly-based annotation error detection in speech-synthesis corpora.
Comput. Speech Lang., 2017

Automatic Classification of Types of Artefacts Arising During the Unit Selection Speech Synthesis.
Proceedings of the Text, Speech, and Dialogue - 20th International Conference, 2017

Automatic Phonetic Segmentation Using the Kaldi Toolkit.
Proceedings of the Text, Speech, and Dialogue - 20th International Conference, 2017

Annotation Error Detection: Anomaly Detection vs. Classification.
Proceedings of the Speech and Computer - 19th International Conference, 2017

Classification-Based Detection of Glottal Closure Instants from Speech Signals.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Voice Conservation and TTS System for People Facing Total Laryngectomy.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

WebSubDub - Experimental System for Creating High-Quality Alternative Audio Track for TV Broadcasting.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

2016
Comparison of one and two-level architecture of the GMM-based speaker age classifier.
Proceedings of the 39th International Conference on Telecommunications and Signal Processing, 2016

Unit-Selection Speech Synthesis Adjustments for Audiobook-Based Voices.
Proceedings of the Text, Speech, and Dialogue - 19th International Conference, 2016

Evaluation of TTS Personification by GMM-Based Speaker Gender and Age Classifier.
Proceedings of the Text, Speech, and Dialogue - 19th International Conference, 2016

On the Influence of the Number of Anomalous and Normal Examples in Anomaly-Based Annotation Errors Detection.
Proceedings of the Text, Speech, and Dialogue - 19th International Conference, 2016

GMM-based speaker gender and age classification after voice conversion.
Proceedings of the First International Workshop on Sensing, 2016

Designing High-Coverage Multi-level Text Corpus for Non-professional-voice Conservation.
Proceedings of the Speech and Computer - 18th International Conference, 2016

Voting Detector: A Combination of Anomaly Detectors to Reveal Annotation Errors in TTS Corpora.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

ARET - Automatic Reading of Educational Texts for Visually Impaired Students.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

2015
Detection of artefacts in czech synthetic speech based on ANOVA statistics.
Proceedings of the 38th International Conference on Telecommunications and Signal Processing, 2015

Experiment with GMM-Based Artefact Localization in Czech Synthetic Speech.
Proceedings of the Text, Speech, and Dialogue - 18th International Conference, 2015

Detection of Large Segmentation Errors with Score Predictive Model.
Proceedings of the Text, Speech, and Dialogue - 18th International Conference, 2015

Anomaly-based annotation errors detection in TTS corpora.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

2014
Modelling F0 Dynamics in Unit Selection Based Speech Synthesis.
Proceedings of the Text, Speech and Dialogue - 17th International Conference, 2014

GMM Classification of Text-to-Speech Synthesis: Identification of Original Speaker's Voice.
Proceedings of the Text, Speech and Dialogue - 17th International Conference, 2014

Quality Improvements of Zero-Concatenation-Cost Chain Based Unit Selection.
Proceedings of the Speech and Computer - 16th International Conference, 2014

Very fast unit selection using Viterbi search with zero-concatenation-cost chains.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
Concatenation Artifact Detection Trained from Listeners Evaluations.
Proceedings of the Text, Speech, and Dialogue - 16th International Conference, 2013

Experiment with Evaluation of Quality of the Synthetic Speech by the GMM Classifier.
Proceedings of the Text, Speech, and Dialogue - 16th International Conference, 2013

SVM-Based Detection of Misannotated Words in Read Speech Corpora.
Proceedings of the Text, Speech, and Dialogue - 16th International Conference, 2013

Configuring TTS Evaluation Method Based on Unit Cost Outlier Detection.
Proceedings of the Text, Speech, and Dialogue - 16th International Conference, 2013

Experiments on Reducing Footprint of Unit Selection TTS System.
Proceedings of the Text, Speech, and Dialogue - 16th International Conference, 2013

Is unit selection aware of audible artifacts?
Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013

Improvements in Czech Expressive Speech Synthesis in Limited Domain.
Proceedings of the Speech and Computer - 15th International Conference, 2013

Annotation errors detection in TTS corpora.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

2012
On the Impact of Annotation Errors on Unit-Selection Speech Synthesis.
Proceedings of the Text, Speech and Dialogue - 15th International Conference, 2012

On the impact of labialization contexts on unit selection speech synthesis.
Proceedings of the IEEE International Symposium on Signal Processing and Information Technology, 2012

Improving automatic dubbing with subtitle timing optimisation using video cut detection.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
On the detection of pitch marks using a robust multi-phase algorithm.
Speech Commun., 2011

Several Aspects of Machine-Driven Phrasing in Text-to-Speech Systems.
Prague Bull. Math. Linguistics, 2011

Web-Based System for Automatic Reading of Technical Documents for Vision Impaired Students.
Proceedings of the Text, Speech and Dialogue - 14th International Conference, 2011

Identifying Concatenation Discontinuities by Hierarchical Divisive Clustering of Pitch Contours.
Proceedings of the Text, Speech and Dialogue - 14th International Conference, 2011

Analysis of Data Collected in Listening Tests for the Purpose of Evaluation of Concatenation Cost Functions.
Proceedings of the Text, Speech and Dialogue - 14th International Conference, 2011

2010
Automatic Segmentation of Parasitic Sounds in Speech Corpora for TTS Synthesis.
Proceedings of the Text, Speech and Dialogue, 13th International Conference, 2010

Collection and Analysis of Data for Evaluation of Concatenation Cost Functions.
Proceedings of the Text, Speech and Dialogue, 13th International Conference, 2010

Listening-Test-Based Annotation of Communicative Functions for Expressive Speech Synthesis.
Proceedings of the Text, Speech and Dialogue, 13th International Conference, 2010

Enhancements of viterbi search for fast unit selection synthesis.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

2009
Automatic Pitch-Synchronous Phonetic Segmentation with Context-Independent HMMs.
Proceedings of the Text, Speech and Dialogue, 12th International Conference, 2009

Design of the Test Stimuli for the Evaluation of Concatenation Cost Functions.
Proceedings of the Text, Speech and Dialogue, 12th International Conference, 2009

First Experiments on Text-to-Speech System Personification.
Proceedings of the Text, Speech and Dialogue, 12th International Conference, 2009

Identification and automatic detection of parasitic speech sounds.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

2008
Building of a Speech Corpus Optimised for Unit Selection TTS Synthesis.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

Automatic pitch-synchronous phonetic segmentation.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

2007
Quality Deterioration Factors in Unit Selection Speech Synthesis.
Proceedings of the Text, Speech and Dialogue, 10th International Conference, 2007

Recording and Annotation of Speech Corpus for Czech Unit Selection Speech Synthesis.
Proceedings of the Text, Speech and Dialogue, 10th International Conference, 2007

Pitch Marks at Peaks or Valleys?
Proceedings of the Text, Speech and Dialogue, 10th International Conference, 2007

Voice Conversion Based on Probabilistic Parameter Transformation and Extended Inter-speaker Residual Prediction.
Proceedings of the Text, Speech and Dialogue, 10th International Conference, 2007

Evaluation of various unit types in the unit selection approach for the Czech language using the Festival system.
Proceedings of the Sixth ISCA Workshop on Speech Synthesis, 2007

A robust multi-phase pitch-mark detection algorithm.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

F0 transformation within the voice conversion framework.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

2006
Design, implementation and evaluation of the Czech realistic audio-visual speech synthesis.
Signal Process., 2006

Diphones vs. Triphones in Czech Unit Selection TTS.
Proceedings of the Text, Speech and Dialogue, 9th International Conference, 2006

Current State of Czech Text-to-Speech System ARTIC.
Proceedings of the Text, Speech and Dialogue, 9th International Conference, 2006

First Steps Towards New Czech Voice Conversion System.
Proceedings of the Text, Speech and Dialogue, 9th International Conference, 2006

Unit selection and its relation to symbolic prosody: a new approach.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

On building phonetically and prosodically rich speech corpus for text-to-speech synthesis.
Proceedings of the Second IASTED International Conference on Computational Intelligence, 2006

2005
Formal Prosodic Structures and Their Application in NLP.
Proceedings of the Text, Speech and Dialogue, 8th International Conference, 2005

On Modelling Glottal Stop in Czech Text-to-Speech Synthesis.
Proceedings of the Text, Speech and Dialogue, 8th International Conference, 2005

Hybrid syllable/triphone speech synthesis.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

2004
Advanced Prosody Modelling.
Proceedings of the Text, Speech and Dialogue, 7th International Conference, 2004

Slovak Text-to-Speech Synthesis in ARTIC System.
Proceedings of the Text, Speech and Dialogue, 7th International Conference, 2004

The Design of Czech Language Formal Listening Tests for the Evaluation of TTS Systems.
Proceedings of the Fourth International Conference on Language Resources and Evaluation, 2004

Recent improvements on ARTIC: czech text-to-speech system.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

2003
Experiments with Automatic Segmentation for Czech Speech Synthesis.
Proceedings of the Text, Speech and Dialogue, 6th International Conference, 2003

Sentence boundary detection in Czech TTS system using neural networks.
Proceedings of the Seventh International Symposium on Signal Processing and Its Applications, 2003

Automatic segmentation for czech concatenative speech synthesis using statistical approach with boundary-specific correction.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

2002
German and Czech Speech Synthesis Using HMM-Based Speech Segment Database.
Proceedings of the Text, Speech and Dialogue, 5th International Conference, 2002

2001
Large broadcast news and read speech corpora of spoken czech.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Design of speech corpus for text-to-speech synthesis.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

2000
Building a New Czech Text-to-Speech System Using Triphone-Based Speech Units.
Proceedings of the Text, Speech and Dialogue - Third International Workshop, 2000

ARTIC: a new Czech text-to-speech system using statistical approach to speech segment database construction.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

1999
Statistical Approach to the Automatic Synthesis of Czech Speech.
Proceedings of the Text, Speech and Dialogue - Second International Workshop, 1999

Speech synthesis using HMM-based acoustic unit inventory.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999


  Loading...