Vincent Colotte

According to our database, Vincent Colotte authored at least 35 papers between 1998 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.








Synthèse de gestes communicatifs via STARGATE (Synthesis of communicative gestures via STARGATE).
Proceedings of the Actes des 35èmes Journées d'Études sur la Parole, 2024

Stochastic Pitch Prediction Improves the Diversity and Naturalness of Speech in Glow-TTS.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Can We Use Common Voice to Train a Multi-Speaker TTS System?
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Analysis of expressivity transfer in non-autoregressive end-to-end multispeaker TTS systems.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Multi-stage attention for fine-grained expressivity transfer in multispeaker text-to-speech system.
Proceedings of the 30th European Signal Processing Conference, 2022

Learning emotions latent representation with CVAE for text-driven expressive audiovisual speech synthesis.
Neural Networks, 2021

Duration modelling and evaluation for Arabic statistical parametric speech synthesis.
Multim. Tools Appl., 2021

Improving transfer of expressivity for end-to-end multispeaker text-to-speech synthesis.
Proceedings of the 29th European Signal Processing Conference, 2021

Some consideration on expressive audiovisual speech corpus acquisition using a multimodal platform.
Lang. Resour. Evaluation, 2020

Étude comparative des paramètres d'entrée pour la synthèse expressive audiovisuelle de la parole par DNNs (Comparative study of input parameters for DNN-based expressive audiovisual speech synthesis).
Proceedings of the Actes de la 6e conférence conjointe Journées d'Études sur la Parole (JEP, 2020

Deep Variational Metric Learning for Transfer of Expressivity in Multispeaker Text to Speech.
Proceedings of the Statistical Language and Speech Processing, 2020

Transfer Learning of the Expressivity Using FLOW Metric Learning in Multispeaker Text-to-Speech Synthesis.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Conditional Variational Auto-Encoder for Text-Driven Expressive AudioVisual Speech Synthesis.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

\(F_{0}\) Modeling Using DNN for Arabic Parametric Speech Synthesis.
Proceedings of the Recent Advances in Big Data and Deep Learning, 2019

Evaluation of speech unit modelling for HMM-based speech synthesis for Arabic.
Int. J. Speech Technol., 2018

DNN-Based Speech Synthesis for Arabic: Modelling and Evaluation.
Proceedings of the Statistical Language and Speech Processing, 2018

On the quality of an expressive audiovisual corpus: a case study of acted speech.
Proceedings of the 14th International Conference on Auditory-Visual Speech Processing, 2017

The IFCASL Corpus of French and German Non-native and Native Read Speech.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Acoustic and Visual Analysis of Expressive Speech: A Case Study of French Acted Speech.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Designing a Bilingual Speech Corpus for French and German Language Learners: a Two-Step Process.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Acoustic-visual synthesis technique using bimodal unit-selection.
EURASIP J. Audio Speech Music. Process., 2013

Automatic feature selection for acoustic-visual concatenative speech synthesis: towards a perceptual objective measure.
Proceedings of the Auditory-Visual Speech Processing, 2013

ViSAC: acoustic-visual speech synthesis: the system and its evaluation.
Proceedings of the Facial Analysis and Animation 2012, 2012

Weight Optimization for Bimodal Unit-Selection Talking Head Synthesis.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Introducing visual target cost within an acoustic-visual unit-selection speech synthesizer.
Proceedings of the Auditory-Visual Speech Processing, 2011

Setup for acoustic-visual speech synthesis by concatenating bimodal units.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

HMM-based automatic visual speech segmentation using facial data.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Towards a true acoustic-visual speech synthesis.
Proceedings of the Auditory-Visual Speech Processing, 2010

Linguistic features weighting for a text-to-speech system without prosody model.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Techniques d'analyse et de synthèse de la parole appliquées à l'apprentissage des langues (Speech analysis and synthesis techniques applied to language learning).
PhD thesis, 2002

Higher precision pitch marking for TD-PSOLA.
Proceedings of the 11th European Signal Processing Conference, 2002

Perceptual experiments on enhanced and slowed down speech sentences for second language acquisition.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Automatic enhancement of speech intelligibility.
Proceedings of the IEEE International Conference on Acoustics, 2000

Detecting relevant acoustic events for piloting improvement of intelligibility.
Proceedings of the 10th European Signal Processing Conference, 2000

Automatic pitch marking for speech transformations via TD-PSOLA.
Proceedings of the 9th European Signal Processing Conference, 1998
