Gérard Bailly
Orcid: 0000-0002-6053-0818
Affiliations:
- CNRS, Grenoble, France
According to our database, Gérard Bailly authored at least 184 papers between 1986 and 2025.
Links
Online presence:
- on orcid.org
- on id.loc.gov
- on d-nb.info
- on gipsa-lab.fr
Bibliography
2025
Refining the evaluation of speech synthesis: A summary of the Blizzard Challenge 2023.
Comput. Speech Lang., 2025
2024
THERADIA WoZ: An Ecological Corpus for Appraisal-based Affect Research in Healthcare.
CoRR, 2024
Entraînement de la coordination respiration-parole en apprentissage de la lecture assistée par ordinateur (Training breath-speech coordination in computer-assisted reading instruction) [in French].
Proceedings of the Actes des 35èmes Journées d'Études sur la Parole, 2024
EVAC 2024 - Empathic Virtual Agent Challenge: Appraisal-based Recognition of Affective States.
Proceedings of the 26th International Conference on Multimodal Interaction, 2024
Impact of verbal instructions and deictic gestures of a cobot on the performance of human coworkers.
Proceedings of the 23rd IEEE-RAS International Conference on Humanoid Robots, 2024
Proceedings of the Companion of the 2024 ACM/IEEE International Conference on Human-Robot Interaction, 2024
RoboTrio2: Annotated Interactions of a Teleoperated Robot and Human Dyads for Data-Driven Behavioral Models.
Proceedings of the Workshops at the Third International Conference on Hybrid Human-Artificial Intelligence co-located with (HHAI 2024), 2024
Emotags: Computer-Assisted Verbal Labelling of Expressive Audiovisual Utterances for Expressive Multimodal TTS.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
2023
Dataset, January, 2023
Local Style Tokens: Fine-Grained Prosodic Representations For TTS Expressive Control.
Proceedings of the 12th ISCA Speech Synthesis Workshop, 2023
Proceedings of the 12th ISCA Speech Synthesis Workshop, 2023
Data-Driven Generation of Eyes and Head Movements of a Social Robot in Multiparty Conversation.
Proceedings of the Social Robotics - 15th International Conference, 2023
On the Benefit of Independent Control of Head and Eye Movements of a Social Robot for Multiparty Human-Robot Interaction.
Proceedings of the Human-Computer Interaction, 2023
Proceedings of the 18th Blizzard Challenge Workshop, Grenoble, France, August 29, 2023, 2023
Proceedings of the 18th Blizzard Challenge Workshop, Grenoble, France, August 29, 2023, 2023
2022
Dataset, December, 2022
Dataset, November, 2022
Comparing NLP Solutions for the Disambiguation of French Heterophonic Homographs for End-to-End TTS Systems.
Proceedings of the Speech and Computer - 24th International Conference, 2022
Proceedings of the 23rd Annual Meeting of the Special Interest Group on Discourse and Dialogue, 2022
Speaking Rate Control of end-to-end TTS Models by Direct Manipulation of the Encoder's Output Embeddings.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
2021
Dataset, March, 2021
Proceedings of the 11th ISCA Speech Synthesis Workshop, 2021
Proceedings of the Social Robotics - 13th International Conference, 2021
Evaluating the Extrapolation Capabilities of Neural Vocoders to Extreme Pitch Values.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Characterizing and Assessing the Oral Reading Fluency of Young Readers.
Proceedings of the Fifth International Conference, 2021
Proceedings of the Advances in Neuroergonomics and Cognitive Engineering, 2021
2020
Predicting Multidimensional Subjective Ratings of Children's Readings from the Speech Signals for the Automatic Assessment of Fluency.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020
2019
Proceedings of the 8th ISCA International Workshop on Speech and Language Technology in Education, 2019
Proceedings of the 11th International Conference on Agents and Artificial Intelligence, 2019
2018
Introduction to the special issue on auditory-visual expressive speech and gesture in humans and machines.
Speech Commun., 2018
Audio-visual synchronization in reading while listening to texts: Effects on visual behavior and verbal learning.
Comput. Speech Lang., 2018
CoRR, 2018
CoRR, 2018
Proceedings of the Fifth International Conference on Social Networks Analysis, 2018
A Weighted Superposition of Functional Contours Model for Modelling Contextual Prominence of Elementary Prosodic Contours.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Comparing Cascaded LSTM Architectures for Generating Head Motion from Speech in Task-Oriented Dialogs.
Proceedings of the Human-Computer Interaction. Interaction Technologies, 2018
2017
Speech Commun., 2017
Learning off-line vs. on-line models of interactive multimodal behaviors with recurrent neural networks.
Pattern Recognit. Lett., 2017
J. Multimodal User Interfaces, 2017
IEEE Computer Graphics and Applications, 2017
Evaluation of reading performance of primary school children: Objective measurements vs. subjective ratings.
Proceedings of the 6th International Workshop on Child Computer Interaction, 2017
Improving fluency of young readers: introducing a Karaoke to learn how to breathe during a Reading-while-Listening task.
Proceedings of the 7th ISCA International Workshop on Speech and Language Technology in Education, 2017
Proceedings of the Companion of the 2017 ACM/IEEE International Conference on Human-Robot Interaction, 2017
2016
Pattern Recognit. Lett., 2016
Statistical conversion of silent articulation into audible speech using full-covariance HMM.
Comput. Speech Lang., 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Quantitative Analysis of Backchannels Uttered by an Interviewer During Neuropsychological Tests.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the 7th IEEE International Conference on Cognitive Infocommunications, 2016
2015
Speaker-Adaptive Acoustic-Articulatory Inversion Using Cascaded Gaussian Mixture Regression.
IEEE ACM Trans. Audio Speech Lang. Process., 2015
J. Multimodal User Interfaces, 2015
Int. J. Humanoid Robotics, 2015
Using Karaoke to enhance reading while listening: impact on word memorization and eye movements.
Proceedings of the ISCA International Workshop on Speech and Language Technology in Education, 2015
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015
Impact of iris size and eyelids coupling on the estimation of the gaze direction of a robotic talking head by human viewers.
Proceedings of the 15th IEEE-RAS International Conference on Humanoid Robots, 2015
Proceedings of the Tenth Annual ACM/IEEE International Conference on Human-Robot Interaction, 2015
Proceedings of the Auditory-Visual Speech Processing, 2015
2014
Proceedings of the Seventh International Conference on Motion in Games, Playa Vista, CA, USA, November 06, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the 14th IEEE-RAS International Conference on Humanoid Robots, 2014
Modeling perception-action loops: comparing sequential models with frame-based classifiers.
Proceedings of the second international conference on Human-agent interaction, 2014
2013
Proceedings of the ISCA International Workshop on Speech and Language Technology in Education, 2013
Speaker adaptation of an acoustic-articulatory inversion model using cascaded Gaussian mixture regressions.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the Human Behavior Understanding - 4th International Workshop, 2013
Proceedings of the Auditory-Visual Speech Processing, 2013
2012
I Reach Faster When I See You Look: Gaze Effects in Human-Human and Human-Robot Face-to-Face Cooperation.
Frontiers Neurorobotics, 2012
Vizart3D : Retour Articulatoire Visuel pour l'Aide à la Prononciation (Vizart3D: Visual Articulatory Feedback for Computer-Assisted Pronunciation Training) [in French].
Proceedings of the Joint Conference JEP-TALN-RECITAL 2012, 2012
Cross-speaker Acoustic-to-Articulatory Inversion using Phone-based Trajectory HMM for Pronunciation Training.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Continuous Articulatory-to-Acoustic Mapping using Phone-based Trajectory HMM for a Silent Speech Interface.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
2011
A pilot study on augmented speech communication based on Electro-Magnetic Articulography.
Pattern Recognit. Lett., 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
2010
Can you 'read' tongue movements? Evaluation of the contribution of tongue display to speech understanding.
Speech Commun., 2010
Proceedings of the 3rd international workshop on Affective interaction in natural environments, 2010
Proceedings of the 3rd international workshop on Affective interaction in natural environments, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010
Speech, Gaze and Head Motion in a Face-to-Face Collaborative Task.
Proceedings of the Electronic Speech Signal Processing, 2010
Proceedings of the Analysis of Verbal and Nonverbal Communication and Enactment. The Processing Issues, 2010
Proceedings of the Auditory-Visual Speech Processing, 2010
2009
EURASIP J. Audio Speech Music. Process., 2009
EURASIP J. Audio Speech Music. Process., 2009
Acoustic-to-articulatory inversion using speech recognition and trajectory formation based on phoneme hidden Markov models.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
A trainable trajectory formation model TD-HMM parameterized for the LIPS 2008 challenge.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the International Conference on Auditory-Visual Speech Processing 2008, 2008
Proceedings of the International Conference on Auditory-Visual Speech Processing 2008, 2008
Proceedings of the International Conference on Auditory-Visual Speech Processing 2008, 2008
An Audiovisual Talking Head for Augmented Speech Generation: Models and Animations Based on a Real Speaker's Articulatory Data.
Proceedings of the Articulated Motion and Deformable Objects, 5th International Conference, 2008
2007
Learning optimal audiovisual phasing for an HMM-based control model for facial animation.
Proceedings of the Sixth ISCA Workshop on Speech Synthesis, 2007
Proceedings of the Intelligent Virtual Agents, 7th International Conference, 2007
Scrutinizing Natural Scenes: Controlling the Gaze of an Embodied Conversational Agent.
Proceedings of the Intelligent Virtual Agents, 7th International Conference, 2007
Proceedings of the 2007 IEEE/WIC/ACM International Conference on Web Intelligence and International Conference on Intelligent Agent Technology, 2007
Proceedings of the Auditory-Visual Speech Processing 2007, 2007
Proceedings of the Auditory-Visual Speech Processing 2007, 2007
Proceedings of the Auditory-Visual Speech Processing 2007, 2007
2006
J. Comput. Inf. Technol., 2006
Proceedings of the 15th IEEE International Symposium on Robot and Human Interactive Communication, 2006
Proceedings of the Advances in Multimedia Information Processing, 2006
Does a Virtual Talking Face Generate Proper Multimodal Cues to Draw User's Attention to Points of Interest?
Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006
A joint intelligibility evaluation of French text-to-speech synthesis systems: the EvaSy SUS/ACR campaign.
Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006
Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
Proceedings of the ISCA Tutorial and Research Workshop on Experimental Linguistics, 2006
Proceedings of the ISCA Tutorial and Research Workshop on Experimental Linguistics, 2006
Proceedings of the 14th European Signal Processing Conference, 2006
Proceedings of the Sixth International Conference on Computer and Information Technology (CIT 2006), 2006
2005
Evaluating the pronunciation of proper names by four French grapheme-to-phoneme converters.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005
Proceedings of the Pattern Recognition and Data Mining, 2005
Proceedings of the 13th European Signal Processing Conference, 2005
Basic components of a face-to-face interaction with a conversational agent: mutual attention and deixis.
Proceedings of the 2005 joint conference on Smart objects and ambient intelligence, 2005
Capturing data and realistic 3d models for cued speech analysis and audiovisual synthesis.
Proceedings of the Auditory-Visual Speech Processing 2005, 2005
2004
Proceedings of the Fifth ISCA ITRW on Speech Synthesis, 2004
Evaluation of a Speech Cuer: From Motion Capture to a Concatenative Text-to-cued Speech System.
Proceedings of the Fourth International Conference on Language Resources and Evaluation, 2004
Proceedings of the 2004 International Symposium on Chinese Spoken Language Processing, 2004
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
A trainable prosodic model: learning the contours implementing communicative functions within a superpositional model of intonation.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Proceedings of the Image Analysis and Recognition: International Conference, 2004
Proceedings of the 2004 12th European Signal Processing Conference, 2004
2003
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Proceedings of the 2003 IEEE International Workshop on Analysis and Modeling of Faces and Gestures (AMFG 2003), 2003
2002
Three-dimensional linear articulatory modeling of tongue, lips and face, based on MRI and video images.
J. Phonetics, 2002
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002
2001
Speech Commun., 2001
Proceedings of the 4th ITRW on Speech Synthesis, 2001
Proceedings of the Auditory-Visual Speech Processing, 2001
2000
Proceedings of the Second International Conference on Language Resources and Evaluation, 2000
MOTHER: a new generation of talking heads providing a flexible articulatory control for video-realistic speech animation.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000
1999
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999
Accurate estimation of sinusoidal parameters in an harmonic+noise model for speech synthesis.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999
1998
Objective evaluation of grapheme to phoneme conversion for text-to-speech synthesis in French.
Comput. Speech Lang., 1998
Evaluating the adequacy of synthetic prosody in signaling syntactic boundaries: methodology and first results.
Proceedings of the First International Conference on Language Resources and Evaluation, 1998
Evaluation of grapheme-to-phoneme conversion for text-to-speech synthesis in French.
Proceedings of the First International Conference on Language Resources and Evaluation, 1998
Cooperation and competition of burst and formant transitions for the perception and identification of French stops.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998
Synergy between jaw and lips/tongue movements : consequences in articulatory modelling.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998
1997
Relative contributions of noise burst and vocalic transitions to the perceptual identification of stop consonants.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997
1996
Proceedings of the 4th International Conference on Spoken Language Processing, 1996
Proceedings of the 4th International Conference on Spoken Language Processing, 1996
1995
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995
1994
Speech Communication, 1994
Proceedings of the Second ESCA/IEEE Workshop on Speech Synthesis, 1994
Proceedings of the Second ESCA/IEEE Workshop on Speech Synthesis, 1994
1993
Resonances as possible representation of speech in the auditory-to-articulatory transform.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993
Proceedings of the Third European Conference on Speech Communication and Technology, 1993
1991
Proceedings of the Second European Conference on Speech Communication and Technology, 1991
1990
Automatic labeling of large prosodic databases : tools, methodology and links with a text-to-speech system.
Proceedings of the ESCA Workshop on Speech Synthesis, 1990
Proceedings of the ESCA Workshop on Speech Synthesis, 1990
Automatic segmentation and alignment of continuous speech based on temporal decomposition model.
Proceedings of the First International Conference on Spoken Language Processing, 1990
1989
Integration of rhythmic and syntactic constraints in a model of generation of French prosody.
Speech Commun., 1989
Proceedings of the First European Conference on Speech Communication and Technology, 1989
A new algorithm for temporal decomposition of speech-application to a numerical model of coarticulation.
Proceedings of the IEEE International Conference on Acoustics, 1989
1988
Proceedings of the IEEE International Conference on Acoustics, 1988
1986
Proceedings of the IEEE International Conference on Acoustics, 1986