Thierry Dutoit
Orcid: 0000-0001-7024-2150
According to our database, Thierry Dutoit authored at least 225 papers between 1993 and 2024.
Bibliography
2024
Latent Space Interpolation of Synthesizer Parameters Using Timbre-Regularized Auto-Encoders.
IEEE ACM Trans. Audio Speech Lang. Process., 2024
TIPAA-SSL: Text Independent Phone-to-Audio Alignment based on Self-Supervised Learning and Knowledge Transfer.
CoRR, 2024
Self-Avatar Animation in Virtual Reality: Impact of Motion Signals Artifacts on the Full-Body Pose Reconstruction.
CoRR, 2024
Proceedings of the 18th IEEE International Conference on Automatic Face and Gesture Recognition, 2024
2023
A Recipe for Efficient SBIR Models: Combining Relative Triplet Loss with Batch Normalization and Knowledge Distillation.
CoRR, 2023
Proceedings of the 2023 ACM International Conference on Interactive Media Experiences, 2023
Validating Objective Evaluation Metric: Is Fréchet Motion Distance able to Capture Foot Skating Artifacts?
Proceedings of the 2023 ACM International Conference on Interactive Media Experiences, 2023
Objective Evaluation Metric for Motion Generative Models: Validating Fréchet Motion Distance on Foot Skating and Over-smoothing Artifacts.
Proceedings of the 16th ACM SIGGRAPH Conference on Motion, Interaction and Games, 2023
The Limitations of Current Similarity-Based Objective Metrics in the Context of Human-Agent Interaction Applications.
Proceedings of the International Conference on Multimodal Interaction, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
Self-Avatar's Animation in VR: Extending Sparse Motion Features with Cartesian Coordinates in Transformer-based Model.
Proceedings of the 34th British Machine Vision Conference Workshop Proceedings, 2023
Cardiotocography Signal Abnormality Detection Based on Deep Semi-Unsupervised Learning.
Proceedings of the IEEE/ACM 10th International Conference on Big Data Computing, 2023
2022
IEEE Trans. Circuits Syst. Video Technol., 2022
Informatics, 2022
Analysis of Co-Laughter Gesture Relationship on RGB videos in Dyadic Conversation Context.
CoRR, 2022
CoRR, 2022
Proceedings of the SIGGRAPH '22: Special Interest Group on Computer Graphics and Interactive Techniques Conference, Posters, Vancouver BC Canada, August 7, 2022
Spatio-Temporal Analysis of Transformer based Architecture for Attention Estimation from EEG.
Proceedings of the 26th International Conference on Pattern Recognition, 2022
Towards Lightweight Neural Animation: Exploration of Neural Network Pruning in Mixture of Experts-based Animation Models.
Proceedings of the 17th International Joint Conference on Computer Vision, 2022
Proceedings of the 44th Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2022
Proceedings of the CBMI 2022: International Conference on Content-based Multimedia Indexing, Graz, Austria, September 14, 2022
Proceedings of the 10th International Conference on Affective Computing and Intelligent Interaction, 2022
2021
ICE-Talk 2: Interface for Controllable Expressive TTS with perceptual assessment tool.
Softw. Impacts, 2021
J. Multimodal User Interfaces, 2021
Analysis and Assessment of Controllability of an Expressive Deep Learning-Based TTS System.
Informatics, 2021
Proceedings of the 24th International Conference on Digital Audio Effects, 2021
2020
Image Vis. Comput., 2020
Detection and identification of European woodpeckers with deep convolutional neural networks.
Ecol. Informatics, 2020
Proceedings of the 8th IEEE International Conference on Serious Games and Applications for Health, 2020
Analytic vs. holistic approaches for the live search of sound presets using graphical interpolation.
Proceedings of the 20th International Conference on New Interfaces for Musical Expression, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Proceedings of the Companion of the 2020 ACM/IEEE International Conference on Human-Robot Interaction, 2020
Unsupervised depth prediction from monocular sequences: Improving performances through instance segmentation.
Proceedings of the 17th Conference on Computer and Robot Vision, 2020
An Experimental Study of the Impact of Pre-Training on the Pruning of a Convolutional Neural Network.
Proceedings of the APPIS 2020: 3rd International Conference on Applications of Intelligent Systems, 2020
Proceedings of the IEEE International Conference on Artificial Intelligence and Virtual Reality, 2020
2019
The Theory behind Controllable Expressive Speech Synthesis: a Cross-disciplinary Approach.
CoRR, 2019
Visualization and Interpretation of Latent Spaces for Controlling Expressive Speech Synthesis Through Audio Analysis.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the Intelligent Systems and Applications, 2019
Proceedings of the Intelligent Systems and Applications, 2019
Proceedings of the Computer Vision Systems, 12th International Conference, 2019
An End-to-end Network to Synthesize Intonation Using a Generalized Command Response Model.
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the 8th International Conference on Affective Computing and Intelligent Interaction Workshops and Demos, 2019
2018
The Emotional Voices Database: Towards Controlling the Emotion Dimension in Voice Generation Systems.
CoRR, 2018
Proceedings of the Statistical Language and Speech Processing, 2018
Proceedings of the International Conference on 3D Immersion, 2018
Proceedings of the 13th IEEE International Conference on Automatic Face & Gesture Recognition, 2018
2017
3D skeleton-based action recognition by representing motion capture sequences as 2D-RGB images.
Comput. Animat. Virtual Worlds, 2017
Amused speech components analysis and classification: Towards an amusement arousal level assessment system.
Comput. Electr. Eng., 2017
Proceedings of the Statistical Language and Speech Processing, 2017
Proceedings of the Statistical Language and Speech Processing, 2017
Morphology Independent Feature Engineering in Motion Capture Database for Gesture Evaluation.
Proceedings of the 4th International Conference on Movement Computing, 2017
Investigating the impact of the training data volume for robust speech recognition using multi-task learning.
Proceedings of the 2017 IEEE International Symposium on Signal Processing and Information Technology, 2017
Portable C++ Framework for Low-Latency Musical Touch Interaction with Geometrical Shapes.
Proceedings of the 2017 International Computer Music Conference, 2017
2016
Proceedings of the Toward Robotic Socially Believable Behaving Systems - Volume I, 2016
Identification of European woodpecker species in audio recordings from their drumming rolls.
Ecol. Informatics, 2016
I-Vector estimation as auxiliary task for Multi-Task Learning based acoustic modeling for automatic speech recognition.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016
Proceedings of the 23rd International Conference on Pattern Recognition, 2016
Towards a listening agent: a system generating audiovisual laughs and smiles to show interest.
Proceedings of the 18th ACM International Conference on Multimodal Interaction, 2016
Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2016
Proceedings of the 24th European Signal Processing Conference, 2016
Audio affect burst synthesis: A multilevel synthesis system for emotional expressions.
Proceedings of the 24th European Signal Processing Conference, 2016
Proceedings of the 24th European Symposium on Artificial Neural Networks, 2016
Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems, 2016
A Semantic and Content-Based Search User Interface for Browsing Large Collections of Foley Sounds.
Proceedings of the Audio Mostly 2016, Norrköping, Sweden, October 4-6, 2016, 2016
2015
EAI Endorsed Trans. Creative Technol., 2015
Proceedings of the 17th IEEE International Workshop on Multimedia Signal Processing, 2015
Proceedings of the 8th ACM SIGGRAPH Conference on Motion in Games, 2015
UMons at MediaEval 2015 Affective Impact of Movies Task including Violent Scenes Detection.
Proceedings of the Working Notes Proceedings of the MediaEval 2015 Workshop, 2015
An HMM approach for synthesizing amused speech with a controllable intensity of smile.
Proceedings of the IEEE International Symposium on Signal Processing and Information Technology, 2015
Towards a level assessment system of amusement in speech signals: Amused speech components classification.
Proceedings of the IEEE International Symposium on Signal Processing and Information Technology, 2015
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
Shaking and speech-smile vowels classification: An attempt at amusement arousal estimation from speech signals.
Proceedings of the 2015 IEEE Global Conference on Signal and Information Processing, 2015
Proceedings of the 11th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 2015
Proceedings of the 23rd European Signal Processing Conference, 2015
Proceedings of the 23rd European Signal Processing Conference, 2015
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015
2014
IEEE J. Sel. Top. Signal Process., 2014
Neurocomputing, 2014
Comput. Speech Lang., 2014
Tangible needle, digital haystack: tangible interfaces for reusing media content organized by similarity.
Proceedings of the Eighth International Conference on Tangible, 2014
Scenarizing CADastre Exquisse: A Crossover between Snoezeling in Hospitals/Domes, and Authoring/Experiencing Soundful Comic Strips.
Proceedings of the MultiMedia Modeling - 20th Anniversary International Conference, 2014
Proceedings of the MultiMedia Modeling - 20th Anniversary International Conference, 2014
The AV-LASYN Database: A synchronous corpus of audio and 3D facial marker data for audio-visual laughter synthesis.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014
Proceedings of the 15th International Society for Music Information Retrieval Conference, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the Audio Mostly 2014, AM '14, 2014
2013
IEEE J. Biomed. Health Informatics, 2013
RARE2012: A multi-scale rarity-based saliency detection with its comparative statistical analysis.
Signal Process. Image Commun., 2013
Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013
Mage - reactive articulatory feature control of HMM-based parametric speech synthesis.
Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013
Proceedings of the 13th International Conference on New Interfaces for Musical Expression, 2013
Proceedings of the Advances in Multimedia Modeling, 19th International Conference, 2013
MAGEFACE: Performative Conversion of Facial Characteristics into Speech Synthesis Parameters.
Proceedings of the Intelligent Technologies for Interactive Entertainment, 2013
Proceedings of the Intelligent Technologies for Interactive Entertainment, 2013
MashtaCycle: On-Stage Improvised Audio Collage by Content-Based Similarity and Gesture Recognition.
Proceedings of the Intelligent Technologies for Interactive Entertainment, 2013
A quantitative comparison of glottal closure instant estimation algorithms on a large variety of singing sounds.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the IEEE International Conference on Image Processing, 2013
Proceedings of the IEEE International Conference on Image Processing, 2013
Proceedings of the IEEE International Conference on Computer Vision, 2013
Proceedings of the IEEE International Conference on Acoustics, 2013
A comparative study of pitch extraction algorithms on a large variety of singing sounds.
Proceedings of the IEEE International Conference on Acoustics, 2013
A quantitative comparison of the most sophisticated EOG-based eye movement recognition techniques.
Proceedings of the 2013 IEEE Symposium on Computational Intelligence, 2013
Automatic Phonetic Transcription of Laughter and Its Application to Laughter Synthesis.
Proceedings of the 2013 Humaine Association Conference on Affective Computing and Intelligent Interaction, 2013
2012
Continuous Control of Style and Style Transitions through Linear Interpolation in Hidden Markov Model Based Walk Synthesis.
Trans. Comput. Sci., 2012
IEEE Trans. Speech Audio Process., 2012
IEEE Trans. Speech Audio Process., 2012
EURASIP J. Adv. Signal Process., 2012
Comput. Speech Lang., 2012
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012
Proceedings of the 12th International Conference on New Interfaces for Musical Expression, 2012
Proceedings of the 12th International Conference on New Interfaces for Musical Expression, 2012
Proceedings of the Motion in Games - 5th International Conference, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the 2012 International Joint Conference on Neural Networks (IJCNN), 2012
Proceedings of the 19th IEEE International Conference on Image Processing, 2012
Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2012
EEG and Human Locomotion - Descending Commands and Sensory Feedback should be Disentangled from Artifacts Thanks to New Experimental Protocols Position Paper.
Proceedings of the BIOSIGNALS 2012, 2012
Proceedings of the Computer Vision - ACCV 2012, 2012
2011
Causal-anticausal decomposition of speech using complex cepstrum for glottal source estimation.
Speech Commun., 2011
J. Multimodal User Interfaces, 2011
Optimizing the Performances of a P300-Based Brain-Computer Interface in Ambulatory Conditions.
IEEE J. Emerg. Sel. Topics Circuits Syst., 2011
Proceedings of the International Conference on Computer Graphics and Interactive Techniques, 2011
Proceedings of the Advances in Nonlinear Speech Processing, 2011
Proceedings of the Advances in Nonlinear Speech Processing, 2011
Proceedings of the Intelligent Technologies for Interactive Entertainment, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the Computer Vision Systems - 8th International Conference, 2011
Analysis-by-Performance: Gesturally-Controlled Voice Synthesis as an Input for Modelling of Vibrato in Singing.
Proceedings of the 2011 International Computer Music Conference, 2011
Proceedings of the IEEE International Conference on Acoustics, 2011
Proceedings of the 19th European Signal Processing Conference, 2011
Automatic sleep spindles detection - Overview and development of a standard proposal assessment method.
Proceedings of the 33rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2011
Continuous Control of Style through Linear Interpolation in Hidden Markov Model Based Stylistic Walk Synthesis.
Proceedings of the 2011 International Conference on Cyberworlds, 2011
Proceedings of the 2011 IEEE Symposium on Computational Intelligence, 2011
ECG Artifact Removal from Surface EMG Signals by Combining Empirical Mode Decomposition and Independent Component Analysis.
Proceedings of the BIOSIGNALS 2011, 2011
A Phonetic Analysis of Natural Laughter, for Use in Automatic Laughter Processing Systems.
Proceedings of the Affective Computing and Intelligent Interaction, 2011
Proceedings of the Robot-Human Teamwork in Dynamic Adverse Environment, 2011
2010
J. Multimodal User Interfaces, 2010
Comput. Methods Programs Biomed., 2010
Proceedings of the Seventh ISCA Tutorial and Research Workshop on Speech Synthesis, 2010
DeviceCycle: Rapid and Reusable Prototyping of Gestural Interfaces, Applied to Audio Browsing by Similarity.
Proceedings of the 10th International Conference on New Interfaces for Musical Expression, 2010
Proceedings of the Motion in Games - Third International Conference, 2010
Proceedings of the International Conference on Language Resources and Evaluation, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 18th European Signal Processing Conference, 2010
2009
On the Use of the Correlation between Acoustic Descriptors for the Normal/Pathological Voices Discrimination.
EURASIP J. Adv. Signal Process., 2009
Proceedings of the Advances in Nonlinear Speech Processing, 2009
Advanced Techniques for Vertical Tablet Playing A Overview of Two Years of Practicing the HandSketch 1.x.
Proceedings of the 9th International Conference on New Interfaces for Musical Expression, 2009
On the mutual information of glottal source estimation techniques for the automatic detection of speech pathologies.
Proceedings of the Sixth International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, 2009
A deterministic plus stochastic model of the residual signal for improved parametric speech synthesis.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
On the mutual information between source and filter contributions for voice pathology detection.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Proceedings of the 2009 IEEE International Conference on Robotics and Automation, 2009
Using a pitch-synchronous residual codebook for hybrid HMM/frame selection speech synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2009
Proceedings of the 17th European Signal Processing Conference, 2009
2008
Cancelling ECG Artifacts in EEG Using a Modified Independent Component Analysis Approach.
EURASIP J. Adv. Signal Process., 2008
Comput. Methods Programs Biomed., 2008
Glottal Source Estimation Robustness - A Comparison of Sensitivity of Voice Source Estimation Techniques.
Proceedings of the SIGMAP 2008, 2008
Proceedings of the 10th International Conference on Multimodal Interfaces, 2008
Voice source parameters estimation by fitting the glottal formant and the inverse filtering open phase.
Proceedings of the 2008 16th European Signal Processing Conference, 2008
2007
J. Multimodal User Interfaces, 2007
Proceedings of the Advances in Nonlinear Speech Processing, 2007
HandSketch Bi-Manual Controller Investigation on Expressive Control Issues of an Augmented Tablet.
Proceedings of the Seventh International Conference on New Interfaces for Musical Expression, 2007
Improvement of source-tract decomposition of speech using analogy with LF model for glottal source and tube model for vocal tract.
Proceedings of the Fifth International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications, 2007
RAMCESS/handsketch: a multi-representation framework for realtime and expressive singing synthesis.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Causal/anticausal Decomposition for mixed-phase Description of brass and Bowed String sounds.
Proceedings of the 2007 International Computer Music Conference, 2007
Proceedings of the IEEE International Conference on Acoustics, 2007
2006
IEEE Trans. Speech Audio Process., 2006
Dynamic Bayesian Networks for NLU Simulation with Applications to Dialog Optimal Strategy Learning.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
Proceedings of the 28th International Conference of the IEEE Engineering in Medicine and Biology Society, 2006
2005
Zeros of Z-transform representation with application to source-filter separation in speech.
IEEE Signal Process. Lett., 2005
Proceedings of the Progress in Nonlinear Speech Processing, 2005
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005
Proceedings of the 13th European Signal Processing Conference, 2005
Proceedings of the 13th European Signal Processing Conference, 2005
2004
Proceedings of the Nonlinear Speech Modeling and Applications, 2004
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Proceedings of the 8th International Conference on Spoken Language Processing, 2004
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004
Appropriate windowing for group delay analysis and roots of z-transform of speech signals.
Proceedings of the 2004 12th European Signal Processing Conference, 2004
2003
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003
Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, 2003
2001
Proceedings of the 4th ITRW on Speech Synthesis, 2001
Proceedings of the 4th ITRW on Speech Synthesis, 2001
2000
Proceedings of the Second International Conference on Language Resources and Evaluation, 2000
1998
Proceedings of the Third ESCA/COCOSDA Workshop on Speech Synthesis, 1998
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998
Comparison of two different text-to-speech alignment systems: Speech synthesis based vs. hybrid HMM/ANN.
Proceedings of the 9th European Signal Processing Conference, 1998
1997
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997
1996
Speech Commun., 1996
The MBROLA project: towards a set of high quality speech synthesizers free of use for non commercial purposes.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996
1994
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994
1993
MBR-PSOLA: Text-To-Speech synthesis based on an MBE re-synthesis of the segments database.
Speech Commun., 1993
An analysis of the performances of the MBE model when used in the context of a text-to-speech system.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993