Denis Jouvet

  • INRIA, France

According to our database, Denis Jouvet authored at least 137 papers between 1984 and 2023.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.




Evaluation of Speaker Anonymization on Emotional Speech.
CoRR, 2023

Self-supervised learning with Diffusion-based multichannel speech enhancement for speaker verification under noisy conditions.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

How to Leverage DNN-based speech enhancement for multi-channel speaker verification?
CoRR, 2022

Privacy-Preserving Speech Representation Learning using Vector Quantization.
CoRR, 2022

Joint Optimization of Diffusion Probabilistic-Based Multichannel Speech Enhancement with Far-Field Speaker Verification.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Learning Noise Robust ResNet-Based Speaker Embedding for Speaker Recognition.
Proceedings of the Odyssey 2022: The Speaker and Language Recognition Workshop, 28 June, 2022

Adapting Language Models When Training on Privacy-Transformed Data.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Barlow Twins self-supervised learning for robust speaker recognition.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Analysis of expressivity transfer in non-autoregressive end-to-end multispeaker TTS systems.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Are disentangled representations all you need to build speaker anonymization systems?
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

A Comprehensive Exploration of Noise Robustness and Noise Compensation in ResNet and TDNN-based Speaker Recognition Systems.
Proceedings of the 30th European Signal Processing Conference, 2022

Multi-stage attention for fine-grained expressivity transfer in multispeaker text-to-speech system.
Proceedings of the 30th European Signal Processing Conference, 2022

Duration modelling and evaluation for Arabic statistical parametric speech synthesis.
Multim. Tools Appl., 2021

On the invertibility of a voice privacy system using embedding alignement.
CoRR, 2021

A Study of F0 Modification for X-Vector Based Speech Pseudonymization Across Gender.
CoRR, 2021

Evaluating X-Vector-Based Speaker Anonymization Under White-Box Assessment.
Proceedings of the Speech and Computer - 23rd International Conference, 2021

Language Recognition on Unknown Conditions: The LORIA-Inria-MULTISPEECH System for AP20-OLR Challenge.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Modeling and Training Strategies for Language Recognition Systems.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Compensate multiple distortions for speaker recognition systems.
Proceedings of the 29th European Signal Processing Conference, 2021

Improving transfer of expressivity for end-to-end multispeaker text-to-speech synthesis.
Proceedings of the 29th European Signal Processing Conference, 2021

On the Invertibility of a Voice Privacy System Using Embedding Alignment.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

Projet AMIS : résumé et traduction automatique de vidéos (AMIS project : automatic summarization and translation of videos).
Proceedings of the Actes de la 6e conférence conjointe Journées d'Études sur la Parole (JEP), 2020

Étude comparative de corrélats prosodiques de marqueurs discursifs français et anglais selon leur fonction pragmatique (Comparative study on prosodic correlates of discourse markers in French and English according to their pragmatic function).
Proceedings of the Actes de la 6e conférence conjointe Journées d'Études sur la Parole (JEP), 2020

Adaptation de domaine non supervisée pour la reconnaissance de la langue par régularisation d'un réseau de neurones (Unsupervised domain adaptation for language identification by regularization of a neural network).
Proceedings of the Actes de la 6e conférence conjointe Journées d'Études sur la Parole (JEP), 2020

Deep Variational Metric Learning for Transfer of Expressivity in Multispeaker Text to Speech.
Proceedings of the Statistical Language and Speech Processing, 2020

Unsupervised Regularization of the Embedding Extractor for Robust Language Identification.
Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020

Achieving Multi-Accent ASR via Unsupervised Acoustic Model Adaptation.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Correlation Between Prosody and Pragmatics: Case Study of Discourse Markers in French and English.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Transfer Learning of the Expressivity Using FLOW Metric Learning in Multispeaker Text-to-Speech Synthesis.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Kaldi-Web: An Installation-Free, On-Device Speech Recognition System.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Metric Learning Loss Functions to Reduce Domain Mismatch in the x-Vector Space for Language Recognition.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Summarizing videos into a target language: Methodology, architectures and evaluation.
J. Intell. Fuzzy Syst., 2019

Speech Processing and Prosody.
Proceedings of the Text, Speech, and Dialogue - 22nd International Conference, 2019

F0 Modeling Using DNN for Arabic Parametric Speech Synthesis.
Proceedings of the Recent Advances in Big Data and Deep Learning, 2019

Extractive Text-Based Summarization of Arabic Videos: Issues, Approaches and Evaluations.
Proceedings of the Arabic Language Processing: From Theory to Practice, 2019

A Fine-Grained Multilingual Analysis Based on the Appraisal Theory: Application to Arabic and English Videos.
Proceedings of the Arabic Language Processing: From Theory to Practice, 2019

Machine Translation on a Parallel Code-Switched Corpus.
Proceedings of the Advances in Artificial Intelligence, 2019

Evaluation of speech unit modelling for HMM-based speech synthesis for Arabic.
Int. J. Speech Technol., 2018

DNN-Based Speech Synthesis for Arabic: Modelling and Evaluation.
Proceedings of the Statistical Language and Speech Processing, 2018

A Proposed Methodology for Subjective Evaluation of Video and Text Summarization.
Proceedings of the Multimedia and Network Information Systems, 2018

An Integrated AMIS Prototype for Automated Summarization and Translation of Newscasts and Reports.
Proceedings of the Multimedia and Network Information Systems, 2018

An enhanced automatic speech recognition system for Arabic.
Proceedings of the Third Arabic Natural Language Processing Workshop, 2017

Analysis and Automatic Classification of Some Discourse Particles on a Large Set of French Spoken Corpora.
Proceedings of the Statistical Language and Speech Processing, 2017

Towards confidence measures on fundamental frequency estimations.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Performance analysis of several pitch detection algorithms on simulated and real noisy speech data.
Proceedings of the 25th European Signal Processing Conference, 2017

Development of the Arabic Loria Automatic Speech Recognition system (ALASR) and its evaluation for Algerian dialect.
Proceedings of the Third International Conference On Arabic Computational Linguistics, 2017

The IFCASL Corpus of French and German Non-native and Native Read Speech.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Nonparametric Uncertainty Estimation and Propagation for Noise Robust ASR.
IEEE ACM Trans. Audio Speech Lang. Process., 2015

Analysis of phone confusion matrices in a manually annotated French-German learner corpus.
Proceedings of the ISCA International Workshop on Speech and Language Technology in Education, 2015

Combining Lexical and Prosodic Features for Automatic Detection of Sentence Modality in French.
Proceedings of the Statistical Language and Speech Processing, 2015

Acoustical Frame Rate and Pronunciation Variant Statistics.
Proceedings of the Statistical Language and Speech Processing, 2015

Discourse Particles in French: Prosodic Parameters Extraction and Analysis.
Proceedings of the Statistical Language and Speech Processing, 2015

Qualitative investigation of the display of speech recognition results for communication with deaf people.
Proceedings of the 6th Workshop on Speech and Language Processing for Assistive Technologies, 2015

Impact of frame rate on automatic speech-text alignment for corpus-based phonetic studies.
Proceedings of the 18th International Congress of Phonetic Sciences, 2015

Detection of Sentence Modality on French Automatic Speech-to-text Transcriptions.
Proceedings of the 1st International Conference on Natural Language and Speech Processing, 2015

Adding New Words into a Language Model using Parameters of Known Words with Similar Behavior.
Proceedings of the 1st International Conference on Natural Language and Speech Processing, 2015

Textual Data Selection for Language Modelling in the Scope of Automatic Speech Recognition.
Proceedings of the 1st International Conference on Natural Language and Speech Processing, 2015

Discriminative uncertainty estimation for noise robust ASR.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Evaluation of PNCC and extended spectral subtraction methods for robust speech recognition.
Proceedings of the 23rd European Signal Processing Conference, 2015

Structured GMM Based on Unsupervised Clustering for Recognizing Adult and Child Speech.
Proceedings of the Statistical Language and Speech Processing, 2014

Designing a Bilingual Speech Corpus for French and German Language Learners: a Two-Step Process.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Hybrid language models for speech transcription.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

About combining forward and backward-based decoders for selecting data for unsupervised training of acoustic models.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Component structuring and trajectory modeling for speech recognition.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Fusion of multiple uncertainty estimators and propagators for noise robust ASR.
Proceedings of the IEEE International Conference on Acoustics, 2014

Extension of uncertainty propagation to dynamic MFCCS for noise robust ASR.
Proceedings of the IEEE International Conference on Acoustics, 2014

Investigating stranded GMM for improving automatic speech recognition.
Proceedings of the 4th Joint Workshop on Hands-free Speech Communication and Microphone Arrays, 2014

Comparison and Analysis of Several Phonetic Decoding Approaches.
Proceedings of the Text, Speech, and Dialogue - 16th International Conference, 2013

A Machine Learning Based Approach for Vocabulary Selection for Speech Transcription.
Proceedings of the Text, Speech, and Dialogue - 16th International Conference, 2013

Analysis and Combination of Forward and Backward Based Decoders for Improved Speech Transcription.
Proceedings of the Text, Speech, and Dialogue - 16th International Conference, 2013

Automatic Detection of the Prosodic Structures of Speech Utterances.
Proceedings of the Speech and Computer - 15th International Conference, 2013

Comparison of approaches for an efficient phonetic decoding.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Combining forward-based and backward-based decoders for improved speech recognition performance.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Gestion d'erreurs pour la fiabilisation des retours automatiques en apprentissage de la prosodie d'une langue seconde.
Trait. Autom. des Langues, 2012

Détection de transcriptions incorrectes de parole non-native dans le cadre de l'apprentissage de langues étrangères (Detection of incorrect transcriptions of non-native speech in the context of foreign language learning) [in French].
Proceedings of the Joint Conference JEP-TALN-RECITAL 2012, 2012

Exploitation d'une marge de tolérance de classification pour améliorer l'apprentissage de modèles acoustiques de classes en reconnaissance de la parole (Exploitation of a classification tolerance margin for improving the estimation of class-based acoustic models for speech recognition) [in French].
Proceedings of the Joint Conference JEP-TALN-RECITAL 2012, 2012

Génération des prononciations de noms propres à l'aide des Champs Aléatoires Conditionnels (Pronunciation generation for proper names using Conditional Random Fields) [in French].
Proceedings of the Joint Conference JEP-TALN-RECITAL 2012, 2012

Combining criteria for the detection of incorrect entries of non-native speech in the context of foreign language learning.
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

Class-based speech recognition using a maximum dissimilarity criterion and a tolerance classification margin.
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

Classification margin for improved class-based speech recognition performance.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Evaluating grapheme-to-phoneme converters in automatic speech recognition context.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Reliability of non-native speech automatic segmentation for prosodic feedback.
Proceedings of the ISCA International Workshop on Speech and Language Technology in Education, 2011

About Handling Boundary Uncertainty in a Speaking Rate Dependent Modeling Approach.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Grapheme-to-Phoneme Conversion Using Conditional Random Fields.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Detailed pronunciation variant modeling for speech transcription.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Modeling inter-speaker variability in speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2008

Introduction to the Special Issue on Intrinsic Speech Variations.
Speech Commun., 2007

Automatic speech recognition and speech variability: A review.
Speech Commun., 2007

On using units trained on foreign data for improved multiple accent speech recognition.
Speech Commun., 2007

Automatic Speech Recognition and Intrinsic Speech Variation.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Using Multilingual Units for Improved Modeling of Pronunciation Variants.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Context dependent "long units" for speech recognition.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Sequential clustering algorithm for Gaussian mixture initialization.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

About improving recognition of spontaneously uttered French city-names.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

Evaluation of a noise-robust DSR front-end on Aurora databases.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Prosodic parameter for speaker identification.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Feature vector selection to improve ASR robustness in noisy conditions.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Noise reduction for noise robust feature extraction for distributed speech recognition.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

On combining confidence measures for improved rejection of incorrect data.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

On combining recognizers for improved recognition of spelled names.
Proceedings of the IEEE International Conference on Acoustics, 2001

An alternative normalization scheme in HMM-based text-dependent speaker verification.
Speech Commun., 2000

Confidence measure and incremental adaptation for the rejection of incorrect data.
Proceedings of the IEEE International Conference on Acoustics, 2000

Detecting the end of spellings using statistics on recognized letter sequences for spelled names recognition.
Proceedings of the IEEE International Conference on Acoustics, 2000

Derivation of the optimal set of phonetic transcriptions for a word from its acoustic realizations.
Speech Commun., 1999

Use of a confidence measure based on frame level likelihood ratios for the rejection of incorrect data.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Recognition of spelled names over the telephone and rejection of data out of the spelling lexicon.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Selective prosodic post-processing for improving recognition of French telephone numbers.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Hypothesis dependent threshold setting for improved out-of-vocabulary data rejection.
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

An algorithm for maximum likelihood estimation of hidden Markov models with unknown state-tying.
IEEE Trans. Speech Audio Process., 1998

Towards improving ASR robustness for PSN and GSM telephone applications.
Speech Commun., 1997

Optimizing feature set for speaker verification.
Pattern Recognit. Lett., 1997

Automatic derivation of multiple variants of phonetic transcriptions from acoustic signals.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Design and analysis of a German telephone speech database for phoneme based training.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Usefulness of phonetic parameters in a rejection procedure of an HMM-based speech recognition system.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Adapting PSN recognition models to the GSM environment by using spectral transformation.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

Deconvolution of telephone line effects for speech recognition.
Speech Commun., 1996

Parameter tying for flexible speech recognition.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Bayesian adaptation of speech recognizers to field speech data.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Comparison of several preprocessing techniques for robust speech recognition over both PSN and GSM networks.
Proceedings of the 8th European Signal Processing Conference, 1996

Operational and experimental French telecommunication services using CNET speech recognition and text-to-speech synthesis.
Speech Commun., 1995

Improving recognition performances on field data with an a-priori segmentation of the speech signal.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

Blind equalization using adaptive filtering for improving speech recognition over telephone.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

Error analysis on field data and improved garbage HMM modelling.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

On using a priori segmentation of the speech signal in an N-best solutions post-processing.
Proceedings of the 1995 International Conference on Acoustics, 1995

Compensation of telephone line effects for robust speech recognition.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Structure of allophonic models and reliable estimation of the contextual parameters.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

On-line adaptation of a speech recognizer to variations in telephone line conditions.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

Segmental post-processing of the n-best solutions in a speech recognition system.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

Application of the n-best solutions algorithm to speaker-independent spelling recognition over the telephone.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

Speaker-independent spelling recognition over the telephone.
Proceedings of the IEEE International Conference on Acoustics, 1993

MAIRIEVOX: A voice-activated information system.
Speech Commun., 1991

Automatic adjustments of the structure of Markov models for speech recognition applications.
Proceedings of the Second European Conference on Speech Communication and Technology, 1991

On the modelization of allophones in an HMM based speech recognition system.
Proceedings of the Second European Conference on Speech Communication and Technology, 1991

An acoustic-phonetic decoder an automatic segmentation algorithm.
Proceedings of the First European Conference on Speech Communication and Technology, 1989

A new network-based speaker-independent connected-word recognition system.
Proceedings of the IEEE International Conference on Acoustics, 1986

One-pass syntax-directed connected-word recognition in a time-sharing environment.
Proceedings of the IEEE International Conference on Acoustics, 1984
