Denis Jouvet

Yvon Keromnes

Mathilde Dargnat

Proceedings of the Actes de la 6e conférence conjointe Journées d'Études sur la Parole (JEP, 2020

Adaptation de domaine non supervisée pour la reconnaissance de la langue par régularisation d'un réseau de neurones (Unsupervised domain adaptation for language identification by regularization of a neural network).

[BibT_eX]

[DOI]

Raphaël Duroselle

Proceedings of the Actes de la 6e conférence conjointe Journées d'Études sur la Parole (JEP, 2020

Deep Variational Metric Learning for Transfer of Expressivity in Multispeaker Text to Speech.

[BibT_eX]

[DOI]

Ajinkya Kulkarni

Vincent Colotte

Proceedings of the Statistical Language and Speech Processing, 2020

Unsupervised Regularization of the Embedding Extractor for Robust Language Identification.

[BibT_eX]

[DOI]

Raphaël Duroselle

Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020

Achieving Multi-Accent ASR via Unsupervised Acoustic Model Adaptation.

[BibT_eX]

[DOI]

M. A. Tugtekin Turan

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Correlation Between Prosody and Pragmatics: Case Study of Discourse Markers in French and English.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Transfer Learning of the Expressivity Using FLOW Metric Learning in Multispeaker Text-to-Speech Synthesis.

[BibT_eX]

[DOI]

Ajinkya Kulkarni

Vincent Colotte

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Kaldi-Web: An Installation-Free, On-Device Speech Recognition System.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Metric Learning Loss Functions to Reduce Domain Mismatch in the x-Vector Space for Language Recognition.

[BibT_eX]

[DOI]

Raphaël Duroselle

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

2019

Summarizing videos into a target language: Methodology, architectures and evaluation.

[BibT_eX]

[DOI]

Amaia Méndez

Elvys Linhares Pontes

Eric SanJuan

Begoña García Zapirain

J. Intell. Fuzzy Syst., 2019

Speech Processing and Prosody.

[BibT_eX]

[DOI]

Proceedings of the Text, Speech, and Dialogue - 22nd International Conference, 2019

\(F_{0}\) Modeling Using DNN for Arabic Parametric Speech Synthesis.

[BibT_eX]

[DOI]

Proceedings of the Recent Advances in Big Data and Deep Learning, 2019

Extractive Text-Based Summarization of Arabic Videos: Issues, Approaches and Evaluations.

[BibT_eX]

[DOI]

Proceedings of the Arabic Language Processing: From Theory to Practice, 2019

A Fine-Grained Multilingual Analysis Based on the Appraisal Theory: Application to Arabic and English Videos.

[BibT_eX]

[DOI]

Proceedings of the Arabic Language Processing: From Theory to Practice, 2019

Machine Translation on a Parallel Code-Switched Corpus.

[BibT_eX]

[DOI]

Proceedings of the Advances in Artificial Intelligence, 2019

2018

Evaluation of speech unit modelling for HMM-based speech synthesis for Arabic.

[BibT_eX]

[DOI]

Int. J. Speech Technol., 2018

DNN-Based Speech Synthesis for Arabic: Modelling and Evaluation.

[BibT_eX]

[DOI]

Proceedings of the Statistical Language and Speech Processing, 2018

A Proposed Methodology for Subjective Evaluation of Video and Text Summarization.

[BibT_eX]

[DOI]

Begoña García Zapirain

Mikolaj Leszczuk

Proceedings of the Multimedia and Network Information Systems, 2018

A First Summarization System of a Video in a Target Language.

[BibT_eX]

[DOI]

Amaia Méndez

Elvys Linhares Pontes

Eric SanJuan

Damian Swist

Begoña García Zapirain

Proceedings of the Multimedia and Network Information Systems, 2018

An Integrated AMIS Prototype for Automated Summarization and Translation of Newscasts and Reports.

[BibT_eX]

[DOI]

Michal Grega

Mikolaj Leszczuk

Elvys Linhares Pontes

Proceedings of the Multimedia and Network Information Systems, 2018

2017

An enhanced automatic speech recognition system for Arabic.

[BibT_eX]

[DOI]

Proceedings of the Third Arabic Natural Language Processing Workshop, 2017

Analysis and Automatic Classification of Some Discourse Particles on a Large Set of French Spoken Corpora.

[BibT_eX]

[DOI]

Proceedings of the Statistical Language and Speech Processing, 2017

Towards confidence measures on fundamental frequency estimations.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Performance analysis of several pitch detection algorithms on simulated and real noisy speech data.

[BibT_eX]

[DOI]

Yves Laprie

Proceedings of the 25th European Signal Processing Conference, 2017

Development of the Arabic Loria Automatic Speech Recognition system (ALASR) and its evaluation for Algerian dialect.

[BibT_eX]

[DOI]

Proceedings of the Third International Conference On Arabic Computational Linguistics, 2017

2016

The IFCASL Corpus of French and German Non-native and Native Read Speech.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

2015

Nonparametric Uncertainty Estimation and Propagation for Noise Robust ASR.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2015

Analysis of phone confusion matrices in a manually annotated French-German learner corpus.

[BibT_eX]

[DOI]

Proceedings of the ISCA International Workshop on Speech and Language Technology in Education, 2015

Combining Lexical and Prosodic Features for Automatic Detection of Sentence Modality in French.

[BibT_eX]

[DOI]

Proceedings of the Statistical Language and Speech Processing, 2015

Acoustical Frame Rate and Pronunciation Variant Statistics.

[BibT_eX]

[DOI]

Proceedings of the Statistical Language and Speech Processing, 2015

Discourse Particles in French: Prosodic Parameters Extraction and Analysis.

[BibT_eX]

[DOI]

Mathilde Dargnat

Proceedings of the Statistical Language and Speech Processing, 2015

Qualitative investigation of the display of speech recognition results for communication with deaf people.

[BibT_eX]

[DOI]

Agnès Piquard-Kipffer

Proceedings of the 6th Workshop on Speech and Language Processing for Assistive Technologies, 2015

Impact of frame rate on automatic speech-text alignment for corpus-based phonetic studies.

[BibT_eX]

[DOI]

Proceedings of the 18th International Congress of Phonetic Sciences, 2015

Detection of Sentence Modality on French Automatic Speech-to-text Transcriptions.

[BibT_eX]

[DOI]

Proceedings of the 1st International Conference on Natural Language and Speech Processing, 2015

Adding New Words into a Language Model using Parameters of Known Words with Similar Behavior.

[BibT_eX]

[DOI]

Proceedings of the 1st International Conference on Natural Language and Speech Processing, 2015

Textual Data Selection for Language Modelling in the Scope of Automatic Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 1st International Conference on Natural Language and Speech Processing, 2015

Discriminative uncertainty estimation for noise robust ASR.

[BibT_eX]

[DOI]

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Evaluation of PNCC and extended spectral subtraction methods for robust speech recognition.

[BibT_eX]

[DOI]

Thibaut Fux

Proceedings of the 23rd European Signal Processing Conference, 2015

2014

Structured GMM Based on Unsupervised Clustering for Recognizing Adult and Child Speech.

[BibT_eX]

[DOI]

Proceedings of the Statistical Language and Speech Processing, 2014

Designing a Bilingual Speech Corpus for French and German Language Learners: a Two-Step Process.

[BibT_eX]

[DOI]

Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Hybrid language models for speech transcription.

[BibT_eX]

[DOI]

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

About combining forward and backward-based decoders for selecting data for unsupervised training of acoustic models.

[BibT_eX]

[DOI]

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Component structuring and trajectory modeling for speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Fusion of multiple uncertainty estimators and propagators for noise robust ASR.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

Extension of uncertainty propagation to dynamic MFCCS for noise robust ASR.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2014

Investigating stranded GMM for improving automatic speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 4th Joint Workshop on Hands-free Speech Communication and Microphone Arrays, 2014

2013

Comparison and Analysis of Several Phonetic Decoding Approaches.

[BibT_eX]

[DOI]

Proceedings of the Text, Speech, and Dialogue - 16th International Conference, 2013

A Machine Learning Based Approach for Vocabulary Selection for Speech Transcription.

[BibT_eX]

[DOI]

David Langlois

Proceedings of the Text, Speech, and Dialogue - 16th International Conference, 2013

Analysis and Combination of Forward and Backward Based Decoders for Improved Speech Transcription.

[BibT_eX]

[DOI]

Proceedings of the Text, Speech, and Dialogue - 16th International Conference, 2013

Automatic Detection of the Prosodic Structures of Speech Utterances.

[BibT_eX]

[DOI]

Proceedings of the Speech and Computer - 15th International Conference, 2013

Comparison of approaches for an efficient phonetic decoding.

[BibT_eX]

[DOI]

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Combining forward-based and backward-based decoders for improved speech recognition performance.

[BibT_eX]

[DOI]

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

2012

Gestion d'erreurs pour la fiabilisation des retours automatiques en apprentissage de la prosodie d'une langue seconde.

[BibT_eX]

[DOI]

Trait. Autom. des Langues, 2012

Détection de transcriptions incorrectes de parole non-native dans le cadre de l'apprentissage de langues étrangères (Detection of incorrect transcriptions of non-native speech in the context of foreign language learning) [in French].

[BibT_eX]

[DOI]

Proceedings of the Joint Conference JEP-TALN-RECITAL 2012, 2012

Exploitation d'une marge de tolérance de classification pour améliorer l'apprentissage de modèles acoustiques de classes en reconnaissance de la parole (Exploitation of a classification tolerance margin for improving the estimation of class-based acoustic models for speech recognition) [in French].

[BibT_eX]

[DOI]

Nicolas Vinuesa

Proceedings of the Joint Conference JEP-TALN-RECITAL 2012, 2012

Génération des prononciations de noms propres à l'aide des Champs Aléatoires Conditionnels (Pronunciation generation for proper names using Conditional Random Fields) [in French].

[BibT_eX]

[DOI]

Proceedings of the Joint Conference JEP-TALN-RECITAL 2012, 2012

Combining criteria for the detection of incorrect entries of non-native speech in the context of foreign language learning.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

Class-based speech recognition using a maximum dissimilarity criterion and a tolerance classification margin.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

Classification margin for improved class-based speech recognition performance.

[BibT_eX]

[DOI]

Nicolas Vinuesa

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Evaluating grapheme-to-phoneme converters in automatic speech recognition context.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011

Reliability of non-native speech automatic segmentation for prosodic feedback.

[BibT_eX]

[DOI]

Proceedings of the ISCA International Workshop on Speech and Language Technology in Education, 2011

About Handling Boundary Uncertainty in a Speaking Rate Dependent Modeling Approach.

[BibT_eX]

[DOI]

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Grapheme-to-Phoneme Conversion Using Conditional Random Fields.

[BibT_eX]

[DOI]

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

2010

Detailed pronunciation variant modeling for speech transcription.

[BibT_eX]

[DOI]

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

2008

Modeling inter-speaker variability in speech recognition.

[BibT_eX]

[DOI]

Gwenael Cloarec

Proceedings of the IEEE International Conference on Acoustics, 2008

2007

Introduction to the Special Issue on Intrinsic Speech Variations.

[BibT_eX]

[DOI]

Speech Commun., 2007

Automatic speech recognition and speech variability: A review.

[BibT_eX]

[DOI]

Speech Commun., 2007

On using units trained on foreign data for improved multiple accent speech recognition.

[BibT_eX]

[DOI]

Speech Commun., 2007

2006

Automatic Speech Recognition and Intrinsic Speech Variation.

[BibT_eX]

[DOI]

Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Using Multilingual Units for Improved Modeling of Pronunciation Variants.

[BibT_eX]

[DOI]

Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2004

Context dependent "long units" for speech recognition.

[BibT_eX]

[DOI]

Ronaldo O. Messina

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Sequential clustering algorithm for Gaussian mixture initialization.

[BibT_eX]

[DOI]

Ronaldo O. Messina

Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003

About improving recognition of spontaneously uttered French city-names.

[BibT_eX]

[DOI]

Lionel Delphin-Poulat

Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2002

Evaluation of a noise-robust DSR front-end on Aurora databases.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Prosodic parameter for speaker identification.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

2001

Feature vector selection to improve ASR robustness in noisy conditions.

[BibT_eX]

[DOI]

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Noise reduction for noise robust feature extraction for distributed speech recognition.

[BibT_eX]

[DOI]

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

On combining confidence measures for improved rejection of incorrect data.

[BibT_eX]

[DOI]

Guy Mercier

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

On combining recognizers for improved recognition of spelled names.

[BibT_eX]

[DOI]

S. Droguet

Proceedings of the IEEE International Conference on Acoustics, 2001

2000

An alternative normalization scheme in HMM-based text-dependent speaker verification.

[BibT_eX]

[DOI]

O. Collin

Speech Commun., 2000

Confidence measure and incremental adaptation for the rejection of incorrect data.

[BibT_eX]

[DOI]

Nicolas Moreau

Proceedings of the IEEE International Conference on Acoustics, 2000

Detecting the end of spellings using statistics on recognized letter sequences for spelled names recognition.

[BibT_eX]

[DOI]

Stephan Hanel

Proceedings of the IEEE International Conference on Acoustics, 2000

1999

Derivation of the optimal set of phonetic transcriptions for a word from its acoustic realizations.

[BibT_eX]

[DOI]

Houda Mokbel

Speech Commun., 1999

Use of a confidence measure based on frame level likelihood ratios for the rejection of incorrect data.

[BibT_eX]

[DOI]

Nicolas Moreau

Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Recognition of spelled names over the telephone and rejection of data out of the spelling lexicon.

[BibT_eX]

[DOI]

Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Selective prosodic post-processing for improving recognition of French telephone numbers.

[BibT_eX]

[DOI]

Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Hypothesis dependent threshold setting for improved out-of-vocabulary data rejection.

[BibT_eX]

[DOI]

Guy Mercier

Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

1998

An algorithm for maximum likelihood estimation of hidden Markov models with unknown state-tying.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 1998

1997

Towards improving ASR robustness for PSN and GSM telephone applications.

[BibT_eX]

[DOI]

Speech Commun., 1997

Optimizing feature set for speaker verification.

[BibT_eX]

[DOI]

Pattern Recognit. Lett., 1997

Automatic derivation of multiple variants of phonetic transcriptions from acoustic signals.

[BibT_eX]

[DOI]

Houda Mokbel

Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Design and analysis of a German telephone speech database for phoneme based training.

[BibT_eX]

[DOI]

Stefan Feldes

Bernhard Kaspar

Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Usefulness of phonetic parameters in a rejection procedure of an HMM-based speech recognition system.

[BibT_eX]

[DOI]

Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Adapting PSN recognition models to the GSM environment by using spectral transformation.

[BibT_eX]

[DOI]

Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

1996

Deconvolution of telephone line effects for speech recognition.

[BibT_eX]

[DOI]

Chafic Mokbel

Speech Commun., 1996

Parameter tying for flexible speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Bayesian adaptation of speech recognizers to field speech data.

[BibT_eX]

[DOI]

Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Comparison of several preprocessing techniques for robust speech recognition over both PSN and GSM networks.

[BibT_eX]

[DOI]

Proceedings of the 8th European Signal Processing Conference, 1996

1995

Operational and experimental French telecommunication services using CNET speech recognition and text-to-speech synthesis.

[BibT_eX]

[DOI]

Speech Commun., 1995

Improving recognition performances on field data with an a-priori segmentation of the speech signal.

[BibT_eX]

[DOI]

Thierry Moudenc

Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

Blind equalization using adaptive filtering for improving speech recognition over telephone.

[BibT_eX]

[DOI]

Chafic Mokbel

Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

Error analysis on field data and improved garbage HMM modelling.

[BibT_eX]

[DOI]

Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

On using a priori segmentation of the speech signal in an N-best solutions post-processing.

[BibT_eX]

[DOI]

Thierry Moudenc

Proceedings of the 1995 International Conference on Acoustics, 1995

1994

Compensation of telephone line effects for robust speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Structure of allophonic models and reliable estimation of the contextual parameters.

[BibT_eX]

[DOI]

A. Stouff

Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

1993

On-line adaptation of a speech recognizer to variations in telephone line conditions.

[BibT_eX]

[DOI]

Chafic Mokbel

Proceedings of the Third European Conference on Speech Communication and Technology, 1993

Segmental post-processing of the n-best solutions in a speech recognition system.

[BibT_eX]

[DOI]

M. N. Lokbani

Proceedings of the Third European Conference on Speech Communication and Technology, 1993

Application of the n-best solutions algorithm to speaker-independent spelling recognition over the telephone.

[BibT_eX]

[DOI]

M. N. Lokbani

Proceedings of the Third European Conference on Speech Communication and Technology, 1993

Speaker-independent spelling recognition over the telephone.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 1993

1991

MAIRIEVOX: A voice-activated information system.

[BibT_eX]

[DOI]

Christian Gagnoulet

J. Damay

Speech Commun., 1991

Automatic adjustments of the structure of Markov models for speech recognition applications.

[BibT_eX]

[DOI]

Laurent Mauuary

Proceedings of the Second European Conference on Speech Communication and Technology, 1991

On the modelization of allophones in an HMM based speech recognition system.

[BibT_eX]

[DOI]

Proceedings of the Second European Conference on Speech Communication and Technology, 1991

1989

An acoustic-phonetic decoder an automatic segmentation algorithm.

[BibT_eX]

[DOI]

V. Le Maire

Régine André-Obrecht

Proceedings of the First European Conference on Speech Communication and Technology, 1989

1986

A new network-based speaker-independent connected-word recognition system.

[BibT_eX]

[DOI]

Dominique Dubois

Proceedings of the IEEE International Conference on Acoustics, 1986

1984

One-pass syntax-directed connected-word recognition in a time-sharing environment.

[BibT_eX]

[DOI]