Yannick Estève

Orcid: 0000-0002-3656-8883

Affiliations:
  • University of Le Mans, France


According to our database1, Yannick Estève authored at least 179 papers between 1999 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
LeBenchmark 2.0: A standardized, replicable and enhanced framework for self-supervised representations of French speech.
Comput. Speech Lang., 2024

An Analysis of Linear Complexity Attention Substitutes with BEST-RQ.
CoRR, 2024

Analyzing Speech Unit Selection for Textless Speech-to-Speech Translation.
CoRR, 2024

Open-Source Conversational AI with SpeechBrain 1.0.
CoRR, 2024

A dual task learning approach to fine-tune a multilingual semantic speech encoder for Spoken Language Understanding.
CoRR, 2024

Performance Analysis of Speech Encoders for Low-Resource SLU and ASR in Tunisian Dialect.
Proceedings of The Second Arabic Natural Language Processing Conference, 2024

Investigating Low-Cost LLM Annotation for Spoken Dialogue Understanding Datasets.
Proceedings of the Text, Speech, and Dialogue - 27th International Conference, 2024

Implémentation ouverte et étude de BEST-RQ pour le traitement de la parole.
Proceedings of the Actes des 35èmes Journées d'Études sur la Parole, 2024

Vérification automatique de la voix de locuteurs après resynthèse à l'aide de PPG.
Proceedings of the Actes des 35èmes Journées d'Études sur la Parole, 2024

Automatic Voice Identification after Speech Resynthesis using PPG.
Proceedings of the Odyssey 2024: The Speaker and Language Recognition Workshop, 2024

MSP-Podcast SER Challenge 2024: L'antenne du Ventoux Multimodal Self-Supervised Learning for Speech Emotion Recognition.
Proceedings of the Odyssey 2024: The Speaker and Language Recognition Workshop, 2024

Open Implementation and Study of Best-RQ for Speech Processing.
Proceedings of the IEEE International Conference on Acoustics, 2024

Sonos Voice Control Bias Assessment Dataset: A Methodology for Demographic Bias Assessment in Voice Assistants.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

TARIC-SLU: A Tunisian Benchmark Dataset for Spoken Language Understanding.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023
Speech and multilingual natural language framework for speaker change detection and diarization.
Expert Syst. Appl., 2023

Is one brick enough to break the wall of spoken dialogue state tracking?
CoRR, 2023

Acoustic and linguistic representations for speech continuous emotion recognition in call center conversations.
CoRR, 2023

Some voices are too common: Building fair speech recognition systems using the Common Voice dataset.
CoRR, 2023

OLISIA: a Cascade System for Spoken Dialogue State Tracking.
CoRR, 2023

Learning Multilingual Expressive Speech Representation for Prosody Prediction without Parallel Data.
Proceedings of the 12th ISCA Speech Synthesis Workshop, 2023

ON-TRAC Consortium Systems for the IWSLT 2023 Dialectal and Low-resource Speech Translation Tasks.
Proceedings of the 20th International Conference on Spoken Language Translation, 2023


Some Voices are Too Common: Building Fair Speech Recognition Systems Using the CommonVoice Dataset.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Semantic Enrichment Towards Efficient Speech Representations.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Federated Learning for ASR Based on wav2vec 2.0.
Proceedings of the IEEE International Conference on Acoustics, 2023

Improving Accented Speech Recognition with Multi-Domain Training.
Proceedings of the IEEE International Conference on Acoustics, 2023

Specialized Semantic Enrichment of Speech Representations.
Proceedings of the IEEE International Conference on Acoustics, 2023

Enhancing Expressivity Transfer in Textless Speech-to-Speech Translation.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
On the Use of Semantically-Aligned Speech Representations for Spoken Language Understanding.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Impact Analysis of the Use of Speech and Language Models Pretrained by Self-Supersivion for Spoken Language Understanding.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

The Spoken Language Understanding MEDIA Benchmark Dataset in the Era of Deep Learning: data updates, training and evaluation tools.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Speech Resources in the Tamasheq Language.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

ON-TRAC Consortium Systems for the IWSLT 2022 Dialect and Low-resource Speech Translation Tasks.
Proceedings of the 19th International Conference on Spoken Language Translation, 2022


End-to-end model for named entity recognition from speech without paired training data.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

A Study of Gender Impact in Self-supervised Models for Speech-to-Text Systems.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Privacy Attacks for Automatic Speech Recognition Acoustic Models in A Federated Learning Framework.
Proceedings of the IEEE International Conference on Acoustics, 2022

Retrieving Speaker Information from Personalized Acoustic Models for Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech.
CoRR, 2021

Study on Acoustic Model Personalization in a Context of Collaborative Learning Constrained by Privacy Preservation.
Proceedings of the Speech and Computer - 23rd International Conference, 2021

Where Are We in Semantic Concept Extraction for Spoken Language Understanding?
Proceedings of the Speech and Computer - 23rd International Conference, 2021

On the Use of Self-Supervised Pre-Trained Acoustic and Linguistic Features for Continuous Speech Emotion Recognition.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Task Agnostic and Task Specific Self-Supervised Learning from Speech with LeBenchmark.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

ON-TRAC' systems for the IWSLT 2021 low-resource speech translation and multilingual speech translation shared tasks.
Proceedings of the 18th International Conference on Spoken Language Translation, 2021

Impact of Encoding and Segmentation Strategies on End-to-End Simultaneous Speech Translation.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

<i>LeBenchmark</i>: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

End2End Acoustic to Semantic Transduction.
Proceedings of the IEEE International Conference on Acoustics, 2021

An Empirical Study of End-To-End Simultaneous Speech Translation Decoding Strategies.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
A study of continuous space word and sentence representations applied to ASR error detection.
Speech Commun., 2020

Exploring Gaussian mixture model framework for speaker adaptation of deep neural network acoustic models.
CoRR, 2020

Prédiction continue de la satisfaction et de la frustration dans des conversations de centre d'appels (AlloSat : A New Call Center French Corpus for Affect Analysis).
Proceedings of the Actes de la 6e conférence conjointe Journées d'Études sur la Parole (JEP, 2020

Où en sommes-nous dans la reconnaissance des entités nommées structurées à partir de la parole ? (Where are we in Named Entity Recognition from speech ?).
Proceedings of the Actes de la 6e conférence conjointe Journées d'Études sur la Parole (JEP, 2020

Leverage Unlabeled Data for Abstractive Speech Summarization with Self-supervised Learning and Back-Summarization.
Proceedings of the Speech and Computer - 22nd International Conference, 2020

Multi-corpus Experiment on Continuous Speech Emotion Recognition: Convolution or Recurrence?
Proceedings of the Speech and Computer - 22nd International Conference, 2020

Align then Summarize: Automatic Alignment Methods for Summarization Corpus Creation.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

A Multimodal Educational Corpus of Oral Courses: Annotation, Analysis and Case Study.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

AlloSat: A New Call Center French Corpus for Satisfaction and Frustration Analysis.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Where are we in Named Entity Recognition from Speech?
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Toward Qualitative Evaluation of Embeddings for Arabic Sentiment Analysis.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

ON-TRAC Consortium for End-to-End and Simultaneous Speech Translation Challenge Tasks at IWSLT 2020.
Proceedings of the 17th International Conference on Spoken Language Translation, 2020

Investigating Self-Supervised Pre-Training for End-to-End Speech Translation.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Confidence Measure for Speech-to-Concept End-to-End Spoken Language Understanding.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Dialogue History Integration into End-to-End Signal-to-Concept Spoken Language Understanding Systems.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Error Analysis Applied to End-to-End Spoken Language Understanding.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
ON-TRAC Consortium End-to-End Speech Translation Systems for the IWSLT 2019 Shared Task.
CoRR, 2019

Apport de l'adaptation automatique des modèles de langage pour la reconnaissance de la parole: évaluation qualitative extrinsèque dans un contexte de traitement de cours magistraux (Contribution of automatic adaptation of language models for speech recognition : extrinsic qualitative evaluation in a context of educational courses).
Proceedings of the Actes de la Conférence sur le Traitement Automatique des Langues Naturelles (TALN) PFIA 2019. Volume II : Articles courts, 2019

Curriculum d'apprentissage : reconnaissance d'entités nommées pour l'extraction de concepts sémantiques (Curriculum learning : named entity recognition for semantic concept extraction).
Proceedings of the Actes de la Conférence sur le Traitement Automatique des Langues Naturelles (TALN) PFIA 2019. Volume I : Articles longs, 2019

Plongements lexicaux spécifiques à la langue arabe : application à l'analyse d'opinions (Arabic-specific embedddings : application in Sentiment Analysis).
Proceedings of the Actes de la Conférence sur le Traitement Automatique des Langues Naturelles (TALN) PFIA 2019. Volume II : Articles courts, 2019

Recent Advances in End-to-End Spoken Language Understanding.
Proceedings of the Statistical Language and Speech Processing, 2019

Investigating Adaptation and Transfer Learning for End-to-End Spoken Language Understanding from Speech.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Qualitative Evaluation of ASR Adaptation in a Lecture Context: Application to the PASTEL Corpus.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Curriculum-Based Transfer Learning for an Effective End-to-End Spoken Language Understanding and Domain Portability.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

An Empirical Evaluation of Arabic-Specific Embeddings for Sentiment Analysis.
Proceedings of the Arabic Language Processing: From Theory to Practice, 2019

2018
Automatic speech recognition system for Tunisian dialect.
Lang. Resour. Evaluation, 2018

End-to-end named entity extraction from speech.
CoRR, 2018

Le corpus PASTEL pour le traitement automatique de cours magistraux (PASTEL corpus for automatic processing of lectures).
Proceedings of the Actes de la Conférence TALN. CORIA-TALN-RJC 2018 - Volume 1, 2018

Des représentations continues de mots pour l'analyse d'opinions en arabe: une étude qualitative (Word embeddings for Arabic sentiment analysis : a qualitative study).
Proceedings of the Actes de la Conférence TALN. CORIA-TALN-RJC 2018 - Volume 1, 2018

TED-LIUM 3: Twice as Much Data and Corpus Repartition for Experiments on Speaker Adaptation.
Proceedings of the Speech and Computer - 20th International Conference, 2018

End-To-End Named Entity And Semantic Concept Extraction From Speech.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Evaluation of Feature-Space Speaker Adaptation for End-to-End Acoustic Models.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Simulating ASR errors for training SLU systems.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

FrNewsLink : a corpus linking TV Broadcast News Segments and Press Articles.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Automatic Summarization: Towards a Revised Extract.
Proceedings of the second Conference on Language Processing and Knowledge Management, 2018

Arabic Sentiment Analysis: An Empirical Study of Machine Translation's Impact.
Proceedings of the second Conference on Language Processing and Knowledge Management, 2018

Acoustic-dependent Phonemic Transcription for Text-to-speech Synthesis.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Speaker Adaptive Training and Mixup Regularization for Neural Network Acoustic Models in Automatic Speech Recognition.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Task Specific Sentence Embeddings for ASR Error Detection.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Multifaceted Engagement in Social Interaction with a Machine: The JOKER Project.
Proceedings of the 13th IEEE International Conference on Automatic Face & Gesture Recognition, 2018

2017
Sentiment Analysis of Tunisian Dialects: Linguistic Ressources and Experiments.
Proceedings of the Third Arabic Natural Language Processing Workshop, 2017

Enriching Confusion Networks for Post-processing.
Proceedings of the Statistical Language and Speech Processing, 2017

Automatic Speech Recognition for Tunisian Dialect.
Proceedings of the First Conference on Language Processing and Knowledge Management, 2017

Document Embeddings for Arabic Sentiment Analysis.
Proceedings of the First Conference on Language Processing and Knowledge Management, 2017

ASR Error Management for Improving Spoken Language Understanding.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Evaluating Automatic Topic Segmentation as a Segment Retrieval Task.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Error detection of grapheme-to-phoneme conversion in text-to-speech synthesis using speech signal and lexical context.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

2016
Exploration de paramètres acoustiques dérivés de GMM pour l'adaptation non supervisée de modèles acoustiques à base de réseaux de neurones profonds (Exploring GMM-derived features for unsupervised adaptation of deep neural network acoustic models).
Proceedings of the Actes de la conférence conjointe JEP-TALN-RECITAL 2016. Volume 1 : JEP, 2016

Des Réseaux de Neurones avec Mécanisme d'Attention pour la Compréhension de la Parole (Exploring the use of Attention-Based Recurrent Neural Networks For Spoken Language Understanding ).
Proceedings of the Actes de la conférence conjointe JEP-TALN-RECITAL 2016. Volume 1 : JEP, 2016

Utilisation des représentations continues des mots et des paramètres prosodiques pour la détection d'erreurs dans les transcriptions automatiques de la parole (Combining continuous word representation and prosodic features for ASR error detection).
Proceedings of the Actes de la conférence conjointe JEP-TALN-RECITAL 2016. Volume 1 : JEP, 2016

Exploring GMM-derived Features for Unsupervised Adaptation of Deep Neural Network Acoustic Models.
Proceedings of the Speech and Computer - 18th International Conference, 2016

LIUM ASR systems for the 2016 Multi-Genre Broadcast Arabic challenge.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

A New Perspective on Combining GMM and DNN Frameworks for Speaker Adaptation.
Proceedings of the Statistical Language and Speech Processing, 2016

Evaluation of acoustic word embeddings.
Proceedings of the 1st Workshop on Evaluating Vector-Space Representations for NLP, 2016

Enhancing The RATP-DECODA Corpus With Linguistic Annotations For Performing A Large Range Of NLP Tasks.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Word Embedding Evaluation and Combination.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

On the Use of Gaussian Mixture Model Framework to Improve Speaker Adaptation of Deep Neural Network Acoustic Models.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Conditional Random Fields for the Tunisian Dialect Grapheme-to-Phoneme Conversion.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Acoustic Word Embeddings for ASR Error Detection.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Title assignment for automatic topic segments in TV broadcast news.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

The EUMSSI Project - Event Understanding through Multimodal Social Stream Interpretation.
Proceedings of the 1st International Workshop on Multimodal Media Data Analytics co-located with the 22nd European Conference on Artificial Intelligence, 2016

Recent Improvements on Error Detection for Automatic Speech Recognition.
Proceedings of the 1st International Workshop on Multimodal Media Data Analytics co-located with the 22nd European Conference on Artificial Intelligence, 2016

2015
Utilisation d'annotations sémantiques pour la validation automatique d'hypothèses dans des conversations téléphoniques.
Proceedings of the Actes de la 22e conference sur le Traitement Automatique des Langues Naturelles. Articles courts, 2015

Segmentation et Titrage Automatique de Journaux Télévisés.
Proceedings of the Actes de la 22e conference sur le Traitement Automatique des Langues Naturelles. Articles courts, 2015

Combining Continuous Word Representation and Prosodic Features for ASR Error Prediction.
Proceedings of the Statistical Language and Speech Processing, 2015

The LIUM ASR and SLT systems for IWSLT 2015.
Proceedings of the 12th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2015, 2015

Nao is doing humour in the CHIST-ERA joker project.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Diachronic semantic cohesion for topic segmentation of TV broadcast news.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Word embeddings combination and neural networks for robustness in ASR error detection.
Proceedings of the 23rd European Signal Processing Conference, 2015

Arabic Transliteration of Romanized Tunisian Dialect Text: A Preliminary Investigation.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2015

CRIM and LIUM approaches for multi-genre broadcast media transcription.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

Multimodal data collection of human-robot humorous interactions in the Joker project.
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015

Integration of Word and Semantic Features for Theme Identification in Telephone Conversations.
Proceedings of the Natural Language Dialog Systems and Intelligent Assistants, 2015

2014
Characterizing and detecting spontaneous speech: Application to speaker role recognition.
Speech Commun., 2014

LIUM and CRIM ASR System Combination for the REPERE Evaluation Campaign.
Proceedings of the Text, Speech and Dialogue - 17th International Conference, 2014

Phonetic tool for the Tunisian Arabic.
Proceedings of the 4th Workshop on Spoken Language Technologies for Under-resourced Languages, 2014

Recent Improvements on ILP-based Clustering for Broadcast News Speaker Diarization.
Proceedings of the Odyssey 2014: The Speaker and Language Recognition Workshop, 2014

Enhancing the TED-LIUM Corpus with Selected Data for Language Modeling and More TED Talks.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

A Corpus and Phonetic Dictionary for Tunisian Arabic Speech Recognition.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

LIUM English-to-French spoken language translation system and the Vecsys/LIUM automatic speech recognition system for Italian language for IWSLT 2014.
Proceedings of the 11th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2014, 2014

Is incremental cross-show speaker diarization efficient for processing large volumes of data?
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

EUMSSI: a Platform for Multimodal Analysis and Recommendation using UIMA.
Proceedings of the Workshop on Open Infrastructures and Analysis Frameworks for HLT, 2014

2013
Dynamic Combination of Automatic Speech Recognition Systems by Driven Decoding.
IEEE Trans. Speech Audio Process., 2013

LIUM ASR System for ETAPE French Evaluation Campaign: Experiments on System Combination Using Open-Source Recognizers.
Proceedings of the Text, Speech, and Dialogue - 16th International Conference, 2013

An Investigation of Single-Pass ASR System Combination for Spoken Language Understanding.
Proceedings of the Statistical Language and Speech Processing, 2013

Blip10000: a social video dataset containing SPUG content for tagging and retrieval.
Proceedings of the Multimedia Systems Conference 2013, 2013

2012
Robustesse et portabilités multilingue et multi-domaines des systèmes de compréhension de la parole : les corpus du projet PortMedia (Robustness and portability of spoken language understanding systems among languages and domains : the PORTMEDIA project) [in French].
Proceedings of the Joint Conference JEP-TALN-RECITAL 2012, 2012

Segmentation et Regroupement en Locuteurs d'une collection de documents audio (Cross-show speaker diarization) [in French].
Proceedings of the Joint Conference JEP-TALN-RECITAL 2012, 2012

Combinaison d'approches pour la reconnaissance du rôle des locuteurs (Combination of approaches for speaker role recognition) [in French].
Proceedings of the Joint Conference JEP-TALN-RECITAL 2012, 2012

Avancées dans le domaine de la transcription automatique par décodage guidé (Improvements on driven decoding system combination) [in French].
Proceedings of the Joint Conference JEP-TALN-RECITAL 2012, 2012

TED-LIUM: an Automatic Speech Recognition dedicated corpus.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

Leveraging study of robustness and portability of spoken language understanding systems across languages and domains: the PORTMEDIA corpora.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

I-vectors and ILP clustering adapted to cross-show speaker diarization.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Low latency combination of parallelized single-pass LVCSR systems.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Adaptation and Discriminative Training of Acoustic Models.
Proceedings of the Techniques for Noise Robustness in Automatic Speech Recognition, 2012

2011
LIUM's systems for the IWSLT 2011 speech translation tasks.
Proceedings of the 2011 International Workshop on Spoken Language Translation, 2011

Investigation of Spontaneous Speech Characterization Applied to Speaker Role Recognition.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Bag of n-gram driven decoding for LVCSR system harnessing.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

2010
Automatic indexing of speech segments with spontaneity levels on large audio database.
Proceedings of the 2010 International Workshop on Searching Spontaneous Conversational Speech, 2010

The EPAC Corpus: Manual and Automatic Annotations of Conversational Speech in French Broadcast News.
Proceedings of the International Conference on Language Resources and Evaluation, 2010

LIUM's statistical machine translation system for IWSLT 2010.
Proceedings of the 2010 International Workshop on Spoken Language Translation, 2010

Identification of Speakers by Name Using Belief Functions.
Proceedings of the Information Processing and Management of Uncertainty in Knowledge-Based Systems. Theory and Methods, 2010

A language-identification inspired method for spontaneous speech detection.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Unsupervised model adaptation on targeted speech segments for LVCSR system combination.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

2009
Joint signal and transcription analysis for named speaker identification.
Trait. Autom. des Langues, 2009

LIUM's statistical machine translation system for IWSLT 2009.
Proceedings of the 6th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2009, 2009

LIUM's statistical machine translation systems for IWSLT 2009.
Proceedings of the 2009 International Workshop on Spoken Language Translation, 2009

Improvements to the LIUM French ASR system based on CMU sphinx: what helps to significantly reduce the word error rate?
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Iterative filtering of phonetic transcriptions of proper nouns.
Proceedings of the IEEE International Conference on Acoustics, 2009

Automatic named identification of speakers using diarization and ASR systems.
Proceedings of the IEEE International Conference on Acoustics, 2009

Local and global models for spontaneous speech segment detection and characterization.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

2008
Processing and transcribing spontaneous speech.
Trait. Autom. des Langues, 2008

Correcting asr outputs: Specific solutions to specific errors in French.
Proceedings of the 2008 IEEE Spoken Language Technology Workshop, 2008

Combined Systems for Automatic Phonetic Transcription of Proper Nouns.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

Manual vs Assisted Transcription of Prepared and Spontaneous Speech.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

The LIUM Arabic/English statistical machine translation system for IWSLT 2008.
Proceedings of the 2008 International Workshop on Spoken Language Translation, 2008

Data selection and smoothing in an open-source system for the 2008 NIST machine translation evaluation.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Generalized driven decoding for speech recognition system combination.
Proceedings of the IEEE International Conference on Acoustics, 2008

2007
Extracting true speaker identities from transcriptions.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

System Combination by Driven Decoding.
Proceedings of the IEEE International Conference on Acoustics, 2007

2006
Speaker Diarization: About whom the Speaker is Talking ?
Proceedings of the Odyssey 2006: The Speaker and Language Recognition Workshop, 2006

Automatic Detection of Well Recognized Words in Automatic Speech Transcriptions.
Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006

2005
The LIUM speech transcription system: a CMU Sphinx III-based system for French broadcast news.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

2004
Automatic learning of interpretation strategies for spoken dialogue systems.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003
On the use of linguistic consistency in systems for human-computer dialogues.
IEEE Trans. Speech Audio Process., 2003

Conceptual decoding for spoken dialog systems.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

2002
On the use of structures in language models for dialogue.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

2001
Modèles de langage hiérarchiques pour les applications de dialogue en parole spontanée.
Proceedings of the Actes de la 8ème conférence sur le Traitement Automatique des Langues Naturelles. Posters, 2001

Stochastic finite state automata language model triggered by dialogue states.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

2000
Dynamic selection of language models in a dialogue system.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

1999
A language model combining n-grams and stochastic finite state automata.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999


  Loading...