Benjamin Lecouteux

Orcid: 0000-0003-3000-6190

According to our database, Benjamin Lecouteux authored at least 106 papers between 2006 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.







LeBenchmark 2.0: A standardized, replicable and enhanced framework for self-supervised representations of French speech.
Comput. Speech Lang., 2024

Une approche par graphe pour l'analyse syntaxique en dépendances de bout en bout de la parole.
Proceedings of the Actes de la 31ème Conférence sur le Traitement Automatique des Langues Naturelles, 2024

Approches cascade et de bout-en-bout pour la traduction automatique de la parole en pictogrammes.
Proceedings of the Actes de la 31ème Conférence sur le Traitement Automatique des Langues Naturelles, 2024

Un corpus multimodal alignant parole, transcription et séquences de pictogrammes dédié à la traduction automatique de la parole vers des pictogrammes.
Proceedings of the Actes de la 31ème Conférence sur le Traitement Automatique des Langues Naturelles, 2024

Technologies de la parole et données de terrain : le cas du créole haïtien.
Proceedings of the Actes de la 31ème Conférence sur le Traitement Automatique des Langues Naturelles, 2024

A Multimodal French Corpus of Aligned Speech, Text, and Pictogram Sequences for Speech-to-Pictogram Machine Translation.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

What Has LeBenchmark Learnt about French Syntax?
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Growing Trees on Sounds: Assessing Strategies for End-to-End Dependency Parsing of Speech.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics, 2024

Text-to-movie authoring of anatomy lessons.
Artif. Intell. Medicine, December, 2023

Simple, Simpler and Beyond: A Fine-Tuning BERT-Based Approach to Enhance Sentence Complexity Assessment for Text Simplification.
Proceedings of the 6th International Conference on Natural Language and Speech Processing (ICNLSP 2023), 2023

Pre-training for Speech Translation: CTC Meets Optimal Transport.
Proceedings of the International Conference on Machine Learning, 2023

PROPICTO: Developing Speech-to-Pictograph Translation Systems to Enhance Communication Accessibility.
Proceedings of the 24th Annual Conference of the European Association for Machine Translation, 2023

End-to-End Dependency Parsing of Spoken French.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Automatic Speech Recognition and Query By Example for Creole Languages Documentation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech.
CoRR, 2021

Human beatbox sound recognition using an automatic speech recognition toolkit.
Biomed. Signal Process. Control., 2021

Task Agnostic and Task Specific Self-Supervised Learning from Speech with LeBenchmark.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

ON-TRAC' systems for the IWSLT 2021 low-resource speech translation and multilingual speech translation shared tasks.
Proceedings of the 18th International Conference on Spoken Language Translation, 2021

LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Mesures de confiance et traitement automatique de la parole.
, 2021

FlauBERT : des modèles de langue contextualisés pré-entraînés pour le français (FlauBERT : Unsupervised Language Model Pre-training for French).
Proceedings of the Actes de la 6e conférence conjointe Journées d'Études sur la Parole (JEP, 2020

Reconnaissance de parole beatboxée à l'aide d'un système HMM-GMM inspiré de la reconnaissance automatique de la parole (Beatbox Sounds Recognition Using a Speech-Dedicated HMM-GMM Based System).
Proceedings of the Actes de la 6e conférence conjointe Journées d'Études sur la Parole (JEP, 2020

Providing Semantic Knowledge to a Set of Pictograms for People with Disabilities: a Set of Links between WordNet and Arasaac: Arasaac-WN.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

FlauBERT: Unsupervised Language Model Pre-training for French.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

ON-TRAC Consortium for End-to-End and Simultaneous Speech Translation Challenge Tasks at IWSLT 2020.
Proceedings of the 17th International Conference on Spoken Language Translation, 2020

Towards Automatic Captioning of University Lectures for French students who are Deaf.
Proceedings of the ASSETS '20: The 22nd International ACM SIGACCESS Conference on Computers and Accessibility, 2020

Evaluation of the acceptability and usability of Augmentative and Alternative Communication (ACC) tools: the example of Pictogram grid communication systems with voice output.
Proceedings of the ASSETS '20: The 22nd International ACM SIGACCESS Conference on Computers and Accessibility, 2020

Making Emergency Calls More Accessible to Older Adults Through a Hands-free Speech Interface in the House.
ACM Trans. Access. Comput., 2019

Sense Vocabulary Compression through the Semantic Knowledge of WordNet for Neural Word Sense Disambiguation.
Proceedings of the 10th Global Wordnet Conference, 2019

Compression de vocabulaire de sens grâce aux relations sémantiques pour la désambiguïsation lexicale (Sense Vocabulary Compression through Semantic Knowledge for Word Sense Disambiguation).
Proceedings of the Actes de la Conférence sur le Traitement Automatique des Langues Naturelles (TALN) PFIA 2019. Volume I : Articles longs, 2019

Apporter des connaissances sémantiques à un jeu de pictogrammes destiné à des personnes en situation de handicap : Un ensemble de liens entre Princeton WordNet et Arasaac, Arasaac-WN (Giving semantic knowledge to a set of pictograms for people with disabilities: a set of links between WordNet and Arasaac, Arasaac-WN).
Proceedings of the Actes de la Conférence sur le Traitement Automatique des Langues Naturelles (TALN) PFIA 2019. Volume IV : Démonstrations, 2019

The LIG system for the English-Czech Text Translation Task of IWSLT 2019.
Proceedings of the 16th International Conference on Spoken Language Translation, 2019

Context-Aware Voice-Based Interaction in Smart Home - VocADom@A4H Corpus Collection and Empirical Assessment of Its Usefulness.
Proceedings of the 2019 IEEE Intl Conf on Dependable, 2019

Automatic quality estimation for speech translation using joint ASR and MT features.
Mach. Transl., 2018

Distant speech processing for smart home: comparison of ASR approaches in scattered microphone network for voice command.
Int. J. Speech Technol., 2018

Improving the Coverage and the Generalization Ability of Neural Word Sense Disambiguation through Hypernymy and Hyponymy Relationships.
CoRR, 2018

Approche supervisée à base de cellules LSTM bidirectionnelles pour la désambiguïsation lexicale (LSTM Based Supervised Approach for Word Sense Disambiguation).
Proceedings of the Actes de la Conférence, 2018

UFSAC: Unification of Sense Annotated Corpora and Tools.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

ASR Performance Prediction on Unseen Broadcast Programs Using Convolutional Neural Networks.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Analyzing Learned Representations of a Deep ASR Performance Prediction Model.
Proceedings of the Workshop: Analyzing and Interpreting Neural Networks for NLP, 2018

Find the errors, get the better: Enhancing machine translation via word confidence estimation.
Nat. Lang. Eng., 2017

Uniformisation de corpus anglais annotés en sens (Unification of sense annotated English corpora for word sense disambiguation).
Proceedings of the Actes des 24ème Conférence sur le Traitement Automatique des Langues Naturelles. Orléans, France, June 26-30, 2017 - Volume 3, 2017

Représentation vectorielle de sens pour la désambiguïsation lexicale à base de connaissances (Sense Embeddings in Knowledge-Based Word Sense Disambiguation).
Proceedings of the Actes des 24ème Conférence sur le Traitement Automatique des Langues Naturelles. Orléans, France, June 26-30, 2017, Volume 2, 2017

Traitement des Mots Hors Vocabulaire pour la Traduction Automatique de Document OCRisés en Arabe (This article presents a new system that automatically translates images of arabic documents).
Proceedings of the Actes des 24ème Conférence sur le Traitement Automatique des Langues Naturelles, 2017

Disentangling ASR and MT Errors in Speech Translation.
Proceedings of Machine Translation Summit XVI, Volume 1: Research Track, 2017

Sense Embeddings in Knowledge-Based Word Sense Disambiguation.
Proceedings of the IWCS 2017 - 12th International Conference on Computational Semantics - Short papers, Montpellier, France, September 19, 2017

Automatic Quality Assessment for Speech Translation Using Joint ASR and MT Features.
CoRR, 2016

Acquisition et reconnaissance automatique d'expressions et d'appels vocaux dans un habitat (Acquisition and recognition of expressions and vocal calls in a smart home).
Proceedings of the Actes de la conférence conjointe JEP-TALN-RECITAL 2016. Volume 1 : JEP, 2016

The CIRDO Corpus: Comprehensive Audio/Video Database of Domestic Falls of Elderly People.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

CirdoX: an on/off-line multisource speech and sound analysis software.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Joint ASR and MT Features for Quality Estimation in Spoken Language Translation.
Proceedings of the 13th International Conference on Spoken Language Translation, 2016

Better Evaluation of ASR in Speech Translation Context Using Word Embeddings.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

On Distant Speech Recognition for Home Automation.
Proceedings of the Smart Health - Open Problems and Future Challenges, 2015

Evaluation of a Context-Aware Voice Interface for Ambient Assisted Living: Qualitative User Study vs. Quantitative System Evaluation.
ACM Trans. Access. Comput., 2015

Towards accurate predictors of word quality for Machine Translation: Lessons learned on French-English and English-Spanish systems.
Data Knowl. Eng., 2015

Utilisation de mesures de confiance pour améliorer le décodage en traduction de parole.
Proceedings of the Actes de la 22e conférence sur le Traitement Automatique des Langues Naturelles. Articles longs, 2015

Speech and speaker recognition for home automation: Preliminary results.
Proceedings of the International Conference on Speech Technology and Human-Computer Dialogue, 2015

Merging of Native and Non-native Speech for Low-resource Accented ASR.
Proceedings of the Statistical Language and Speech Processing, 2015

Recognition of Distress Calls in Distant Speech Setting: a Preliminary Experiment in a Smart Home.
Proceedings of the 6th Workshop on Speech and Language Processing for Assistive Technologies, 2015

An open-source toolkit for word-level confidence estimation in machine translation.
Proceedings of the 12th International Workshop on Spoken Language Translation: Papers, 2015

Ant colony algorithm applied to automatic speech recognition graph decoding.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Using resources from a closely-related language to develop ASR for a very under-resourced language: a case study for iban.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Spoken language translation graphs re-decoding using automatic quality assessment.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

LIG System for Word Level QE task at WMT14.
Proceedings of the Ninth Workshop on Statistical Machine Translation, 2014

Using closely-related language to build an ASR for a very under-resourced language: Iban.
Proceedings of the 2014 17th Oriental Chapter of the International Committee for the Co-ordination and Standardization of Speech Databases and Assessment Techniques (COCOSDA), 2014

The Sweet-Home speech and multimodal corpus for home automation interaction.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Word confidence estimation for speech translation.
Proceedings of the 11th International Workshop on Spoken Language Translation: Papers, 2014

Multichannel automatic recognition of voice command in a multi-room smart home: an experiment involving seniors and users with visual impairment.
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Acoustic model merging using acoustic models from multilingual speakers for automatic speech recognition.
Proceedings of the 2014 International Conference on Asian Language Processing, 2014

An efficient two-pass decoder for SMT using word confidence estimation.
Proceedings of the 17th Annual conference of the European Association for Machine Translation, 2014

Word Confidence Estimation for SMT N-best List Re-ranking.
Proceedings of the Workshop on Humans and Computer-assisted Translation, 2014

Dynamic Combination of Automatic Speech Recognition Systems by Driven Decoding.
IEEE Trans. Speech Audio Process., 2013

LIG System for WMT13 QE Task: Investigating the Usefulness of Features in Word Confidence Estimation for MT.
Proceedings of the Eighth Workshop on Statistical Machine Translation, 2013

Driven Decoding for machine translation (Vers un décodage guidé pour la traduction automatique) [in French].
Proceedings of the Traitement Automatique des Langues Naturelles, 2013

Experimental Evaluation of Speech Recognition Technologies for Voice-based Home Automation Control in a Smart Home.
Proceedings of the Fourth Workshop on Speech and Language Processing for Assistive Technologies, 2013

Word Confidence Estimation and Its Integration in Sentence Quality Estimation for Machine Translation.
Proceedings of the Knowledge and Systems Engineering, 2013

Evaluation of a real-time voice order recognition system from multiple audio channels in a home.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

The Sweet-Home project: Audio processing and decision making in smart home to improve well-being and reliance.
Proceedings of the 35th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2013

Integrating imperfect transcripts into speech recognition systems for building high-quality corpora.
Comput. Speech Lang., 2012

Prédiction de l'indexabilité d'une transcription (Prediction of transcription indexability) [in French].
Proceedings of the Joint Conference JEP-TALN-RECITAL 2012, 2012

Reconnaissance d'ordres domotiques en conditions bruitées pour l'assistance à domicile (Recognition of Voice Commands by Multisource ASR and Noise Cancellation in a Smart Home Environment) [in French].
Proceedings of the JEP-TALN-RECITAL 2012, 2012

Reconnaissance automatique de la parole distante dans un habitat intelligent : méthodes multi-sources en conditions réalistes (Distant Speech Recognition in a Smart Home : Comparison of Several Multisource ASRs in Realistic Conditions) [in French].
Proceedings of the Joint Conference JEP-TALN-RECITAL 2012, 2012

The LIG English to French machine translation system for IWSLT 2012.
Proceedings of the 2012 International Workshop on Spoken Language Translation, 2012

Recognition of voice commands by multisource ASR and noise cancellation in a smart home environment.
Proceedings of the 20th European Signal Processing Conference, 2012

Sound Environment Analysis in Smart Home.
Proceedings of the Ambient Intelligence - Third International Joint Conference, 2012

The LIGA (LIG/LIA) Machine Translation System for WMT 2011.
Proceedings of the Sixth Workshop on Statistical Machine Translation, 2011

Distant speech recognition for home automation: Preliminary experimental results in a smart home.
Proceedings of the 6th International Conference Speech Technology and Human-Computer Dialogue, 2011

LIG English-French spoken language translation system for IWSLT 2011.
Proceedings of the 2011 International Workshop on Spoken Language Translation, 2011

Distant Speech Recognition in a Smart Home: Comparison of Several Multisource ASRs in Realistic Conditions.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

A segment-level confidence measure for Spoken Document Retrieval.
Proceedings of the IEEE International Conference on Acoustics, 2011

The sweet-home project: Audio technology in smart homes to improve well-being and reliance.
Proceedings of the 33rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2011

Query-Driven Strategy for On-the-Fly Term Spotting in Spontaneous Speech.
EURASIP J. Audio Speech Music. Process., 2010

Transcriber Driving Strategies for Transcription Aid System.
Proceedings of the International Conference on Language Resources and Evaluation, 2010

Improving back-off models with bag of words and hollow-grams.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Semantic cache model driven speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2010

Combined low level and high level features for out-of-vocabulary word detection.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Reconnaissance automatique de la parole guidée par des transcriptions a priori. (driven decoding for speech recognition system combination).
PhD thesis, 2008

On-the-fly term spotting by phonetic filtering and request-driven decoding.
Proceedings of the 2008 IEEE Spoken Language Technology Workshop, 2008

Generalized driven decoding for speech recognition system combination.
Proceedings of the IEEE International Conference on Acoustics, 2008

Text island spotting in large speech databases.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

System Combination by Driven Decoding.
Proceedings of the IEEE International Conference on Acoustics, 2007

Imperfect transcript driven speech recognition.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006
