Éric Villemonte de la Clergerie

Orcid: 0000-0001-6428-9219

Affiliations:
  • INRIA, ALPAGE


According to our database1, Éric Villemonte de la Clergerie authored at least 99 papers between 1990 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Why do small language models underperform? Studying Language Model Saturation via the Softmax Bottleneck.
CoRR, 2024

PatentEval: Understanding Errors in Patent Generation.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Headless Language Models: Learning without Predicting with Contrastive Weight Tying.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Translate your Own: a Post-Editing Experiment in the NLP domain.
Proceedings of the 25th Annual Conference of the European Association for Machine Translation (Volume 1), 2024

Anisotropy Is Inherent to Self-Attention in Transformers.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

CamemBERT-bio: Leveraging Continual Pre-training for Cost-Effective Models on French Biomedical Data.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

On the Scaling Laws of Geographical Representation in Language Models.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023
CamemBERT-bio: a Tasty French Language Model Better for your Health.
CoRR, 2023

Is Anisotropy Inherent to Transformers?
CoRR, 2023

CamemBERT-bio : Un modèle de langue français savoureux et meilleur pour la santé.
Proceedings of the Actes de CORIA-TALN 2023. Actes de la 30e Conférence sur le Traitement Automatique des Langues Naturelles, TALN 2023 - Volume 1 : travaux de recherche originaux, 2023

Constitution de sous-fils de conversations d'emails.
Proceedings of the Actes de CORIA-TALN 2023. Actes de la 18e Conférence en Recherche d'Information et Applications, 2023

Annotation d'entités cliniques en utilisant les Larges Modèles de Langue.
Proceedings of the Actes de CORIA-TALN 2023. Actes de la 30e Conférence sur le Traitement Automatique des Langues Naturelles, TALN 2023 - Volume 1 : travaux de recherche originaux, 2023

MaTOS: Traduction automatique pour la science ouverte.
Proceedings of the Actes de CORIA-TALN 2023. Actes de l'atelier "Analyse et Recherche de Textes Scientifiques", 2023

Annotate French Clinical Data Using Large Language Model Predictions.
Proceedings of the 11th IEEE International Conference on Healthcare Informatics, 2023

Large Language Models as Instructors: A Study on Multilingual Clinical Entity Extraction.
Proceedings of the 22nd Workshop on Biomedical Natural Language Processing and BioNLP Shared Tasks, 2023

2022
MANTa: Efficient Gradient-Based Tokenization for Robust End-to-End Language Modeling.
CoRR, 2022

MUSS: Multilingual Unsupervised Sentence Simplification by Mining Paraphrases.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

MANTa: Efficient Gradient-Based Tokenization for End-to-End Robust Language Modeling.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

2021
Rethinking Automatic Evaluation in Sentence Simplification.
CoRR, 2021

2020
Multilingual Unsupervised Sentence Simplification.
CoRR, 2020

Les modèles de langue contextuels Camembert pour le français : impact de la taille et de l'hétérogénéité des données d'entrainement (C AMEM BERT Contextual Language Models for French: Impact of Training Data Size and Heterogeneity ).
Proceedings of the Actes de la 6e conférence conjointe Journées d'Études sur la Parole (JEP, 2020

Controllable Sentence Simplification.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Reconstructing the gendered division of labor in the French textile trades. Distant reading of primary qualitative sources with NLP tools (18th century-beginning of the 20th century).
Proceedings of the 15th Annual International Conference of the Alliance of Digital Humanities Organizations, 2020

Clustering-based Automatic Construction of Legal Entity Knowledge Base from Contracts.
Proceedings of the 2020 IEEE International Conference on Big Data (IEEE BigData 2020), 2020

CamemBERT: a Tasty French Language Model.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Reference-less Quality Estimation of Text Simplification Systems.
CoRR, 2019

INRIA at SemEval-2019 Task 9: Suggestion Mining Using SVM with Handcrafted Features.
Proceedings of the 13th International Workshop on Semantic Evaluation, 2019

2018
Cheating a Parser to Death: Data-driven Cross-Treebank Annotation Transfer.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

ANCOR-AS: Enriching the ANCOR Corpus with Syntactic Annotations.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

The Time-Us project. Creating gold data to understand the gender gap in the French textile trades (17th-20th century).
Proceedings of the 13th Annual International Conference of the Alliance of Digital Humanities Organizations, 2018

ELMoLex: Connecting ELMo and Lexicon Features for Dependency Parsing.
Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies, Brussels, Belgium, October 31, 2018

2017
Apports des analyses syntaxiques pour la détection automatique de mentions dans un corpus de français oral (Experiences in using deep and shallow parsing to detect entity mentions in oral French).
Proceedings of the Actes des 24ème Conférence sur le Traitement Automatique des Langues Naturelles. Orléans, France, June 26-30, 2017, Volume 2, 2017

The ParisNLP entry at the ConLL UD Shared Task 2017: A Tale of a #ParsingTragedy.
Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies, 2017

2016
Accurate Deep Syntactic Parsing of Graphs: The Case of French.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

2015
Préface.
Trait. Autom. des Langues, 2015

Because Syntax Does Matter: Improving Predicate-Argument Structures Parsing with Syntactic Features.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

2014
Préface.
Trait. Autom. des Langues, 2014

Playing with parsers (Jouer avec des analyseurs syntaxiques) [in French].
Proceedings of the Traitement Automatique des Langues Naturelles, 2014

Alpage: Transition-based Semantic Graph Parsing with Syntactic Features.
Proceedings of the 8th International Workshop on Semantic Evaluation, 2014

Towards an environment for the production and the validation of lexical semantic resources.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Deep Syntax Annotation of the Sequoia French Treebank.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

2013
Préface.
Trait. Autom. des Langues, 2013

Improving a symbolic parser through partially supervised learning.
Proceedings of The 13th International Conference on Parsing Technologies, 2013

Overview of the SPMRL 2013 Shared Task: A Cross-Framework Evaluation of Parsing Morphologically Rich Languages.
Proceedings of the Fourth Workshop on Statistical Parsing of Morphologically-Rich Languages, 2013

Exploring beam-based shift-reduce dependency parsing with DyALog: Results from the SPMRL 2013 shared task.
Proceedings of the Fourth Workshop on Statistical Parsing of Morphologically-Rich Languages, 2013

2012
A linguistically-motivated 2-stage Tree to Graph Transformation.
Proceedings of the 11th International Workshop on Tree Adjoining Grammars and Related Formalisms, 2012

Evaluating and improving syntactic lexica by plugging them within a parser.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

Boosting the Coverage of a Semantic Lexicon by Automatically Extracted Event Nominalizations.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

2011
Évaluation de lexiques syntaxiques par leur intégartion dans l'analyseur syntaxiques FRMG
CoRR, 2011

Modelling Intermolecular Structures and Defining Ambiguity in Gene Sequences using Matrix Insertion-Deletion Systems.
Proceedings of the Biology, Computation and Linguistics - New Interdisciplinary Paradigms, 2011

2010
Exploitation de résultats d'analyse syntaxique pour extraction semi-supervisée des chemins de relations.
Proceedings of the Actes de la 17e conférence sur le Traitement Automatique des Langues Naturelles. Articles courts, 2010

Convertir des dérivations TAG en dépendances.
Proceedings of the Actes de la 17e conférence sur le Traitement Automatique des Langues Naturelles. Articles longs, 2010

Building factorized TAGs with meta-grammars.
Proceedings of the 10th International Workshop on Tree Adjoining Grammar and Related Frameworks, 2010

PASSAGE Syntactic Representation: a Minimal Common Ground for Evaluation.
Proceedings of the International Conference on Language Resources and Evaluation, 2010

2009
Producción eficiente de recursos lingüísticos: proyecto Victoria.
Proces. del Leng. Natural, 2009

Trouver et confondre les coupables : un processus sophistiqué de correction de lexique.
Proceedings of the Actes de la 16ème conférence sur le Traitement Automatique des Langues Naturelles. Articles longs, 2009

Towards Efficient Production of Linguistic Resources: the Victoria Project.
Proceedings of the Recent Advances in Natural Language Processing, 2009

Extracting and Visualizing Quotations from News Wires.
Proceedings of the Human Language Technology. Challenges for Computer Science and Linguistics, 2009

2008
Error Mining on Syntactic Parser Output.
Trait. Autom. des Langues, 2008

Extensión y corrección semi-automática de léxicos morfo-sintácticos.
Proces. del Leng. Natural, 2008

PASSAGE: from French Parser Evaluation to Large Sized Treebank.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

Large Scale Production of Syntactic Annotations to Move Forward.
Proceedings of the workshop on Cross-Framework and Cross-Domain Parser Evaluation@COLING 2008, 2008

Computer Aided Correction and Extension of a Syntactic Wide-Coverage Lexicon.
Proceedings of the COLING 2008, 2008

Mining conceptual graphs for knowledge acquisition.
Proceedings of the Proceeding of the 2nd ACM workshop on Improving Non English Web Searching, 2008

2007
Confondre le coupable : corrections d'un lexique suggérées par une grammaire.
Proceedings of the Actes de la 14ème conférence sur le Traitement Automatique des Langues Naturelles. Articles longs, 2007

Large-Scale Knowledge Acquisition from Botanical Texts.
Proceedings of the Natural Language Processing and Information Systems, 2007

Mining Parsing Results for Lexical Correction: Toward a Complete Correction Process of Wide-Coverage Lexicons.
Proceedings of the Human Language Technology. Challenges of the Information Society, 2007

From Text to Knowledge.
Proceedings of the Computer Aided Systems Theory, 2007

2006
Trouver le coupable : Fouille d'erreurs sur des sorties d'analyseurs syntaxiques.
Proceedings of the Actes de la 13ème conférence sur le Traitement Automatique des Langues Naturelles. Articles longs, 2006

The Lefff 2 syntactic lexicon for French: architecture, acquisition, use.
Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006

Error Mining in Parsing Results.
Proceedings of the ACL 2006, 2006

2005
Comment obtenir plus des Méta-Grammaires.
Proceedings of the Actes de la 12ème conférence sur le Traitement Automatique des Langues Naturelles. Articles longs, 2005

Chaînes de traitement syntaxique.
Proceedings of the Actes de la 12ème conférence sur le Traitement Automatique des Langues Naturelles. Articles longs, 2005

From metagrammars to factorized TAG/TIG parsers.
Proceedings of the Ninth International Workshop on Parsing Technology, 2005

2004
Towards an International Standard on Feature Structure Representation.
Proceedings of the Fourth International Conference on Language Resources and Evaluation, 2004

2002
Construire des analyseurs avec DyALog.
Proceedings of the Actes de la 9ème conférence sur le Traitement Automatique des Langues Naturelles. Articles longs, 2002

Parsing MCS languages with Thread Automata.
Proceedings of the Sixth International Workshop on Tree Adjoining Grammar and Related Frameworks, 2002

Parsing Mildly Context-Sensitive Languages with Thread Automata.
Proceedings of the 19th International Conference on Computational Linguistics, 2002

2001
Tabulation for Multi-Purpose Partial Parsing.
Grammars, 2001

Atelier ATOLL pour les grammaires d'arbres adjoints.
Proceedings of the Actes de la 8ème conférence sur le Traitement Automatique des Langues Naturelles. Articles longs, 2001

Refining Tabular Parsers for TAGs.
Proceedings of the Language Technologies 2001: The Second Meeting of the North American Chapter of the Association for Computational Linguistics, 2001

A Formal Definition of Bottom-Up Embedded Push-Down Automata and Their Tabulation Technique.
Proceedings of the Logical Aspects of Computational Linguistics, 2001

Natural Language Tabular Parsing.
Proceedings of the Logic Programming, 17th International Conference, 2001

Guided Parsing of Range Concatenation Languages.
Proceedings of the Association for Computational Linguistic, 2001

2000
Tabulation of Automata for Tree-Adjoining Languages.
Grammars, 2000

Practical aspects in compiling tabular TAG parsers.
Proceedings of the Fifth International Workshop on Tree Adjoining Grammar and Related Frameworks, 2000

A redefinition of Embedded Push-Down Automata.
Proceedings of the Fifth International Workshop on Tree Adjoining Grammar and Related Frameworks, 2000

New Tabular Algorithms for Parsing.
Proceedings of the Sixth Internatonal Workshop on Parsing Technologies, 2000

1999
Tabular Algorithms for TAG Parsing.
Proceedings of the EACL 1999, 1999

1998
Information Flow in Tabular Interpretations for Generalized Push-Down Automata.
Theor. Comput. Sci., 1998

A tabular interpretation of bottom-up automata for TAG.
Proceedings of the Fourth International Workshop on Tree Adjoining Grammars and Related Frameworks, 1998

A Tabular Interpretation of a Class of 2-Stack Automata.
Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, 1998

1996
Logical Aspects of Computational Linguistics: An Introduction.
Proceedings of the Logical Aspects of Computational Linguistics, 1996

1994
LPDA: Another look at Tabulation in Logic Programming.
Proceedings of the Logic Programming, 1994

1993
Layer Sharing: An Improved Structure-Sharing Framework.
Proceedings of the Conference Record of the Twentieth Annual ACM SIGPLAN-SIGACT Symposium on Principles of Programming Languages, 1993

How to build quickly an efficient implementation of the domain Prop with DyALog.
Proceedings of the 5th Workshop on Logic Programming Environments (LPE 1993), 1993

1992
Subsumption-oriented Push-Down Automata.
Proceedings of the Programming Language Implementation and Logic Programming, 1992

1991
A Tool for Abstract Interpretation: Dynamic Programming.
Proceedings of the Actes JTASPEFL'91 (Bordeaux, 1991

1990
DyALog: une implantation des Clauses de Horn en Programmtion Dynamique.
Proceedings of the SPLT'90, 1990


  Loading...