Utiliser l'explicabilité des modèles pour mettre en évidence les expressions genrées dans la parole.
astroECR : enrichissement d'un corpus astrophysique en entités nommées, coréférences et relations sémantiques.
A Dataset for Pharmacovigilance in German, French, and Japanese: Annotating Adverse Drug Reactions across Languages.
Enriching a Time-Domain Astrophysics Corpus with Named Entity, Coreference and Astrophysical Relationship Annotations.
Does the structure of textual content have an impact on language models for automatic summarization?
La pré-annotation automatique de textes cliniques comme support au dialogue avec les experts du domaine lors de la mise au point d'un schéma d'annotation.
Étude de méthodes d'augmentation de données pour la reconnaissance d'entités nommées en astrophysique.
Le traitement automatique des langues face à l'évolution des usages de la langue (Natural Language Processing Facing the Language Uses Evolution).
Impact du français inclusif sur les outils du TAL (Impact of French Inclusive Language on NLP Tools).
Etude des stéréotypes genrés dans le théâtre français du XVIe au XIXe siècle à travers des plongements lexicaux (Studying gender stereotypes in French theater from XVIth to XIXth century through the use of lexical embeddings).
Evaluating Tokenizers Impact on OOVs Representation with Transformers Models.
Classification de cas cliniques et évaluation automatique de réponses d'étudiants : présentation de la campagne DEFT 2021 (Clinical cases classification and automatic evaluation of student answers: Presentation of the DEFT 2021 Challenge).
Differential Evaluation: a Qualitative Analysis of Natural Language Processing System Behavior Based Upon Data Resistance to Processing.
Easy-to-use Combination of POS and BERT Model for Domain-Specific and Misspelled Terms.
Présentation de la campagne d'évaluation DEFT 2020 : similarité textuelle en domaine ouvert et extraction d'information précise dans des cas cliniques (Presentation of the DEFT 2020 Challenge: open domain textual similarity and precise information extraction from clinical cases).
Inference Annotation of a Chinese Corpus for Opinion Mining.
Experiments from LIMSI at the French Named Entity Recognition Coarse-grained Task.
Automatic classification of free-text medical causes from death certificates for reactive mortality surveillance in France.
Recherche et extraction d'information dans des cas cliniques. Présentation de la campagne d'évaluation DEFT 2019 (Information Retrieval and Information Extraction from Clinical Cases).
Corpus annoté de cas cliniques en français (Annotated corpus with clinical cases in French).
Community Perspective on Replicability in Natural Language Processing.
Initial Experiments for Pharmacovigilance Analysis in Social Media Using Summaries of Product Characteristics.
A New Approach to Compare the Performance of Two Classification Methods of Causes of Death for Timely Surveillance in France.
Clinical Case Reports for NLP.
A French clinical corpus with comprehensive semantic annotations: development of the Medical Entity and Relation LIMSI annOtated Text corpus (MERLOT).
DEFT2018 : recherche d'information et analyse de sentiments dans des tweets concernant les transports en Île de France (DEFT2018: Information Retrieval and Sentiment Analysis in Tweets about Public Transportation in Île de France Region).
Simplification de schémas d'annotation : un aller sans retour ? (Annotation scheme simplification: a one way trip with no return?).
Three Dimensions of Reproducibility in Natural Language Processing.
Traitement automatique de la langue biomédicale au LIMSI (Biomedical language processing at LIMSI).
Generating a Training Corpus for OCR Post-Correction Using Encoder-Decoder Model.
CLEF eHealth 2017 Multilingual Information Extraction task Overview: ICD10 Coding of Death Certificates in English and French.
Reproducibility in Biomedical Natural Language Processing.
Une catégorisation de fins de lignes non-supervisée (End-of-line classification with no supervision).
LIMSI at SemEval-2016 Task 12: machine-learning and temporal information to identify clinical events and time expressions.
Identification of Drug-Related Medical Conditions in Social Media.
Controlled Propagation of Concept Annotations in Textual Corpora.
Text Segmentation of Digitized Clinical Texts.
Supervised classification of end-of-lines in clinical text with no manual annotation.
A Dataset for ICD-10 Coding of Death Certificates: Creation and Usage.
Detection of Text Reuse in French Medical Corpora.
Clinical Information Extraction at the CLEF eHealth Evaluation lab 2016.
Identification of Mentions and Relations between Bacteria and Biotope from PubMed Abstracts.
Replicability of Research in Biomedical Natural Language Processing: a pilot evaluation for a coding task.
Low-resource OCR error detection and correction in French Clinical Texts.
The contribution of co-reference resolution to supervised relation detection between bacteria and biotopes entities.
Combining glass box and black box evaluations in the identification of heart disease risk factors and their temporal relations from clinical records.
Médicaments qui soignent, médicaments qui rendent malades : étude des relations causales pour identifier les effets secondaires.
Étude des verbes introducteurs de noms de médicaments dans les forums de santé.
Identification de facteurs de risque pour des patients diabétiques à partir de comptes-rendus cliniques par des approches hybrides.
CLEF eHealth Evaluation Lab 2015 Task 1b: Clinical Named Entity Recognition.
Overview of the CLEF eHealth Evaluation Lab 2015.
Is it possible to recover personal health information from an automatically de-identified corpus of French EHRs?
De-identification of clinical notes in French: towards a protocol for reference corpus development.
Automatic Analysis of Scientific and Literary Texts. Presentation and Results of the DEFT2014 Text Mining Challenge (Analyse automatique de textes littéraires et scientifiques : présentation et résultats du défi fouille de texte DEFT2014) [in French].
Human annotation of ASR error regions: Is "gravity" a sharable concept for human annotators?
Biomedical entity extraction using machine-learning based approaches.
Morpho-Syntactic Study of Errors from Speech Recognition System.
Annotation of specialized corpora using a comprehensive entity and relation scheme.
Use of unsupervised word classes for entity recognition: Application to the detection of disorders in clinical reports.
Disease and Disorder Template Filling using Rule-based and Statistical Approaches.
How to de-identify a large clinical corpus in 10 days.
Automatic Content Extraction for Designing a French Clinical Corpus.
Optimizing annotation efforts to build reliable annotated corpora for training statistical models.
Anonymisation de documents cliniques : performances et limites des méthodes symboliques et par apprentissage statistique (Clinical Records De-Identification: Performances and Limits of Rule-based and Machine-Learning based Approaches).
Eventual situations for timeline extraction from clinical reports.
Studying frequency-based approaches to process lexical simplification (Approches à base de fréquences pour la simplification lexicale) [in French].
Automatic De-Identification of French Clinical Records: Comparison of Rule-Based and Machine-Learning Approaches.
Building A Contrasting Taxa Extractor for Relation Identification from Assertions: BIOlogical Taxonomy & Ontology Phrase Extraction System.
Automatic Named Entity Pre-annotation for Out-of-domain Human Annotation.
Indexation libre et contrôlée d'articles scientifiques. Présentation et résultats du défi fouille de textes DEFT2012 (Controlled and free indexing of scientific papers. Presentation and results of the DEFT2012 text-mining challenge) [in French].
ANNLOR: A Naïve Notation-system for Lexical Outputs Ranking.
Extended Named Entities Annotation on OCRed Documents: From Corpus Constitution to Evaluation Campaign.
Detecting negation of medical problems in French clinical notes.
Manual Corpus Annotation: Giving Meaning to the Evaluation Metrics.
Structured Named Entities in two distinct press corpora: Contemporary Broadcast News and Old Newspapers.
Une approche à plusieurs étapes pour anonymiser des documents médicaux.
Hybrid methods for improving information access in clinical documents: concept, assertion, and relation identification.
Extraction d'informations médicales au LIMSI (Medical information extraction at LIMSI).
Accès au contenu sémantique en langue de spécialité : extraction des prescriptions et concepts médicaux (Accessing the semantic content in a specialized language: extracting prescriptions and medical concepts).
Structured and Extended Named Entity Evaluation in Automatic Speech Transcriptions.
Handling Outlandish Occurrences: Using Rules and Lexicons for Correcting NLP Articles.
Proposal for an Extension of Traditional Named Entities: From Guidelines to Evaluation, an Overview.
Extracting medical information from narrative patient records: the case of medication-related information.
Extracting Medication Information from French Clinical Texts.
A Corpus for Studying Full Answer Justification.
DEFT'07 : une campagne d'évaluation en fouille d'opinion.
Testing Tactics to Localize De-Identification.
Certification and Cleaning up of a Text Corpus: Towards an Evaluation of the "Grammatical" Quality of a Corpus.
Recycling an Information Extraction System to Automatically Produce Semantic Annotations for the Web.
