Benchmarking Linguistic Diversity of Large Language Models.
CoRR, 2024
Do Large Language Models Have an English Accent? Evaluating and Improving the Naturalness of Multilingual LLMs.
CoRR, 2024
The Curious Decline of Linguistic Diversity: Training Language Models on Synthetic Text.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024
Automatic Analysis of Substantiation in Scientific Peer Reviews.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Political Communities on Twitter: Case Study of the 2022 French Presidential Election.
Proceedings of the LREC 2022 workshop on Natural Language Processing for Political Sciences, 2022
Questioning the Validity of Summarization Datasets and Improving Their Factual Consistency.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
Controlling of lower-order dead system by implementing adaptive RST algorithm.
Int. J. Model. Identif. Control., 2021
NLP Research and Resources at DaSciM, Ecole Polytechnique.
CoRR, 2021
How COVID-19 Is Changing Our Language : Detecting Semantic Shift in Twitter Word Embeddings.
CoRR, 2021
BERTweetFR : Domain Adaptation of Pre-Trained Language Models for French Tweets.
Proceedings of the Seventh Workshop on Noisy User-generated Text, 2021