Benjamin Muller
Affiliations: Meta, NYC, USA
According to our database, Benjamin Muller authored at least 23 papers between 2018 and 2024.
Bibliography
2024
CoRR, 2024
The Belebele Benchmark: a Parallel Reading Comprehension Dataset in 122 Language Variants.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
2023
CoRR, 2023
The Gender-GAP Pipeline: A Gender-Aware Polyglot Pipeline for Gender Characterisation in 55 Languages.
Proceedings of the Eighth Conference on Machine Translation, 2023
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
In What Languages are Generative Language Models the Most Formal? Analyzing Formality Distribution across Languages.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
2022
How Can We Make Language Models Better at Handling the Diversity and Variability of Natural Languages? (Comment rendre les modèles de langue meilleurs face à la grande diversité et variabilité des langues ?).
PhD thesis, 2022
Inria-ALMAnaCH at WMT 2022: Does Transcription Help Cross-Script Machine Translation?
Proceedings of the Seventh Conference on Machine Translation, 2022
Languages You Know Influence Those You Learn: Impact of Language Characteristics on Multi-Lingual Text-to-Text Transfer.
Proceedings of the Transfer Learning for Natural Language Processing Workshop, 2022
Quand être absent de mBERT n'est que le commencement : Gérer de nouvelles langues à l'aide de modèles de langues multilingues (When Being Unseen from mBERT is just the Beginning : Handling New Languages With Multilingual Language Models).
Proceedings of the Actes de la 29e Conférence sur le Traitement Automatique des Langues Naturelles. Volume 1 : conférence principale, 2022
Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing, 2022
2021
Cross-Lingual GenQA: A Language-Agnostic Generative Question Answering Approach for Open-Domain Question Answering.
CoRR, 2021
When Being Unseen from mBERT is just the Beginning: Handling New Languages With Multilingual Language Models.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021
First Align, then Predict: Understanding the Cross-Lingual Ability of Multilingual BERT.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021
2020
Can Multilingual Language Models Transfer to an Unseen Dialect? A Case Study on North African Arabizi.
CoRR, 2020
Les modèles de langue contextuels Camembert pour le français : impact de la taille et de l'hétérogénéité des données d'entrainement (CamemBERT Contextual Language Models for French: Impact of Training Data Size and Heterogeneity).
Proceedings of the Actes de la 6e conférence conjointe Journées d'Études sur la Parole (JEP, 2020
Proceedings of The 12th Language Resources and Evaluation Conference, 2020
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
2019
Proceedings of the 5th Workshop on Noisy User-generated Text, 2019
2018
Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies, Brussels, Belgium, October 31, 2018