Orevaoghene Ahia
According to our database, Orevaoghene Ahia authored at least 27 papers between 2019 and 2024.
Bibliography
2024
MAGNET: Improving the Multilingual Fairness of Language Models with Adaptive Gradient-Based Tokenization.
CoRR, 2024
DIALECTBENCH: A NLP Benchmark for Dialects, Varieties, and Closely-Related Languages.
CoRR, 2024
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Short Papers, 2024
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
MYTE: Morphology-Driven Byte Encoding for Better and Fairer Multilingual Language Modeling.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
DIALECTBENCH: An NLP Benchmark for Dialects, Varieties, and Closely-Related Languages.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
Critical Learning Periods: Leveraging Early Training Dynamics for Efficient Data Pruning.
Proceedings of the Findings of the Association for Computational Linguistics, 2024
2023
CoRR, 2023
Proceedings of the 12th Joint Conference on Lexical and Computational Semantics, 2023
AfriWOZ: Corpus for Exploiting Cross-Lingual Transfer for Dialogue Generation in Low-Resource, African Languages.
Proceedings of the International Joint Conference on Neural Networks, 2023
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
That was the last straw, we need more: Are Translation Systems Sensitive to Disambiguating Context?
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Do All Languages Cost the Same? Tokenization in the Era of Commercial Language Models.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
2022
Trans. Assoc. Comput. Linguistics, 2022
Ìtàkúròso: Exploiting Cross-Lingual Transferability for Natural Language Generation of Dialogues in Low-Resource, African Languages.
CoRR, 2022
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
2021
Trans. Assoc. Comput. Linguistics, 2021
The Low-Resource Double Bind: An Empirical Study of Pruning for Low-Resource Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021
2020
Participatory Research for Low-resourced Machine Translation: A Case Study in African Languages.
CoRR, 2020
Towards Supervised and Unsupervised Neural Machine Translation Baselines for Nigerian Pidgin.
Proceedings of the 1st AfricaNLP Workshop Proceedings, 2020
Proceedings of the 1st AfricaNLP Workshop Proceedings, 2020
Participatory Research for Low-resourced Machine Translation: A Case Study in African Languages.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020
2019
PidginUNMT: Unsupervised Neural Machine Translation from West African Pidgin to English.
CoRR, 2019