David Ifeoluwa Adelani

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Machine Translation Hallucination Detection for Low and High Resource Languages using Large Language Models.

[BibT_eX]

[DOI]

Eduardo Sánchez

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Voices Unheard: NLP Resources and Models for Yorùbá Regional Dialects.

[BibT_eX]

[DOI]

Daud Abolade

Noah A. Smith

Yulia Tsvetkov

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

SIB-200: A Simple, Inclusive, and Big Evaluation Dataset for Topic Classification in 200+ Languages and Dialects.

[BibT_eX]

[DOI]

Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

ÌròyìnSpeech: A Multi-purpose Yorùbá Speech Corpus.

[BibT_eX]

[DOI]

Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Mitigating Translationese in Low-resource Languages: The Storyboard Approach.

[BibT_eX]

[DOI]

Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023

Consultative engagement of stakeholders toward a roadmap for African language technologies.

[BibT_eX]

[DOI]

Masabata Mokgesi-Selinga

Frederick R. Apina

Angela Thandizwe Mthembu

Arshath Ramkilowan

Babatunde Oladimeji

Patterns, August, 2023

AfriMTE and AfriCOMET: Empowering COMET to Embrace Under-resourced African Languages.

[BibT_eX]

[DOI]

Jiayi Wang

Oluwabusayo Olufunke Awoyomi

Lyse Naomi Wamba Momo

Onyekachi Raphael Ogbu

Sam Brian

Verrah Akinyi Otiende

Chinedu Emmanuel Mbonu

Sakayo Toadoum Sari

Pontus Stenetorp

CoRR, 2023

How good are Large Language Models on African Languages?

[BibT_eX]

[DOI]

Jessica Ojo

Kelechi Ogueji

Pontus Stenetorp

CoRR, 2023

YORC: Yoruba Reading Comprehension dataset.

[BibT_eX]

[DOI]

Aremu Anuoluwapo

CoRR, 2023

AfriQA: Cross-lingual Open-Retrieval Question Answering for African Languages.

[BibT_eX]

[DOI]

Albert Njoroge Kahira

Atnafu Lambebo Tonja

Akari Asai

Tunde Oluwaseyi Ajayi

Andre Niyongabo Rubungo

Daniel A. Ajisafe

Emeka Felix Onwuegbuzia

Chinedu Emmanuel Mbonu

CoRR, 2023

MasakhaNEWS: News Topic Classification for African languages.

[BibT_eX]

[DOI]

Marek Masiak

Israel Abebe Azime

Jesujoba Oluwadara Alabi

Atnafu Lambebo Tonja

Christine Mwase

Odunayo Ogundepo

Doreen Nixdorf

Tunde Oluwaseyi Ajayi

Tatiana Moteu Ngoli

Brian Odhiambo

Nnaemeka C. Obiefuna

Saheed Abdullahi Salahudeen

Oluwabusayo Olufunke Awoyomi

Habiba Abdulganiy Kailani

Tolulope Anu Adelani

Onyekachi Raphael Ogbu

Abdulmejid Tuni Johar

CoRR, 2023

E KÚ [MASK]: Integrating Yorùbá cultural greetings into machine translation.

[BibT_eX]

[DOI]

Idris Akinade

Clement Odoje

CoRR, 2023

AfriSenti: A Twitter Sentiment Analysis Benchmark for African Languages.

[BibT_eX]

[DOI]

CoRR, 2023

SemEval-2023 Task 12: Sentiment Analysis for African Languages (AfriSenti-SemEval).

[BibT_eX]

[DOI]

Idris Abdulmumin

Seid Muhie Yimam

Proceedings of the The 17th International Workshop on Semantic Evaluation, 2023

Improving Language Plasticity via Pretraining with Active Forgetting.

[BibT_eX]

[DOI]

Yihong Chen

Kelly Marchisio

Roberta Raileanu

Pontus Lars Erik Saito Stenetorp

Sebastian Riedel

Mikel Artetxe

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

MasakhaNEWS: News Topic Classification for African languages.

[BibT_eX]

[DOI]

Doreen Nixdorf

Nnaemeka C. Obiefuna

Muhidin Mohamed

Saheed Abdullahi Salahudeen

Teshome Mulugeta Ababu

Oluwabusayo Olufunke Awoyomi

Onyekachi Raphael Ogbu

Abdulmejid Tuni Johar

Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics, 2023

XTREME-UP: A User-Centric Scarce-Data Benchmark for Under-Represented Languages.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Better Quality Pre-training Data and T5 Models for African Languages.

[BibT_eX]

[DOI]

Mofetoluwa Adeyemi

Orevaoghene Ahia

Odunayo Ogundepo

Jimmy Lin

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Cross-lingual Open-Retrieval Question Answering for African Languages.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

AfriSenti: A Twitter Sentiment Analysis Benchmark for African Languages.

[BibT_eX]

[DOI]

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

MphayaNER: Named Entity Recognition for Tshivenda.

[BibT_eX]

[DOI]

Rendani Mbuvha

Tendani Mutavhatsindi

Tshimangadzo Rakhuhu

Aluwani Mauda

Tshifhiwa Joshua Maumela

Proceedings of the 4th Workshop on African Natural Language Processing, 2023

ε kú : Integrating YorùBá Cultural greetings into Machine Translation.

[BibT_eX]

[DOI]

Idris Akinade

Jesujoba Oluwadara Alabi

Clement Odoje

Proceedings of the 4th Workshop on African Natural Language Processing, 2023

MasakhaNEWS: News Topic Classification for African languages.

[BibT_eX]

[DOI]

Proceedings of the 4th Workshop on African Natural Language Processing, 2023

BLOOM+1: Adding Language Support to BLOOM for Zero-Shot Prompting.

[BibT_eX]

[DOI]

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

NollySenti: Leveraging Transfer Learning and Machine Translation for Nigerian Movie Sentiment Classification.

[BibT_eX]

[DOI]

Jing Peng

Anna Feldman

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023

MasakhaPOS: Part-of-Speech Tagging for Typologically Diverse African languages.

[BibT_eX]

[DOI]

Cheikh M. Bamba Dione

Victoire Memdjokam Koagne

Andiswa Bukula

Rooweither Mabuya

Allahsera Auguste Tapo

Edwin Munkoh-Buabeng

Fatoumata Ouoba Kabore

Elvis Mboning Tchiaze

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022

BLOOM+1: Adding Language Support to BLOOM for Zero-Shot Prompting.

[BibT_eX]

[DOI]

CoRR, 2022

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model.

[BibT_eX]

[DOI]

CoRR, 2022

Task-Adaptive Pre-Training for Boosting Learning With Noisy Labels: A Study on Text Classification for African Languages.

[BibT_eX]

[DOI]

Dawei Zhu

Fangzhou Zhai

CoRR, 2022

yosm: A new yoruba sentiment corpus for movie reviews.

[BibT_eX]

[DOI]

Anna Feldman

CoRR, 2022

Multilingual Language Model Adaptive Fine-Tuning: A Study on African Languages.

[BibT_eX]

[DOI]

Marius Mosbach

CoRR, 2022

NaijaSenti: A Nigerian Twitter Sentiment Corpus for Multilingual Sentiment Analysis.

[BibT_eX]

[DOI]

Saheed Abdullahi Salahudeen

Aremu Anuoluwapo

Alípio Jeorge

Pavel Brazdil

CoRR, 2022

Findings of the WMT'22 Shared Task on Large-Scale Machine Translation Evaluation for African Languages.

[BibT_eX]

[DOI]

Md Mahfuz Ibn Alam

Antonios Anastasopoulos

Proceedings of the Seventh Conference on Machine Translation, 2022

TOKEN Is a MASK: Few-shot Named Entity Recognition with Pre-trained Language Models.

[BibT_eX]

[DOI]

Thomas Kleinbauer

Albert Villanova del Moral

Proceedings of the Text, Speech, and Dialogue - 25th International Conference, 2022

The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset.

[BibT_eX]

[DOI]

Teven Le Scao

Leandro von Werra

Chenghao Mou

Eduardo González Ponferrada

Angelina McMillan-Major

Alexandra Sasha Luccioni

Yacine Jernite

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

MCSE: Multimodal Contrastive Learning of Sentence Embeddings.

[BibT_eX]

[DOI]

Miaoran Zhang

Marius Mosbach

Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

A Few Thousand Translations Go a Long Way! Leveraging Pre-trained Models for African News Translation.

[BibT_eX]

[DOI]

Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

NaijaSenti: A Nigerian Twitter Sentiment Corpus for Multilingual Sentiment Analysis.

[BibT_eX]

[DOI]

Aremu Anuoluwapo

Idris Abdulmumin

Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

BibleTTS: a large, high-fidelity, multilingual, and uniquely African speech corpus.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

MasakhaNER 2.0: Africa-centric Transfer Learning for Named Entity Recognition.

[BibT_eX]

[DOI]

Peter Nabende

Cheikh M. Bamba Dione

Andiswa Bukula

Rooweither Mabuya

Fatoumata Ouoba Kabore

Victoire Memdjokam Koagne

Allahsera Auguste Tapo

Joyce Nakatumba-Nabende

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Few-Shot Pidgin Text Adaptation via Contrastive Fine-Tuning.

[BibT_eX]

[DOI]

Ernie Chang

Vera Demberg

Proceedings of the 29th International Conference on Computational Linguistics, 2022

Adapting Pre-trained Language Models to African Languages via Multilingual Adaptive Fine-Tuning.

[BibT_eX]

[DOI]

Marius Mosbach

Proceedings of the 29th International Conference on Computational Linguistics, 2022

Pre-Trained Multilingual Sequence-to-Sequence Models: A Hope for Low-Resource Language Translation?

[BibT_eX]

[DOI]

Ruisi Su

Arya McCarthy

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Is BERT Robust to Label Noise? A Study on Learning with Noisy Labels in Text Classification.

[BibT_eX]

[DOI]

Dawei Zhu

Fangzhou Zhai

Proceedings of the Third Workshop on Insights from Negative Results in NLP, 2022

2021

MasakhaNER: Named Entity Recognition for African Languages.

[BibT_eX]

[DOI]

Joyce Nakatumba-Nabende

Rubungo Andre Niyongabo

Kelechi Ogueji

Thierno Ibrahima Diop

Trans. Assoc. Comput. Linguistics, 2021

AI4D - African Language Program.

[BibT_eX]

[DOI]

Thierno Ibrahima Diop

Proceedings of the 2nd AfricaNLP Workshop Proceedings, AfricaNLP@EACL 2021, Virtual Event, 2021

MENYO-20k: A Multi-domain English-Yorùbá Corpus for Machine Translation and Domain Adaptation.

[BibT_eX]

[DOI]

Proceedings of the 2nd AfricaNLP Workshop Proceedings, AfricaNLP@EACL 2021, Virtual Event, 2021

The Effect of Domain and Diacritics in Yoruba-English Neural Machine Translation.

[BibT_eX]

[DOI]

Proceedings of the 18th Biennial Machine Translation Summit - Volume 1: Research Track, 2021

Preventing Author Profiling through Zero-Shot Multilingual Back-Translation.

[BibT_eX]

[DOI]

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

2020

Estimating community feedback effect on topic choice in social media with predictive modeling.

[BibT_eX]

[DOI]

Ryota Kobayashi

Ingmar Weber

Przemyslaw A. Grabowicz

EPJ Data Sci., 2020

Robust Differentially Private Training of Deep Neural Networks.

[BibT_eX]

[DOI]

Thomas Kleinbauer

CoRR, 2020

Improving Yorùbá Diacritic Restoration.

[BibT_eX]

[DOI]

Iroro Orife

Timi E. Fasubaa

Victor Williamson

Wuraola Fisayo Oyewusi

Olamilekan Wahab

Kola Tubosun

Proceedings of the 1st AfricaNLP Workshop Proceedings, 2020

Distant Supervision and Noisy Label Learning for Low Resource Named Entity Recognition: A Study on Hausa and Yorùbá.

[BibT_eX]

[DOI]

Proceedings of the 1st AfricaNLP Workshop Proceedings, 2020

Unsupervised Pidgin Text Generation By Pivoting English Data and Self-Training.

[BibT_eX]

[DOI]

Ernie Chang

Xiaoyu Shen

Vera Demberg

Proceedings of the 1st AfricaNLP Workshop Proceedings, 2020

Investigating the Impact of Pre-trained Word Embeddings on Memorization in Neural Networks.

[BibT_eX]

[DOI]

Aleena Thomas

Aditya Mogadala

Proceedings of the Text, Speech, and Dialogue, 2020

Massive vs. Curated Embeddings for Low-Resourced Languages: the Case of Yorùbá and Twi.

[BibT_eX]

[DOI]

Kwabena Amponsah-Kaakyire

Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Privacy Guarantees for De-Identifying Text Transformations.

[BibT_eX]

[DOI]

Thomas Kleinbauer

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Transfer Learning and Distant Supervision for Multilingual Transformer Models: A Study on African Languages.

[BibT_eX]

[DOI]

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Generating Sentiment-Preserving Fake Online Reviews Using Neural Language Models and Their Human- and Machine-Based Detection.

[BibT_eX]

[DOI]

Proceedings of the Advanced Information Networking and Applications, 2020

2019

Massive vs. Curated Word Embeddings for Low-Resourced Languages. The Case of Yorùbá and Twi.

[BibT_eX]

[DOI]

Kwabena Amponsah-Kaakyire

CoRR, 2019

Demographic Inference and Representative Population Estimates from Multilingual Social Media Data.

[BibT_eX]

[DOI]

Zijian Wang

Scott A. Hale

Przemyslaw A. Grabowicz

Timo Hartmann

Fabian Flöck

David Jurgens

Proceedings of the World Wide Web Conference, 2019

2016

Enhancing the reusability and interoperability of artificial neural networks with DEVS modeling and simulation.

[BibT_eX]

[DOI]