Atnafu Lambebo Tonja

Bharathi Raja Chakravarthi

José Antonio García-Díaz

Selen Bozkurt

Hector G. Ceballos

Rafael Valencia-García

Grigori Sidorov

Luis Alfonso Ureña López

Alexander F. Gelbukh

Salud María Jiménez-Zafra

Proces. del Leng. Natural, 2024

InkubaLM: A small language model for low-resource African languages.

[BibT_eX]

[DOI]

CoRR, 2024

CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark.

[BibT_eX]

[DOI]

David Romero

Chenyang Lyu

Haryo Akbarianto Wibowo

Jan Christian Blaise Cruz

Henok Biadglign Ademtew

Hernán Maina

Holy Lovenia

Israel Abebe Azime

Jay P. Gala

Jiahui Geng

Jesús-Germán Ortiz-Barajas

Jinheon Baek

Jocelyn Dunstan

Laura Alonso Alemany

Kumaranage Ravindu Yasas Nagasinghe

Luciana Benotti

Luis Fernando D'Haro

Marcelo Viridiano

Marcos Estecha-Garitagoitia

Maria Camila Buitrago Cabrera

Mario Rodríguez-Cantelar

Mélanie Jouitteau

Mihail Mihaylov

Mohamed Fazli Mohamed Imam

Muhammad Farid Adilazuarda

Munkhjargal Gochoo

Munkh-Erdene Otgonbold

Tiago Timponi Torrent

Toqeer Ehsan

Vladimir Araujo

Yova Kementchedjhieva

CoRR, 2024

EthioMT: Parallel Corpus for Low-resource Ethiopian Languages.

[BibT_eX]

[DOI]

CoRR, 2024

EthioLLM: Multilingual Large Language Models for Ethiopian Languages with Task Evaluation.

[BibT_eX]

[DOI]

Ebrahim Chekol Jibril

Michael Melese Woldeyohannis

CoRR, 2024

NLP Progress in Indigenous Latin American Languages.

[BibT_eX]

[DOI]

Fazlourrahman Balouchzahi

Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

The Zeno's Paradox of 'Low-Resource' Languages.

[BibT_eX]

[DOI]

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Walia-LLM: Enhancing Amharic-LLaMA by Integrating Task-Specific and Generative Datasets.

[BibT_eX]

[DOI]

Walelign Tewabe Sewunetie

Seid Muhie Yimam

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

EthioLLM: Multilingual Large Language Models for Ethiopian Languages with Task Evaluation.

[BibT_eX]

[DOI]

Ebrahim Chekol Jibril

Michael Melese Woldeyohannis

Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023

AfriSpeech-200: Pan-African Accented Speech Dataset for Clinical and General Domain ASR.

[BibT_eX]

[DOI]

Tobi Olatunji

Tejumade Afonja

Aditya Yadavalli

Sahib Singh

Trans. Assoc. Comput. Linguistics, 2023

First Attempt at Building Parallel Corpora for Machine Translation of Northeast India's Very Low-Resource Languages.

[BibT_eX]

[DOI]

CoRR, 2023

Adapting Pretrained ASR Models to Low-resource Clinical Speech using Epistemic Uncertainty-based Data Selection.

[BibT_eX]

[DOI]

CoRR, 2023

Automatic Translation of Hate Speech to Non-hate Speech in Social Media Texts.

[BibT_eX]

[DOI]

CoRR, 2023

Enhancing Translation for Indigenous Languages: Experiments with Multilingual Models.

[BibT_eX]

[DOI]

CoRR, 2023

Parallel Corpus for Indigenous Language Translation: Spanish-Mazatec and Spanish-Mixtec.

[BibT_eX]

[DOI]

Christian Efraín Maldonado Sifuentes

David Alejandro Mendoza Castillo

Olga Kolesnikova

Noé Alejandro Castro-Sánchez

Grigori Sidorov

Alexander F. Gelbukh

CoRR, 2023

AfriQA: Cross-lingual Open-Retrieval Question Answering for African Languages.

[BibT_eX]

[DOI]

Albert Njoroge Kahira

Abraham Toluwase Owodunni

Akintunde Oladipo

Iyanuoluwa Shode

Akari Asai

Tunde Oluwaseyi Ajayi

Andre Niyongabo Rubungo

Daniel A. Ajisafe

Emeka Felix Onwuegbuzia

Chinedu Emmanuel Mbonu

CoRR, 2023

The African Stopwords project: curating stopwords for African languages.

[BibT_eX]

[DOI]

CoRR, 2023

MasakhaNEWS: News Topic Classification for African languages.

[BibT_eX]

[DOI]

Marek Masiak

Israel Abebe Azime

Jesujoba Oluwadara Alabi

Christine Mwase

Odunayo Ogundepo

Akintunde Oladipo

Doreen Nixdorf

Abraham Toluwase Owodunni

Tunde Oluwaseyi Ajayi

Tatiana Moteu Ngoli

Brian Odhiambo

Nnaemeka C. Obiefuna

Saheed Abdullahi Salahudeen

Oluwabusayo Olufunke Awoyomi

Iyanuoluwa Shode

Tolulope Anu Adelani

Habiba Abdulganiy Kailani

Onyekachi Raphael Ogbu

Abdulmejid Tuni Johar

CoRR, 2023

Masakhane-Afrisenti at SemEval-2023 Task 12: Sentiment Analysis using Afro-centric Language Models and Adapters for Low-resource African Languages.

[BibT_eX]

[DOI]

CoRR, 2023

Adapting to the Low-Resource Double-Bind: Investigating Low-Compute Methods on Low-Resource African Languages.

[BibT_eX]

[DOI]

Colin Leong

Herumb Shandilya

Steven Kolawole

Younwoo Choi

Tosin P. Adewumi

CoRR, 2023

Masakhane-Afrisenti at SemEval-2023 Task 12: Sentiment Analysis using Afro-centric Language Models and Adapters for Low-resource African Languages.

[BibT_eX]

[DOI]

Proceedings of the The 17th International Workshop on Semantic Evaluation, 2023

Natural Language Processing in Ethiopian Languages: Current State, Challenges, and Opportunities.

[BibT_eX]

[DOI]

Proceedings of the Fourth workshop on Resources for African Indigenous Languages (RAIL 2023), 2023

AfriNames: Most ASR Models "Butcher" African Names.

[BibT_eX]

[DOI]

Tobi Olatunji

Tejumade Afonja

Amina Mardiyyah Rufai

Sahib Singh

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

MasakhaNEWS: News Topic Classification for African languages.

[BibT_eX]

[DOI]

Akintunde Oladipo

Doreen Nixdorf

Abraham Toluwase Owodunni

Nnaemeka C. Obiefuna

Muhidin Mohamed

Saheed Abdullahi Salahudeen

Teshome Mulugeta Ababu

Oluwabusayo Olufunke Awoyomi

Onyekachi Raphael Ogbu

Abdulmejid Tuni Johar

Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics, 2023

Cross-lingual Open-Retrieval Question Answering for African Languages.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

The Less the Merrier? Investigating Language Representation in Multilingual Models.

[BibT_eX]

[DOI]

Hellina Nigatu

Jugal Kalita

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Adapting to the Low-Resource Double-Bind: Investigating Low-Compute Methods on Low-Resource African Languages.

[BibT_eX]

[DOI]

Colin Leong

Herumb Shandilya

Proceedings of the 4th Workshop on African Natural Language Processing, 2023

MasakhaNEWS: News Topic Classification for African languages.

[BibT_eX]

[DOI]

Proceedings of the 4th Workshop on African Natural Language Processing, 2023

2022

Transformer-based Model for Word Level Language Identification in Code-mixed Kannada-English Texts.

[BibT_eX]

[DOI]

CoRR, 2022

AfroLM: A Self-Active Learning-based Multilingual Pretrained Language Model for 23 African Languages.

[BibT_eX]

[DOI]

Oluwabusayo Olufunke Awoyomi