David Ifeoluwa Adelani

Orcid: 0000-0002-0193-2083

According to our database, David Ifeoluwa Adelani authored at least 74 papers between 2016 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
WorldCuisines: A Massive-Scale Benchmark for Multilingual and Multicultural Visual Question Answering on Global Cuisines.
CoRR, 2024

The Responsible Foundation Model Development Cheatsheet: A Review of Tools & Resources.
CoRR, 2024

CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark.
CoRR, 2024

IrokoBench: A New Benchmark for African Languages in the Age of Large Language Models.
CoRR, 2024

Which Nigerian-Pidgin does Generative AI speak?: Issues about Representativeness and Bias for Multilingual and Low Resource Languages.
CoRR, 2024

Comparing LLM prompting with Cross-lingual transfer performance on Indigenous and Low-resource Brazilian Languages.
CoRR, 2024

EkoHate: Abusive Language and Hate Speech Detection for Code-switched Political Discussions on Nigerian Twitter.
CoRR, 2024

ANGOFA: Leveraging OFA Embedding Initialization and Synthetic Data for Angolan Language Model.
CoRR, 2024

Evaluating WMT 2024 Metrics Shared Task Submissions on AfriMTE (the African Challenge Set).
Proceedings of the Ninth Conference on Machine Translation, 2024

Are LLMs Breaking MT Metrics? Results of the WMT24 Metrics Shared Task.
Proceedings of the Ninth Conference on Machine Translation, 2024


MINERS: Multilingual Language Models as Semantic Retrievers.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Machine Translation Hallucination Detection for Low and High Resource Languages using Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Voices Unheard: NLP Resources and Models for Yorùbá Regional Dialects.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

SIB-200: A Simple, Inclusive, and Big Evaluation Dataset for Topic Classification in 200+ Languages and Dialects.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

ÌròyìnSpeech: A Multi-purpose Yorùbá Speech Corpus.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024


2023
Consultative engagement of stakeholders toward a roadmap for African language technologies.
Patterns, August, 2023

AfriMTE and AfriCOMET: Empowering COMET to Embrace Under-resourced African Languages.
CoRR, 2023

How good are Large Language Models on African Languages?
CoRR, 2023

YORC: Yoruba Reading Comprehension dataset.
CoRR, 2023

AfriQA: Cross-lingual Open-Retrieval Question Answering for African Languages.
CoRR, 2023

MasakhaNEWS: News Topic Classification for African languages.
CoRR, 2023

E KÚ [MASK]: Integrating Yorùbá cultural greetings into machine translation.
CoRR, 2023

AfriSenti: A Twitter Sentiment Analysis Benchmark for African Languages.
CoRR, 2023

SemEval-2023 Task 12: Sentiment Analysis for African Languages (AfriSenti-SemEval).
Proceedings of the 17th International Workshop on Semantic Evaluation, 2023

Improving Language Plasticity via Pretraining with Active Forgetting.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

MasakhaNEWS: News Topic Classification for African languages.
Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics, 2023


Better Quality Pre-training Data and T5 Models for African Languages.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023



MphayaNER: Named Entity Recognition for Tshivenda.
Proceedings of the 4th Workshop on African Natural Language Processing, 2023

ε kú [MASK]: Integrating Yorùbá Cultural greetings into Machine Translation.
Proceedings of the 4th Workshop on African Natural Language Processing, 2023


BLOOM+1: Adding Language Support to BLOOM for Zero-Shot Prompting.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

NollySenti: Leveraging Transfer Learning and Machine Translation for Nigerian Movie Sentiment Classification.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023


2022
BLOOM+1: Adding Language Support to BLOOM for Zero-Shot Prompting.
CoRR, 2022

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model.
CoRR, 2022

Task-Adaptive Pre-Training for Boosting Learning With Noisy Labels: A Study on Text Classification for African Languages.
CoRR, 2022

YOSM: A new Yoruba sentiment corpus for movie reviews.
CoRR, 2022

Multilingual Language Model Adaptive Fine-Tuning: A Study on African Languages.
CoRR, 2022

NaijaSenti: A Nigerian Twitter Sentiment Corpus for Multilingual Sentiment Analysis.
CoRR, 2022

Findings of the WMT'22 Shared Task on Large-Scale Machine Translation Evaluation for African Languages.
Proceedings of the Seventh Conference on Machine Translation, 2022

TOKEN Is a MASK: Few-shot Named Entity Recognition with Pre-trained Language Models.
Proceedings of the Text, Speech, and Dialogue - 25th International Conference, 2022


MCSE: Multimodal Contrastive Learning of Sentence Embeddings.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022


NaijaSenti: A Nigerian Twitter Sentiment Corpus for Multilingual Sentiment Analysis.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

BibleTTS: a large, high-fidelity, multilingual, and uniquely African speech corpus.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022


Few-Shot Pidgin Text Adaptation via Contrastive Fine-Tuning.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

Adapting Pre-trained Language Models to African Languages via Multilingual Adaptive Fine-Tuning.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

Pre-Trained Multilingual Sequence-to-Sequence Models: A Hope for Low-Resource Language Translation?
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Is BERT Robust to Label Noise? A Study on Learning with Noisy Labels in Text Classification.
Proceedings of the Third Workshop on Insights from Negative Results in NLP, 2022

2021
MasakhaNER: Named Entity Recognition for African Languages.
Trans. Assoc. Comput. Linguistics, 2021


MENYO-20k: A Multi-domain English-Yorùbá Corpus for Machine Translation and Domain Adaptation.
Proceedings of the 2nd AfricaNLP Workshop Proceedings, AfricaNLP@EACL 2021, Virtual Event, 2021

The Effect of Domain and Diacritics in Yoruba-English Neural Machine Translation.
Proceedings of the 18th Biennial Machine Translation Summit - Volume 1: Research Track, 2021

Preventing Author Profiling through Zero-Shot Multilingual Back-Translation.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

2020
Estimating community feedback effect on topic choice in social media with predictive modeling.
EPJ Data Sci., 2020

Robust Differentially Private Training of Deep Neural Networks.
CoRR, 2020

Improving Yorùbá Diacritic Restoration.
Proceedings of the 1st AfricaNLP Workshop Proceedings, 2020

Distant Supervision and Noisy Label Learning for Low Resource Named Entity Recognition: A Study on Hausa and Yorùbá.
Proceedings of the 1st AfricaNLP Workshop Proceedings, 2020

Unsupervised Pidgin Text Generation By Pivoting English Data and Self-Training.
Proceedings of the 1st AfricaNLP Workshop Proceedings, 2020

Investigating the Impact of Pre-trained Word Embeddings on Memorization in Neural Networks.
Proceedings of the Text, Speech, and Dialogue, 2020

Massive vs. Curated Embeddings for Low-Resourced Languages: the Case of Yorùbá and Twi.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Privacy Guarantees for De-Identifying Text Transformations.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Transfer Learning and Distant Supervision for Multilingual Transformer Models: A Study on African Languages.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Generating Sentiment-Preserving Fake Online Reviews Using Neural Language Models and Their Human- and Machine-Based Detection.
Proceedings of the Advanced Information Networking and Applications, 2020

2019
Massive vs. Curated Word Embeddings for Low-Resourced Languages. The Case of Yorùbá and Twi.
CoRR, 2019

Demographic Inference and Representative Population Estimates from Multilingual Social Media Data.
Proceedings of the World Wide Web Conference, 2019

2016
Enhancing the reusability and interoperability of artificial neural networks with DEVS modeling and simulation.
Int. J. Model. Simul. Sci. Comput., 2016

