Shamsuddeen Hassan Muhammad

Orcid: 0000-0001-7708-0799

According to our database, Shamsuddeen Hassan Muhammad authored at least 46 papers between 2016 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Quantity vs. Quality of Monolingual Source Data in Automatic Text Translation: Can It Be Too Little If It Is Too Good?
CoRR, 2024

BLEnD: A Benchmark for LLMs on Everyday Knowledge in Diverse Cultures and Languages.
CoRR, 2024

IrokoBench: A New Benchmark for African Languages in the Age of Large Language Models.
CoRR, 2024

SemRel2024: A Collection of Semantic Textual Relatedness Datasets for 14 Languages.
CoRR, 2024

Analyzing COVID-19 Vaccination Sentiments in Nigerian Cyberspace: Insights from a Manually Annotated Twitter Dataset.
CoRR, 2024

Findings of WMT2024 English-to-Low Resource Multimodal Translation Task.
Proceedings of the Ninth Conference on Machine Translation, 2024

Correcting FLORES Evaluation Dataset for Four African Languages.
Proceedings of the Ninth Conference on Machine Translation, 2024

2023
Combining Symbolic and Deep Learning Approaches for Sentiment Analysis.
Proceedings of the Compendium of Neurosymbolic Artificial Intelligence, 2023

Leveraging Closed-Access Multilingual Embedding for Automatic Sentence Alignment in Low Resource Languages.
CoRR, 2023

AfriMTE and AfriCOMET: Empowering COMET to Embrace Under-resourced African Languages.
CoRR, 2023

AfriQA: Cross-lingual Open-Retrieval Question Answering for African Languages.
CoRR, 2023

The African Stopwords project: curating stopwords for African languages.
CoRR, 2023

MasakhaNEWS: News Topic Classification for African languages.
CoRR, 2023

Adapting to the Low-Resource Double-Bind: Investigating Low-Compute Methods on Low-Resource African Languages.
CoRR, 2023

AfriSenti: A Twitter Sentiment Analysis Benchmark for African Languages.
CoRR, 2023

SemEval-2023 Task 12: Sentiment Analysis for African Languages (AfriSenti-SemEval).
Proceedings of the 17th International Workshop on Semantic Evaluation, 2023

HausaNLP at SemEval-2023 Task 10: Transfer Learning, Synthetic Data and Side-information for Multi-level Sexism Classification.
Proceedings of the 17th International Workshop on Semantic Evaluation, 2023


MasakhaNEWS: News Topic Classification for African languages.
Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics, 2023

Symbolic Versus Deep Learning Techniques for Explainable Sentiment Analysis.
Proceedings of the Progress in Artificial Intelligence, 2023



HaVQA: A Dataset for Visual Question Answering and Multimodal Research in Hausa Language.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023


2022
Quality at a Glance: An Audit of Web-Crawled Multilingual Datasets.
Trans. Assoc. Comput. Linguistics, 2022

HERDPhobia: A Dataset for Hate Speech against Fulani in Nigeria.
CoRR, 2022

Separating Grains from the Chaff: Using Data Filtering to Improve Multilingual Translation for Low-Resourced African Languages.
CoRR, 2022

Deep Sequence Models for Text Classification Tasks.
CoRR, 2022

Ìtàkúròso: Exploiting Cross-Lingual Transferability for Natural Language Generation of Dialogues in Low-Resource, African Languages.
CoRR, 2022

NaijaSenti: A Nigerian Twitter Sentiment Corpus for Multilingual Sentiment Analysis.
CoRR, 2022

Separating Grains from the Chaff: Using Data Filtering to Improve Multilingual Translation for Low-Resourced African Languages.
Proceedings of the Seventh Conference on Machine Translation, 2022


NaijaSenti: A Nigerian Twitter Sentiment Corpus for Multilingual Sentiment Analysis.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Hausa Visual Genome: A Dataset for Multi-Modal English to Hausa Machine Translation.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

BibleTTS: a large, high-fidelity, multilingual, and uniquely African speech corpus.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022


2021
MasakhaNER: Named Entity Recognition for African Languages.
Trans. Assoc. Comput. Linguistics, 2021

2020
A Survey on Machine Learning Techniques in Movie Revenue Prediction.
SN Comput. Sci., 2020

Participatory Research for Low-resourced Machine Translation: A Case Study in African Languages.
CoRR, 2020


Incremental Approach for Automatic Generation of Domain-Specific Sentiment Lexicon.
Proceedings of the Advances in Information Retrieval, 2020

2016
Massive Open Online Courses: A Success of Cloud Computing in Education.
Proceedings of the 2nd International Conference on Computing Research and Innovations, 2016

