Vukosi Marivate

Orcid: 0000-0002-6731-6267

Affiliations:
  • University of Pretoria, Department of Computer Science, South Africa


According to our database, Vukosi Marivate authored at least 72 papers between 2007 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
From N-grams to Pre-trained Multilingual Models For Language Identification.
CoRR, 2024

Cross-lingual transfer of multilingual models on low resource African Languages.
CoRR, 2024

InkubaLM: A small language model for low-resource African languages.
CoRR, 2024

BOTS-LM: Training Large Language Models for Setswana.
CoRR, 2024

Prompting Towards Alleviating Code-Switched Data Scarcity in Under-Resourced Languages with GPT as a Pivot.
CoRR, 2024

ChatGPT as a Text Annotation Tool to Evaluate Sentiment Analysis on South African Financial Institutions.
IEEE Access, 2024

Correcting FLORES Evaluation Dataset for Four African Languages.
Proceedings of the Ninth Conference on Machine Translation, 2024

2023
A review and comparative study of cancer detection using machine learning: SBERT and SimCSE application.
BMC Bioinform., December, 2023

On the transparency of large AI models.
Patterns, July, 2023

Multimodal Misinformation Detection in a South African Social Media Environment.
CoRR, 2023

PuoBERTa: Training and evaluation of a curated language model for Setswana.
CoRR, 2023

Investigating the Efficacy of Large Language Models in Reflective Assessment Methods through Chain of Thoughts Prompting.
CoRR, 2023

Izindaba-Tindzaba: Machine learning news categorisation for Long and Short Text for isiZulu and Siswati.
CoRR, 2023

Textual Augmentation Techniques Applied to Low Resource Machine Translation: Case of Swahili.
CoRR, 2023

Preparing the Vuk'uzenzele and ZA-gov-multilingual South African multilingual corpora.
CoRR, 2023

Integrating Bidirectional Long Short-Term Memory with Subword Embedding for Authorship Attribution.
Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics, 2023

Unsupervised Cross-lingual Word Embedding Representation for English-isiZulu.
Proceedings of the Fourth workshop on Resources for African Indigenous Languages (RAIL 2023), 2023

Preparing the Vuk'uzenzele and ZA-gov-multilingual South African multilingual corpora.
Proceedings of the Fourth workshop on Resources for African Indigenous Languages (RAIL 2023), 2023

Investigating the Efficacy of Large Language Models in Reflective Assessment Methods through Chain of Thought Prompting.
Proceedings of the 4th African Human Computer Interaction Conference, 2023

Fine-Tuning Multilingual Pretrained African Language Models.
Proceedings of the 4th Workshop on African Natural Language Processing, 2023

MphayaNER: Named Entity Recognition for Tshivenda.
Proceedings of the 4th Workshop on African Natural Language Processing, 2023


2022
Why is this an anomaly? Explaining anomalies using sequential explanations.
Pattern Recognit., 2022

Conversational Pattern Mining using Motif Detection.
CoRR, 2022

A Framework for Undergraduate Data Collection Strategies for Student Support Recommendation Systems in Higher Education.
CoRR, 2022

Comparing Synthetic Tabular Data Generation Between a Probabilistic Model and a Deep Learning Model for Education Use Cases.
CoRR, 2022

Findings of the WMT'22 Shared Task on Large-Scale Machine Translation Evaluation for African Languages.
Proceedings of the Seventh Conference on Machine Translation, 2022


Semi-supervised learning approaches for predicting South African political sentiment for local government elections.
Proceedings of the dg.o 2022: The 23rd Annual International Conference on Digital Government Research, Virtual Event, Republic of Korea, June 15, 2022

Reinforcement Learning in Education: A Multi-armed Bandit Approach.
Proceedings of the Emerging Technologies for Developing Countries, 2022

2021
Data curation during a pandemic and lessons learned from COVID-19.
Nat. Comput. Sci., 2021

Digital forensics supported by machine learning for the detection of online sexual predatory chats.
Digit. Investig., 2021

NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation.
CoRR, 2021

Training Cross-Lingual embeddings for Setswana and Sepedi.
CoRR, 2021


Practical Approach on Implementation of WordNets for South African Languages.
Proceedings of the 11th Global Wordnet Conference, 2021

Transformer-based Machine Translation for Low-resourced Languages embedded with Language Identification.
Proceedings of the Conference on Information Communications Technology and Society, 2021

Towards Financial Sentiment Analysis in a South African Landscape.
Proceedings of the Machine Learning and Knowledge Extraction, 2021

Is it Fake? News Disinformation Detection on South African News Websites.
Proceedings of the 2021 IEEE AFRICON, 2021

Call Centre Shift Schedule Optimisation using Local Search Heuristics.
Proceedings of the 2021 IEEE AFRICON, 2021

Extracting and categorising the reactions to COVID-19 by the South African public - A social media study.
Proceedings of the 2021 IEEE AFRICON, 2021

Investigating Statistical and Machine Learning Techniques to Improve the Credit Approval Process in Developing Countries.
Proceedings of the 2021 IEEE AFRICON, 2021

An empirical investigation into audio pipeline approaches for classifying bird species.
Proceedings of the 2021 IEEE AFRICON, 2021

2020
Use of Available Data To Inform The COVID-19 Outbreak in South Africa: A Case Study.
Data Sci. J., 2020

Participatory Research for Low-resourced Machine Translation: A Case Study in African Languages.
CoRR, 2020

AI4D - African Language Dataset Challenge.
CoRR, 2020

Mapping the South African health landscape in response to COVID-19.
CoRR, 2020

Low resource language dataset creation, curation and classification: Setswana and Sepedi - Extended Abstract.
CoRR, 2020

Investigating similarities and differences between South African and Sierra Leonean school outcomes using Machine Learning.
CoRR, 2020


Investigating an approach for low resource language dataset creation, curation and classification: Setswana and Sepedi.
Proceedings of the 1st AfricaNLP Workshop Proceedings, 2020

Unsupervised Anomaly Detection of Healthcare Providers Using Generative Adversarial Networks.
Proceedings of the Responsible Design, Implementation and Use of Information and Communication Technology, 2020


Improving Short Text Classification Through Global Augmentation Methods.
Proceedings of the Machine Learning and Knowledge Extraction, 2020

2019
Predicting Road Traffic Accident Severity using Accident Report Data in South Africa.
Proceedings of the 20th Annual International Conference on Digital Government Research, 2019

2018
Exploring data science for public good in South Africa: evaluating factors that lead to success.
Proceedings of the 19th Annual International Conference on Digital Government Research: Governance in the Data Age, 2018

2017
Employment relations: a data driven analysis of job markets using online job boards and online professional networks.
Proceedings of the International Conference on Web Intelligence, 2017

Semi-supervised probabilistics approach for normalising informal short text messages.
Proceedings of the 2017 Conference on Information Communication Technology and Society (ICTAS), 2017

A Critical and Systemic Consideration of Data for Sustainable Development in Africa.
Proceedings of the Information and Communication Technologies for Development, 2017

Bringing sequential feature explanations to life.
Proceedings of the IEEE AFRICON 2017, Cape Town, South Africa, September 18-20, 2017, 2017

2016
Unsupervised learning for robust Bitcoin fraud detection.
Proceedings of the 2016 Information Security for South Africa, 2016

A Multifaceted Approach to Bitcoin Fraud Detection: Global and Local Outliers.
Proceedings of the 15th IEEE International Conference on Machine Learning and Applications, 2016

2015
Privacy in mining crime data from social Media: A South African perspective.
Proceedings of the 2015 Second International Conference on Information Security and Cyber Forensics, 2015

2014
Quantifying Uncertainty in Batch Personalized Sequential Decision Making.
Proceedings of the Modern Artificial Intelligence for Health Analytics, 2014

2013
An Ensemble of Linearly Combined Reinforcement-Learning Agents.
Proceedings of the Late-Breaking Developments in the Field of Artificial Intelligence, 2013

2011
Apprenticeship Learning About Multiple Intentions.
Proceedings of the 28th International Conference on Machine Learning, 2011

Scratchable Devices: User-Friendly Programming for Household Appliances.
Proceedings of the Human-Computer Interaction. Towards Mobile and Intelligent Interaction Environments, 2011

2008
Social Learning Methods in Board Games.
CoRR, 2008

An Intelligent Multi-Agent Recommender System for Human Capacity Building.
CoRR, 2008

Introduction to Relational Networks for Classification.
CoRR, 2008

Social Learning methods in board game agents.
Proceedings of the 2008 IEEE Symposium on Computational Intelligence and Games, 2008

2007
Autoencoder, Principal Component Analysis and Support Vector Regression for Data Imputation.
CoRR, 2007

