Mini Dissertation (MIT (Big Data Science))--University of Pretoria, 2022.South Africa has eleven official languages and amongst the eleven languages only 9 languages are local low-resourced languages. As a result, it is essential to build the resources for these languages so that they can benefit from advances in the field of natural language processing. In this project, the focus was to create annotated datasets for the isiZulu and siSwati local languages based on news topic classification tasks and present the findings from these baseline classification models. Due to the shortage of data for these local South African languages, the datasets that were created were augmented and oversampled to increase data size and overcome class c...
The paper describes the University of Cape Town's submission to the constrained track of the WMT22 S...
Thesis (MSc)--Stellenbosch University, 2021.ENGLISH ABSTRACT: The majority of African languages have...
The paper describes the University of Cape Town's submission to the constrained track of the WMT22 S...
Mini Dissertation (MIT (Big Data Science))--University of Pretoria, 2023.It was researched whether a...
Data statement of the WordNets for South Africa languages Data set name: WordNets for South Afri...
IsiZulu News (articles and headlines) and Siswati News (headlines) Corpora - za-isizulu-siswati-news...
There are over 7000 languages spoken on earth, but many of these languages suffer from a dearth of n...
Language models are the foundation of current neural network-based models for natural language under...
Thesis (PhD)--Stellenbosch University, 2018.ENGLISH ABSTRACT: Code-switching refers to natural, spon...
Over the past five years neural network models have been successful across a range of computational ...
This article presents results from a study that developed and tested a word embedding trained on a d...
This article presents results from a study that developed and tested a word embedding trained on a d...
The South African Gov-ZA multilingual corpus ============================== Github: https://github.c...
Correct spelling contributes to good content accessibility and readability for textual documents. Ho...
Correct spelling contributes to good content accessibility and readability for textual documents. Ho...
The paper describes the University of Cape Town's submission to the constrained track of the WMT22 S...
Thesis (MSc)--Stellenbosch University, 2021.ENGLISH ABSTRACT: The majority of African languages have...
The paper describes the University of Cape Town's submission to the constrained track of the WMT22 S...
Mini Dissertation (MIT (Big Data Science))--University of Pretoria, 2023.It was researched whether a...
Data statement of the WordNets for South Africa languages Data set name: WordNets for South Afri...
IsiZulu News (articles and headlines) and Siswati News (headlines) Corpora - za-isizulu-siswati-news...
There are over 7000 languages spoken on earth, but many of these languages suffer from a dearth of n...
Language models are the foundation of current neural network-based models for natural language under...
Thesis (PhD)--Stellenbosch University, 2018.ENGLISH ABSTRACT: Code-switching refers to natural, spon...
Over the past five years neural network models have been successful across a range of computational ...
This article presents results from a study that developed and tested a word embedding trained on a d...
This article presents results from a study that developed and tested a word embedding trained on a d...
The South African Gov-ZA multilingual corpus ============================== Github: https://github.c...
Correct spelling contributes to good content accessibility and readability for textual documents. Ho...
Correct spelling contributes to good content accessibility and readability for textual documents. Ho...
The paper describes the University of Cape Town's submission to the constrained track of the WMT22 S...
Thesis (MSc)--Stellenbosch University, 2021.ENGLISH ABSTRACT: The majority of African languages have...
The paper describes the University of Cape Town's submission to the constrained track of the WMT22 S...