Thesis (MSc)--Stellenbosch University, 2021.ENGLISH ABSTRACT: The majority of African languages have not fully benefited from the recent advances in machine translation due to lack of data. Motivated by this challenge we leverage and compare transfer learning, multilingual learning and zero-shot learning on three Southern Bantu languages (namely isiZulu, isiXhosa and Shona) and English. We focus primarily on the English-to-isiZulu pair, since it has the smallest number of training pairs (30000 sentences), comprising just 28% of the average size of the other corpora. We demonstrate the significant importance of language similarity on English-to-isiZulu translations by comparing transfer learning and multilingual learning on the Englis...
Some natural languages belong to the same family or share similar syntactic and/or semantic regulari...
Some natural languages belong to the same family or share similar syntactic and/or semantic regulari...
African languages are spoken by over a billion people, but are underrepresented in NLP research and ...
The paper describes the University of Cape Town's submission to the constrained track of the WMT22 S...
Language models are the foundation of current neural network-based models for natural language under...
International audienceMultilingual transformer models like mBERT and XLM-RoBERTa have obtained great...
Mini Dissertation (MIT (Big Data Science))--University of Pretoria, 2023.It was researched whether a...
Neural Machine Translation (NMT) models have achieved remarkable performance on translating between ...
Thesis (PhD)--Stellenbosch University, 2018.ENGLISH ABSTRACT: Code-switching refers to natural, spon...
Over the past five years neural network models have been successful across a range of computational ...
Transfer learning has led to large gains in performance for nearly all NLP tasks while making downst...
The paper demonstrates the feasibility and scalability of participatory research, with a case study ...
This report details the author’s experiences as a Distributed Re-search Experience for Undergraduate...
Multilingual pre-trained language models (PLMs) have demonstrated impressive performance on several ...
There are over 7000 languages spoken on earth, but many of these languages suffer from a dearth of n...
Some natural languages belong to the same family or share similar syntactic and/or semantic regulari...
Some natural languages belong to the same family or share similar syntactic and/or semantic regulari...
African languages are spoken by over a billion people, but are underrepresented in NLP research and ...
The paper describes the University of Cape Town's submission to the constrained track of the WMT22 S...
Language models are the foundation of current neural network-based models for natural language under...
International audienceMultilingual transformer models like mBERT and XLM-RoBERTa have obtained great...
Mini Dissertation (MIT (Big Data Science))--University of Pretoria, 2023.It was researched whether a...
Neural Machine Translation (NMT) models have achieved remarkable performance on translating between ...
Thesis (PhD)--Stellenbosch University, 2018.ENGLISH ABSTRACT: Code-switching refers to natural, spon...
Over the past five years neural network models have been successful across a range of computational ...
Transfer learning has led to large gains in performance for nearly all NLP tasks while making downst...
The paper demonstrates the feasibility and scalability of participatory research, with a case study ...
This report details the author’s experiences as a Distributed Re-search Experience for Undergraduate...
Multilingual pre-trained language models (PLMs) have demonstrated impressive performance on several ...
There are over 7000 languages spoken on earth, but many of these languages suffer from a dearth of n...
Some natural languages belong to the same family or share similar syntactic and/or semantic regulari...
Some natural languages belong to the same family or share similar syntactic and/or semantic regulari...
African languages are spoken by over a billion people, but are underrepresented in NLP research and ...