Neural Machine Translation (NMT) models have achieved remarkable performance on translating between high resource languages. However, translation quality for languages with limited data is much worse. This research focuses on the low resource language of Sepedi and considers two data augmentation techniques to increase the size and diversity of English-Sepedi corpora for training an NMT model. First we consider backtranslation, which makes use of the larger amount of available monolingual Sepedi text. We train a reverse (Sepedi to English) model and generate synthetic English sentences from the monolingual Sepedi sentences. These synthetic translations examples are added to the parallel English-Sepedi sentences. We carry out various experim...
Some natural languages belong to the same family or share similar syntactic and/or semantic regulari...
2019-02-14We provide new tools and techniques for improving machine translation for low-resource lan...
Neural Machine Translation (NMT) has been shown to be more effective in translation tasks compared t...
Telugu is the fifteenth most commonly spoken language in the world with an estimated reach of 75 mil...
With the advent of deep neural networks in recent years, Neural Machine Translation (NMT) systems ha...
Neural machine translation (NMT), where neural networks are used to generate translations, has revol...
There are several approaches for improving neural machine translation for low-resource languages: mo...
The paper describes the University of Cape Town's submission to the constrained track of the WMT22 S...
Neural machine translation (NMT) is often described as ‘data hungry’ as it typically requires large ...
The goal of neural machine translation (NMT) is to build an end-to-end system that automatically tr...
The quality of a Neural Machine Translation system depends substantially on the availability of siza...
Data sparsity is one of the challenges for low-resource language pairs in Neural Machine Translation...
In the context of neural machine translation, data augmentation (DA) techniques may be used for gene...
Thesis (MSc)--Stellenbosch University, 2021.ENGLISH ABSTRACT: The majority of African languages have...
Machine translation (MT) has benefited from using synthetic training data originating from translati...
Some natural languages belong to the same family or share similar syntactic and/or semantic regulari...
2019-02-14We provide new tools and techniques for improving machine translation for low-resource lan...
Neural Machine Translation (NMT) has been shown to be more effective in translation tasks compared t...
Telugu is the fifteenth most commonly spoken language in the world with an estimated reach of 75 mil...
With the advent of deep neural networks in recent years, Neural Machine Translation (NMT) systems ha...
Neural machine translation (NMT), where neural networks are used to generate translations, has revol...
There are several approaches for improving neural machine translation for low-resource languages: mo...
The paper describes the University of Cape Town's submission to the constrained track of the WMT22 S...
Neural machine translation (NMT) is often described as ‘data hungry’ as it typically requires large ...
The goal of neural machine translation (NMT) is to build an end-to-end system that automatically tr...
The quality of a Neural Machine Translation system depends substantially on the availability of siza...
Data sparsity is one of the challenges for low-resource language pairs in Neural Machine Translation...
In the context of neural machine translation, data augmentation (DA) techniques may be used for gene...
Thesis (MSc)--Stellenbosch University, 2021.ENGLISH ABSTRACT: The majority of African languages have...
Machine translation (MT) has benefited from using synthetic training data originating from translati...
Some natural languages belong to the same family or share similar syntactic and/or semantic regulari...
2019-02-14We provide new tools and techniques for improving machine translation for low-resource lan...
Neural Machine Translation (NMT) has been shown to be more effective in translation tasks compared t...