The quality of a Neural Machine Translation system depends substantially on the availability of sizable parallel corpora. For low-resource language pairs this is not the case, resulting in poor translation quality. Inspired by work in computer vision, we propose a novel data augmentation approach that targets low-frequency words by generating new sentence pairs containing rare words in new, synthetically created contexts. Experimental results on simulated low-resource settings show that our method improves translation quality by up to 2.9 BLEU points over the baseline and up to 3.2 BLEU over back-translation
Telugu is the fifteenth most commonly spoken language in the world with an estimated reach of 75 mil...
Data sparsity is one of the challenges for low-resource language pairs in Neural Machine Translation...
Neural machine translation (NMT), where neural networks are used to generate translations, has revol...
2019-02-14We provide new tools and techniques for improving machine translation for low-resource lan...
Neural machine translation (NMT) is often described as ‘data hungry’ as it typically requires large ...
In the context of neural machine translation, data augmentation (DA) techniques may be used for gene...
With the advent of deep neural networks in recent years, Neural Machine Translation (NMT) systems ha...
Telugu is the fifteenth most commonly spoken language in the world with an estimated reach of 75 mil...
Neural Machine Translation has achieved state-of-the-art performance for several language pairs usin...
Existing Sequence-to-Sequence (Seq2Seq) Neural Machine Translation (NMT) shows strong capability wit...
Neural Machine Translation (NMT) models have achieved remarkable performance on translating between ...
Artificial intelligence based Machine Translation is a Natural Language Processing, attaining signif...
In Neural Machine Translation (NMT), data augmentation methods such as back-translation have proven ...
| openaire: EC/H2020/780069/EU//MeMADThere are several approaches for improving neural machine trans...
Neural machine translation (NMT) has been a mainstream method for the machine translation (MT) task....
Telugu is the fifteenth most commonly spoken language in the world with an estimated reach of 75 mil...
Data sparsity is one of the challenges for low-resource language pairs in Neural Machine Translation...
Neural machine translation (NMT), where neural networks are used to generate translations, has revol...
2019-02-14We provide new tools and techniques for improving machine translation for low-resource lan...
Neural machine translation (NMT) is often described as ‘data hungry’ as it typically requires large ...
In the context of neural machine translation, data augmentation (DA) techniques may be used for gene...
With the advent of deep neural networks in recent years, Neural Machine Translation (NMT) systems ha...
Telugu is the fifteenth most commonly spoken language in the world with an estimated reach of 75 mil...
Neural Machine Translation has achieved state-of-the-art performance for several language pairs usin...
Existing Sequence-to-Sequence (Seq2Seq) Neural Machine Translation (NMT) shows strong capability wit...
Neural Machine Translation (NMT) models have achieved remarkable performance on translating between ...
Artificial intelligence based Machine Translation is a Natural Language Processing, attaining signif...
In Neural Machine Translation (NMT), data augmentation methods such as back-translation have proven ...
| openaire: EC/H2020/780069/EU//MeMADThere are several approaches for improving neural machine trans...
Neural machine translation (NMT) has been a mainstream method for the machine translation (MT) task....
Telugu is the fifteenth most commonly spoken language in the world with an estimated reach of 75 mil...
Data sparsity is one of the challenges for low-resource language pairs in Neural Machine Translation...
Neural machine translation (NMT), where neural networks are used to generate translations, has revol...