Telugu is the fifteenth most commonly spoken language in the world with an estimated reach of 75 million people in the Indian subcontinent. At the same time, it is a severely low resourced language. In this paper, we present work on English–Telugu general domain machine translation (MT) systems using small amounts of parallel data. The baseline statistical (SMT) and neural MT (NMT) systems do not yield acceptable translation quality, mostly due to limited resources. However, the use of synthetic parallel data (generated using back translation, based on an NMT engine) significantly improves translation quality and allows NMT to outperform SMT. We extend back translation and propose a new, iterative data augmentation (IDA) method. Filtering o...
This paper describes the ADAPT Centre submissions to WAT 2020 for the English-to-Odia translation ta...
We consider a low-resource translation task from Finnish into Northern Sámi. Collecting all availabl...
With the advent of deep neural networks in recent years, Neural Machine Translation (NMT) systems ha...
Telugu is the fifteenth most commonly spoken language in the world with an estimated reach of 75 mil...
A large percentage of the world’s population speaks a language of the Indian subcontinent, what we w...
Neural Machine Translation (NMT) models have achieved remarkable performance on translating between ...
Statistical machine translation (SMT) which was the dominant paradigm in machine translation (MT) re...
Neural machine translation (NMT), where neural networks are used to generate translations, has revol...
Most Indian languages lack sufficient parallel data for Machine Translation (MT) training. In this s...
This paper describes the ADAPT Centre’s submissions to the WMT20 News translation shared task for En...
Statistical machine translation (SMT) was the state-of-the-art in machine translation (MT) research ...
A prerequisite for training corpus-based machine translation (MT) systems – either Statistical MT (S...
2019-02-14We provide new tools and techniques for improving machine translation for low-resource lan...
In this paper, we investigate the effectiveness of training a multimodal neural machine translation ...
Machine translation (MT) has benefited from using synthetic training data originating from translati...
This paper describes the ADAPT Centre submissions to WAT 2020 for the English-to-Odia translation ta...
We consider a low-resource translation task from Finnish into Northern Sámi. Collecting all availabl...
With the advent of deep neural networks in recent years, Neural Machine Translation (NMT) systems ha...
Telugu is the fifteenth most commonly spoken language in the world with an estimated reach of 75 mil...
A large percentage of the world’s population speaks a language of the Indian subcontinent, what we w...
Neural Machine Translation (NMT) models have achieved remarkable performance on translating between ...
Statistical machine translation (SMT) which was the dominant paradigm in machine translation (MT) re...
Neural machine translation (NMT), where neural networks are used to generate translations, has revol...
Most Indian languages lack sufficient parallel data for Machine Translation (MT) training. In this s...
This paper describes the ADAPT Centre’s submissions to the WMT20 News translation shared task for En...
Statistical machine translation (SMT) was the state-of-the-art in machine translation (MT) research ...
A prerequisite for training corpus-based machine translation (MT) systems – either Statistical MT (S...
2019-02-14We provide new tools and techniques for improving machine translation for low-resource lan...
In this paper, we investigate the effectiveness of training a multimodal neural machine translation ...
Machine translation (MT) has benefited from using synthetic training data originating from translati...
This paper describes the ADAPT Centre submissions to WAT 2020 for the English-to-Odia translation ta...
We consider a low-resource translation task from Finnish into Northern Sámi. Collecting all availabl...
With the advent of deep neural networks in recent years, Neural Machine Translation (NMT) systems ha...