Telugu is the fifteenth most commonly spoken language in the world with an estimated reach of 75 million people in the Indian subcontinent. At the same time, it is a severely low resourced language. In this paper, we present work on English–Telugu general domain machine translation (MT) systems using small amounts of parallel data. The baseline statistical (SMT) and neural MT (NMT) systems do not yield acceptable translation quality, mostly due to limited resources. However, the use of synthetic parallel data (generated using back translation, based on an NMT engine) significantly improves translation quality and allows NMT to outperform SMT. We extend back translation and propose a new, iterative data augmentation (IDA) method. Filtering o...
A prerequisite for training corpus-based machine translation (MT) systems – either Statistical MT (S...
The quality of a Neural Machine Translation system depends substantially on the availability of siza...
In cross-language information retrieval (CLIR), the neural machine translation (NMT) plays a vital r...
Telugu is the fifteenth most commonly spoken language in the world with an estimated reach of 75 mil...
Most Indian languages lack sufficient parallel data for Machine Translation (MT) training. In this s...
Machine Translation bridges communication barriers and eases interaction among people having differe...
Introduction of deep neural networks to the machine translation research ameliorated conventional ma...
2019-02-14We provide new tools and techniques for improving machine translation for low-resource lan...
A prerequisite for training corpus-based machine translation (MT) systems – either Statistical MT (S...
Neural machine translation (NMT) has been a mainstream method for the machine translation (MT) task....
We consider a low-resource translation task from Finnish into Northern Sámi. Collecting all availabl...
Phrase-based statistical machine translation (PB-SMT) has been the dominant paradigm in machine tran...
A large percentage of the world’s population speaks a language of the Indian subcontinent, what we w...
Neural machine translation (NMT), where neural networks are used to generate translations, has revol...
In this paper, we investigate the effectiveness of training a multimodal neural machine translation ...
A prerequisite for training corpus-based machine translation (MT) systems – either Statistical MT (S...
The quality of a Neural Machine Translation system depends substantially on the availability of siza...
In cross-language information retrieval (CLIR), the neural machine translation (NMT) plays a vital r...
Telugu is the fifteenth most commonly spoken language in the world with an estimated reach of 75 mil...
Most Indian languages lack sufficient parallel data for Machine Translation (MT) training. In this s...
Machine Translation bridges communication barriers and eases interaction among people having differe...
Introduction of deep neural networks to the machine translation research ameliorated conventional ma...
2019-02-14We provide new tools and techniques for improving machine translation for low-resource lan...
A prerequisite for training corpus-based machine translation (MT) systems – either Statistical MT (S...
Neural machine translation (NMT) has been a mainstream method for the machine translation (MT) task....
We consider a low-resource translation task from Finnish into Northern Sámi. Collecting all availabl...
Phrase-based statistical machine translation (PB-SMT) has been the dominant paradigm in machine tran...
A large percentage of the world’s population speaks a language of the Indian subcontinent, what we w...
Neural machine translation (NMT), where neural networks are used to generate translations, has revol...
In this paper, we investigate the effectiveness of training a multimodal neural machine translation ...
A prerequisite for training corpus-based machine translation (MT) systems – either Statistical MT (S...
The quality of a Neural Machine Translation system depends substantially on the availability of siza...
In cross-language information retrieval (CLIR), the neural machine translation (NMT) plays a vital r...