Back translation is one of the most widely used methods for improving the performance of neural machine translation systems. Recent research has sought to enhance the effectiveness of this method by increasing the 'diversity' of the generated translations. We argue that the definitions and metrics used to quantify 'diversity' in previous work have been insufficient. This work puts forward a more nuanced framework for understanding diversity in training data, splitting it into lexical diversity and syntactic diversity. We present novel metrics for measuring these different aspects of diversity and carry out empirical analysis into the effect of these types of diversity on final neural machine translation model performance for low-resource En...
There are several approaches for improving neural machine translation for low-resource languages: mo...
The scarcity of parallel data is a major limitation for Neural Machine Translation (NMT) systems, in...
Telugu is the fifteenth most commonly spoken language in the world with an estimated reach of 75 mil...
Back translation is one of the most widely used methods for improving the performance of neural mach...
Machine translation (MT) has benefited from using synthetic training data originating from translati...
Neural machine translation systems have revolutionized translation processes in terms of quantity an...
Neural machine translation systems have revolutionized translation processes in terms of quantity an...
This work presents an empirical approach to quantifying the loss of lexical richness in Machine Tran...
A prerequisite for training corpus-based machine translation (MT) systems – either Statistical MT (S...
Neural Machine Translation has achieved state-of-the-art performance for several language pairs usin...
Some natural languages belong to the same family or share similar syntactic and/or semantic regulari...
With the advent of deep neural networks in recent years, Neural Machine Translation (NMT) systems ha...
Neural machine translation (NMT), where neural networks are used to generate translations, has revol...
Some natural languages belong to the same family or share similar syntactic and/or semantic regulari...
A prerequisite for training corpus-based machine translation (MT) systems – either Statistical MT (S...
There are several approaches for improving neural machine translation for low-resource languages: mo...
The scarcity of parallel data is a major limitation for Neural Machine Translation (NMT) systems, in...
Telugu is the fifteenth most commonly spoken language in the world with an estimated reach of 75 mil...
Back translation is one of the most widely used methods for improving the performance of neural mach...
Machine translation (MT) has benefited from using synthetic training data originating from translati...
Neural machine translation systems have revolutionized translation processes in terms of quantity an...
Neural machine translation systems have revolutionized translation processes in terms of quantity an...
This work presents an empirical approach to quantifying the loss of lexical richness in Machine Tran...
A prerequisite for training corpus-based machine translation (MT) systems – either Statistical MT (S...
Neural Machine Translation has achieved state-of-the-art performance for several language pairs usin...
Some natural languages belong to the same family or share similar syntactic and/or semantic regulari...
With the advent of deep neural networks in recent years, Neural Machine Translation (NMT) systems ha...
Neural machine translation (NMT), where neural networks are used to generate translations, has revol...
Some natural languages belong to the same family or share similar syntactic and/or semantic regulari...
A prerequisite for training corpus-based machine translation (MT) systems – either Statistical MT (S...
There are several approaches for improving neural machine translation for low-resource languages: mo...
The scarcity of parallel data is a major limitation for Neural Machine Translation (NMT) systems, in...
Telugu is the fifteenth most commonly spoken language in the world with an estimated reach of 75 mil...