Data augmentation can improve model’s final accuracy by introducing new data samples to the dataset. In this paper, text data augmentation using translation technique is investigated. Synthetic translations, generated by Opus-MT model are compared to the unique foreign data samples in terms of an impact to the trans- former network-based models’ performance. The experimental results showed that multilingual models like DistilBERT in some cases benefit from the introduction of the addition artificially created data samples presented in a foreign language
Neural machine translation (NMT) is often described as ‘data hungry’ as it typically requires large ...
Data-driven machine translation paradigms—which use machine learning to create translation models th...
A prerequisite for training corpus-based machine translation (MT) systems – either Statistical MT (S...
Data augmentation can improve model’s final accuracy by introducing new data samples to the dataset....
This paper describes FBK’s submission to the end-to-end speech translation (ST) task at IWSLT 2019. ...
The Transformer model is a very recent, fast and powerful discovery in neural machine translation. W...
Statistical machine translation systems are usually trained on large amounts of bilingual text and o...
Statistical machine translation systems are usually trained on large amounts of bilingual text and o...
Statistical machine translation (SMT) has emerged as the currently most promising approach for machi...
In Neural Machine Translation (NMT), data augmentation methods such as back-translation have proven ...
Data augmentation is a technique to generate new training data based on existing data. We evaluate t...
We train Transformer-based neural machine translation models for Hungarian-English and English-Hunga...
We use target-side monolingual data to extend the vocabulary of the translation model in statistical...
In the past few decades machine translation research has made major progress. A researcher now has a...
A prerequisite for training corpus-based machine translation (MT) systems – either Statistical MT (S...
Neural machine translation (NMT) is often described as ‘data hungry’ as it typically requires large ...
Data-driven machine translation paradigms—which use machine learning to create translation models th...
A prerequisite for training corpus-based machine translation (MT) systems – either Statistical MT (S...
Data augmentation can improve model’s final accuracy by introducing new data samples to the dataset....
This paper describes FBK’s submission to the end-to-end speech translation (ST) task at IWSLT 2019. ...
The Transformer model is a very recent, fast and powerful discovery in neural machine translation. W...
Statistical machine translation systems are usually trained on large amounts of bilingual text and o...
Statistical machine translation systems are usually trained on large amounts of bilingual text and o...
Statistical machine translation (SMT) has emerged as the currently most promising approach for machi...
In Neural Machine Translation (NMT), data augmentation methods such as back-translation have proven ...
Data augmentation is a technique to generate new training data based on existing data. We evaluate t...
We train Transformer-based neural machine translation models for Hungarian-English and English-Hunga...
We use target-side monolingual data to extend the vocabulary of the translation model in statistical...
In the past few decades machine translation research has made major progress. A researcher now has a...
A prerequisite for training corpus-based machine translation (MT) systems – either Statistical MT (S...
Neural machine translation (NMT) is often described as ‘data hungry’ as it typically requires large ...
Data-driven machine translation paradigms—which use machine learning to create translation models th...
A prerequisite for training corpus-based machine translation (MT) systems – either Statistical MT (S...