This paper describes the methods behind the systems submitted by the University of Groningen for the WMT 2020 Unsupervised Machine Translation task for German--Upper Sorbian. We investigate the usefulness of data selection in the unsupervised setting. We find that we can perform data selection using a pretrained model and show that the quality of a set of sentences or documents can have a great impact on the performance of the UNMT system trained on it. Furthermore, we show that document-level data selection should be preferred for training the XLM model when possible. Finally, we show that there is a trade-off between quality and quantity of the data used to train UNMT systems
We address the task of automatically distinguishing between human-translated (HT) and machine transl...
Machine translation is the task of automatically translating a text from one natural language into a...
Neural Machine Translation (NMT) has achieved promising results comparable with Phrase-Based Statist...
This paper describes the methods behind the systems submitted by the University of Groningen for the...
This paper describes the methods behind the systems submitted by the University of Groningen for the...
This paper describes the methods behind the systems submitted by the University of Groningen for the...
Thesis (Ph.D.)--University of Washington, 2014Machine translation, the computerized translation of o...
Unsupervised Machine Translation hasbeen advancing our ability to translatewithout parallel data, bu...
Data selection is a process used in selecting a subset of parallel data for the training of machine...
This paper presents the results of the premier shared task organized alongside the Conference on Ma...
With the rapid development of machine translation (MT), the MT evaluation becomes very important to ...
Statistical machine translation is an approach dependent particularly on huge amount of parallel bil...
We address the task of automatically distinguishing between human-translated (HT) and machine transl...
Machine translation is the task of automatically translating a text from one natural language into a...
Neural Machine Translation (NMT) has achieved promising results comparable with Phrase-Based Statist...
This paper describes the methods behind the systems submitted by the University of Groningen for the...
This paper describes the methods behind the systems submitted by the University of Groningen for the...
This paper describes the methods behind the systems submitted by the University of Groningen for the...
Thesis (Ph.D.)--University of Washington, 2014Machine translation, the computerized translation of o...
Unsupervised Machine Translation hasbeen advancing our ability to translatewithout parallel data, bu...
Data selection is a process used in selecting a subset of parallel data for the training of machine...
This paper presents the results of the premier shared task organized alongside the Conference on Ma...
With the rapid development of machine translation (MT), the MT evaluation becomes very important to ...
Statistical machine translation is an approach dependent particularly on huge amount of parallel bil...
We address the task of automatically distinguishing between human-translated (HT) and machine transl...
Machine translation is the task of automatically translating a text from one natural language into a...
Neural Machine Translation (NMT) has achieved promising results comparable with Phrase-Based Statist...