Building Machine Translation (MT) systems for low-resource languages remains challenging. For many language pairs, parallel data are not widely available, and in such cases MT models do not achieve results comparable to those seen with high-resource languages. When data are scarce, it is of paramount importance to make optimal use of the limited material available. To that end, in this paper we propose employing the same parallel sentences multiple times, only changing the way the words are split each time. For this purpose we use several Byte Pair Encoding models, with various merge operations used in their configuration. In our experiments, we use this technique to expand the available data and improve an MT system involving a low...
The goal of neural machine translation (NMT) is to build an end-to-end system that automatically tr...
With the advent of deep neural networks in recent years, Neural Machine Translation (NMT) systems ha...
192 p.Modern machine translation relies on strong supervision in the form of parallel corpora. Such ...
Building Machine Translation (MT) systems for low-resource languages remains challenging. For many l...
Thesis (Master's)--University of Washington, 2018In an emergency, machine translation systems can be...
There are several approaches for improving neural machine translation for low-resource languages: mo...
2019-02-14We provide new tools and techniques for improving machine translation for low-resource lan...
Thai is a low-resource language, so it is often the case that data is not available in sufficient qu...
Machine translation needs a large number of parallel sentence pairs to make sure of having a good tr...
Previously, statistical machine translation (SMT) models have been estimated from parallel corpora, ...
We discuss a previously proposed method for augmenting parallel corpora of limited size for the purp...
In the context of neural machine translation, data augmentation (DA) techniques may be used for gene...
Neural machine translation (NMT) is often described as ‘data hungry’ as it typically requires large ...
In this paper, we present a new hybridization approach consisting of enriching the phrase table of a...
In the past few decades machine translation research has made major progress. A researcher now has a...
The goal of neural machine translation (NMT) is to build an end-to-end system that automatically tr...
With the advent of deep neural networks in recent years, Neural Machine Translation (NMT) systems ha...
192 p.Modern machine translation relies on strong supervision in the form of parallel corpora. Such ...
Building Machine Translation (MT) systems for low-resource languages remains challenging. For many l...
Thesis (Master's)--University of Washington, 2018In an emergency, machine translation systems can be...
There are several approaches for improving neural machine translation for low-resource languages: mo...
2019-02-14We provide new tools and techniques for improving machine translation for low-resource lan...
Thai is a low-resource language, so it is often the case that data is not available in sufficient qu...
Machine translation needs a large number of parallel sentence pairs to make sure of having a good tr...
Previously, statistical machine translation (SMT) models have been estimated from parallel corpora, ...
We discuss a previously proposed method for augmenting parallel corpora of limited size for the purp...
In the context of neural machine translation, data augmentation (DA) techniques may be used for gene...
Neural machine translation (NMT) is often described as ‘data hungry’ as it typically requires large ...
In this paper, we present a new hybridization approach consisting of enriching the phrase table of a...
In the past few decades machine translation research has made major progress. A researcher now has a...
The goal of neural machine translation (NMT) is to build an end-to-end system that automatically tr...
With the advent of deep neural networks in recent years, Neural Machine Translation (NMT) systems ha...
192 p.Modern machine translation relies on strong supervision in the form of parallel corpora. Such ...