Statistical machine translation systems are usually trained on large amounts of bilingual text and of monolingual text in the target language. In this paper, we will present a self-training approach, which additionally explores the use of monolingual source text, namely the documents to be translated, to improve the system performance. An initial version of the translation system is used to translate the source text. Among the generated translations, target sentences of low quality are automatically identified and discarded. The reliable translations together with their sources are then used as a new bilingual corpus for training an additional phrase translation model. Thus, the translation system can be adapted to the new source data even ...
In the past few decades machine translation research has made major progress. A researcher now has a...
Statistical machine translation (SMT) has emerged as the currently most promising approach for machi...
Parallel data scarcity problem is a major challenge faced by Statistical Machine Translation (SMT). ...
Statistical machine translation systems are usually trained on large amounts of bilingual text and o...
Statistical machine translation systems are usually trained on large amounts of bilingual text (used...
Domain adaptation has recently gained interest in statistical machine translation to cope with the p...
Traditionally, statistical machine translation systems have relied on parallel bi-lingual data to tr...
Statistical machine translation systems are usually trained on large amounts of bilingual text and m...
We use target-side monolingual data to extend the vocabulary of the translation model in statistical...
Abstract tra Statistical machine translation systems are usually trained on large amounts of bilingu...
This paper proposes to use monolingual collocations to improve Statistical Ma-chine Translation (SMT...
This paper proposes a method using the ex-isting Rule-based Machine Translation (RBMT) system as a b...
We investigate how to improve bilingual embedding which has been successfully used as a feature in p...
We investigate how to improve bilingual embedding which has been successfully used as a feature in p...
Abstract. Statistical Machine Translation (SMT) systems are usually trained on large amounts of bili...
In the past few decades machine translation research has made major progress. A researcher now has a...
Statistical machine translation (SMT) has emerged as the currently most promising approach for machi...
Parallel data scarcity problem is a major challenge faced by Statistical Machine Translation (SMT). ...
Statistical machine translation systems are usually trained on large amounts of bilingual text and o...
Statistical machine translation systems are usually trained on large amounts of bilingual text (used...
Domain adaptation has recently gained interest in statistical machine translation to cope with the p...
Traditionally, statistical machine translation systems have relied on parallel bi-lingual data to tr...
Statistical machine translation systems are usually trained on large amounts of bilingual text and m...
We use target-side monolingual data to extend the vocabulary of the translation model in statistical...
Abstract tra Statistical machine translation systems are usually trained on large amounts of bilingu...
This paper proposes to use monolingual collocations to improve Statistical Ma-chine Translation (SMT...
This paper proposes a method using the ex-isting Rule-based Machine Translation (RBMT) system as a b...
We investigate how to improve bilingual embedding which has been successfully used as a feature in p...
We investigate how to improve bilingual embedding which has been successfully used as a feature in p...
Abstract. Statistical Machine Translation (SMT) systems are usually trained on large amounts of bili...
In the past few decades machine translation research has made major progress. A researcher now has a...
Statistical machine translation (SMT) has emerged as the currently most promising approach for machi...
Parallel data scarcity problem is a major challenge faced by Statistical Machine Translation (SMT). ...