Statistical machine translation systems are usually trained on large amounts of bilingual text and of monolingual text in the target language. In this paper, we will present a self-training approach which additionally explores the use of monolin-gual source text, namely the documents to be translated, to improve the system performance. An initial version of the translation system is used to translate the source text. Among the generated translations, target sentences of low quality are automatically identified and discarded. The reliable trans-lations together with their sources are then used as a new bilingual corpus for training an additional phrase translation model. Thus, the translation system can be adapted to the new source data even...
Sentence-aligned bilingual texts are a crucial resource to build statistical machine translation (SM...
This paper proposes to use monolingual collocations to improve Statistical Ma-chine Translation (SMT...
This paper proposes a method using the ex-isting Rule-based Machine Translation (RBMT) system as a b...
Statistical machine translation systems are usually trained on large amounts of bilingual text and o...
Domain adaptation has recently gained interest in statistical machine translation to cope with the p...
Traditionally, statistical machine translation systems have relied on parallel bi-lingual data to tr...
We use target-side monolingual data to extend the vocabulary of the translation model in statistical...
Statistical machine translation systems are usually trained on large amounts of bilingual text (used...
Abstract. Statistical Machine Translation (SMT) systems are usually trained on large amounts of bili...
We investigate how to improve bilingual embedding which has been successfully used as a feature in p...
We investigate how to improve bilingual embedding which has been successfully used as a feature in p...
In the past few decades machine translation research has made major progress. A researcher now has a...
Abstract tra Statistical machine translation systems are usually trained on large amounts of bilingu...
Statistical machine translation systems are usually trained on large amounts of bilingual text and m...
Monolingual data have been demonstrated to be helpful in improving translation quality of both stati...
Sentence-aligned bilingual texts are a crucial resource to build statistical machine translation (SM...
This paper proposes to use monolingual collocations to improve Statistical Ma-chine Translation (SMT...
This paper proposes a method using the ex-isting Rule-based Machine Translation (RBMT) system as a b...
Statistical machine translation systems are usually trained on large amounts of bilingual text and o...
Domain adaptation has recently gained interest in statistical machine translation to cope with the p...
Traditionally, statistical machine translation systems have relied on parallel bi-lingual data to tr...
We use target-side monolingual data to extend the vocabulary of the translation model in statistical...
Statistical machine translation systems are usually trained on large amounts of bilingual text (used...
Abstract. Statistical Machine Translation (SMT) systems are usually trained on large amounts of bili...
We investigate how to improve bilingual embedding which has been successfully used as a feature in p...
We investigate how to improve bilingual embedding which has been successfully used as a feature in p...
In the past few decades machine translation research has made major progress. A researcher now has a...
Abstract tra Statistical machine translation systems are usually trained on large amounts of bilingu...
Statistical machine translation systems are usually trained on large amounts of bilingual text and m...
Monolingual data have been demonstrated to be helpful in improving translation quality of both stati...
Sentence-aligned bilingual texts are a crucial resource to build statistical machine translation (SM...
This paper proposes to use monolingual collocations to improve Statistical Ma-chine Translation (SMT...
This paper proposes a method using the ex-isting Rule-based Machine Translation (RBMT) system as a b...