The performance of Phrase-Based Statis- tical Machine Translation (PBSMT) systems mostly depends on training data. Many papers have investigated how to cre- ate new resources in order to increase the size of the training corpus in an attempt to improve PBSMT performance. In this work, we analyse and characterize the way in which the in-domain and out-of-domain performance of PBSMT is impacted when the amount of training data increases. Two different PBSMT systems, Moses and Portage, two of the largest par- allel corpora, Giga (French-English) and UN (Chinese-English) datasets and several in- and out-of-domain test sets were used to build high quality learning curves showing consistent logarithmic growth in per- formance. These results are s...
Parallel corpus is an indispensable resource for translation model training in statistical machine t...
Statistical machine translation systems are usually trained on large amounts of bilingual text and o...
In this thesis we develop and evaluate a general framework for domain-adaptation of statistical mach...
The performance of Phrase-Based Statistical Machine Translation (PBSMT) systems mostly depends on ...
Abstract. Statistical Machine Translation (SMT) systems are usually trained on large amounts of bili...
We present an extensive experimental study of Phrase-based Statistical Machine Translation, from the...
We present an extensive experimental study of Phrase-based Statistical Machine Translation, from the...
We present an extensive experimental study of a Statistical Machine Translation system, Moses (Koehn...
Domain adaptation has recently gained interest in statistical machine translation to cope with the p...
The training data size is of utmost importance for statistical machine translation (SMT), since it a...
We present an extensive experimental study of a Statistical Machine Translation system, Moses (Koehn...
We report on findings of exploiting large data sets for translation modeling, language mod-eling and...
Abstract tra Statistical machine translation systems are usually trained on large amounts of bilingu...
We present an extensive experimental study of a Statistical Machine Translation system, Moses (Koehn...
Machine translation is the application of machines to translate text or speech from one natural lang...
Parallel corpus is an indispensable resource for translation model training in statistical machine t...
Statistical machine translation systems are usually trained on large amounts of bilingual text and o...
In this thesis we develop and evaluate a general framework for domain-adaptation of statistical mach...
The performance of Phrase-Based Statistical Machine Translation (PBSMT) systems mostly depends on ...
Abstract. Statistical Machine Translation (SMT) systems are usually trained on large amounts of bili...
We present an extensive experimental study of Phrase-based Statistical Machine Translation, from the...
We present an extensive experimental study of Phrase-based Statistical Machine Translation, from the...
We present an extensive experimental study of a Statistical Machine Translation system, Moses (Koehn...
Domain adaptation has recently gained interest in statistical machine translation to cope with the p...
The training data size is of utmost importance for statistical machine translation (SMT), since it a...
We present an extensive experimental study of a Statistical Machine Translation system, Moses (Koehn...
We report on findings of exploiting large data sets for translation modeling, language mod-eling and...
Abstract tra Statistical machine translation systems are usually trained on large amounts of bilingu...
We present an extensive experimental study of a Statistical Machine Translation system, Moses (Koehn...
Machine translation is the application of machines to translate text or speech from one natural lang...
Parallel corpus is an indispensable resource for translation model training in statistical machine t...
Statistical machine translation systems are usually trained on large amounts of bilingual text and o...
In this thesis we develop and evaluate a general framework for domain-adaptation of statistical mach...