The performance of a machine translation system (MTS) depends on the quality and size of its training data. Effective methods for extending the training dataset of an MTS in specific domains, so as to improve translation performance, still need to be explored. A method for selecting in-domain bilingual sentence pairs based on topic information is proposed. Using the topic relevance of bilingual sentence pairs to the target domain, subsets of sentence pairs related to the texts to be translated are selected from a large-scale bilingual corpus and used to train a domain-specific translation system, improving translation quality for in-domain texts. Through the test, the bilingual sentence pairs are selected by usin...
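The selection step described in this abstract can be illustrated with a minimal sketch. It is not the authors' method: the abstract does not say which topic model is used, so plain bag-of-words cosine similarity stands in for topic relevance here. The function names (`bag_of_words`, `cosine`, `select_in_domain`) and the `top_k` cutoff are hypothetical and chosen only for illustration.

```python
import math
from collections import Counter


def bag_of_words(text):
    """Lowercased whitespace tokens with counts (a stand-in for a topic vector)."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(count * b[word] for word, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def select_in_domain(pairs, in_domain_texts, top_k):
    """Rank (source, target) pairs by the similarity of the source side to an
    in-domain profile and keep the top_k most relevant pairs."""
    # Assumption: the domain profile is the summed word counts of the
    # in-domain texts; a real system would use topic distributions instead.
    profile = Counter()
    for text in in_domain_texts:
        profile.update(bag_of_words(text))
    scored = [(cosine(bag_of_words(src), profile), src, tgt) for src, tgt in pairs]
    scored.sort(key=lambda item: item[0], reverse=True)
    return [(src, tgt) for _, src, tgt in scored[:top_k]]
```

Called with a general-domain list of sentence pairs and a few in-domain sample texts, `select_in_domain` returns the subset that would be added to the training data. Scoring only the source side is a simplification; a bilingual variant could score both sides and combine the two values.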
Abstract. Statistical Machine Translation (SMT) systems are usually trained on large amounts of bili...
The training data size is of utmost importance for statistical machine translation (SMT), since it a...
The performance of Phrase-Based Statistical Machine Translation (PBSMT) systems mostly depends on ...
General-domain corpora are becoming increasingly available for Machine Translation (MT) systems. How...
Abstract. In statistical machine translation, the number of sentence pairs in the bilingual corpus i...
Thesis (Ph.D.)--University of Washington, 2014. Machine translation, the computerized translation of o...
Data selection has shown significant improvements in effective use of training data by extracting se...
Abstract. In phrase-based statistical machine translation system, the parameters of model are usuall...
Copyright © 2014 Longyue Wang et al. This is an open access article distributed under the Creative Co...
In the past few decades machine translation research has made major progress. A researcher now has a...
Conference of 29th Pacific Asia Conference on Language, Information and Computation, PACLIC 2015 ; C...
Statistical machine translation is an approach dependent particularly on huge amount of parallel bil...
The statistical framework has proved to be very successful in machine translation. The main reason f...
We propose and study three different novel approaches for tackling the problem of development set se...
Statistical machine translation systems are usually trained on large amounts of bilingual text and o...