The training data size is of utmost importance for statistical machine translation (SMT), since it affects the training time, model size, decoding speed, as well as the system’s overall success. One of the challenges for developing SMT systems for languages with less resources is the lim-ited sizes of the available training data. In this paper, we propose an approach for expanding the training data by including parallel texts from an out-of-domain corpus. Selecting the best out-of-domain sentences for inclusion in the training set is important for the overall performance of the system. Our method is based on first ranking the out-of-domain sentences using a language modeling approach, and then, including the sentences to the training set by...
We investigate different representational granularities for sub-lexical representation in statistica...
Target task matched parallel corpora are re-quired for statistical translation model train-ing. Howe...
The performance of a machine translation system (MTS) depends on the quality and size of the trainin...
Parallel corpus is an indispensable resource for translation model training in statistical machine t...
The performance of Phrase-Based Statistical Machine Translation (PBSMT) systems mostly depends on ...
Statistical machine translation relies heavily on available parallel corpora, but SMT may not have t...
Abstract. Statistical Machine Translation (SMT) systems are usually trained on large amounts of bili...
A parallel corpus plays an important role in statistical machine translation (SMT) systems. In this ...
Machine translation is the application of machines to translate text or speech from one natural lang...
2014-07-28The goal of machine translation is to translate from one natural language into another usi...
Statistical Machine Translation (SMT) is an evolving field where many techniques in Syntactic Patter...
Statistical Machine Translation (SMT) models learn how to translate by examining a bilingual paralle...
Statistical machine translation, the task of translating text from one natural language into another...
Abstract—Text corpus size is an important issue when building a language model (LM) in particular wh...
Statistical Machine Translation (SMT) systems are usually trained on large amounts of bilingual text...
We investigate different representational granularities for sub-lexical representation in statistica...
Target task matched parallel corpora are re-quired for statistical translation model train-ing. Howe...
The performance of a machine translation system (MTS) depends on the quality and size of the trainin...
Parallel corpus is an indispensable resource for translation model training in statistical machine t...
The performance of Phrase-Based Statistical Machine Translation (PBSMT) systems mostly depends on ...
Statistical machine translation relies heavily on available parallel corpora, but SMT may not have t...
Abstract. Statistical Machine Translation (SMT) systems are usually trained on large amounts of bili...
A parallel corpus plays an important role in statistical machine translation (SMT) systems. In this ...
Machine translation is the application of machines to translate text or speech from one natural lang...
2014-07-28The goal of machine translation is to translate from one natural language into another usi...
Statistical Machine Translation (SMT) is an evolving field where many techniques in Syntactic Patter...
Statistical Machine Translation (SMT) models learn how to translate by examining a bilingual paralle...
Statistical machine translation, the task of translating text from one natural language into another...
Abstract—Text corpus size is an important issue when building a language model (LM) in particular wh...
Statistical Machine Translation (SMT) systems are usually trained on large amounts of bilingual text...
We investigate different representational granularities for sub-lexical representation in statistica...
Target task matched parallel corpora are re-quired for statistical translation model train-ing. Howe...
The performance of a machine translation system (MTS) depends on the quality and size of the trainin...