We tackle the problem of domain adapta-tion of Statistical Machine Translation by exploiting domain-specific data acquired by domain-focused web-crawling. We de-sign and evaluate a procedure for auto-matic acquisition of monolingual and par-allel data and their exploitation for train-ing, tuning, and testing in a phrase-based Statistical Machine Translation system. We present a strategy for using such resources depending on their availability and quan-tity supported by results of a large-scale evaluation on the domains of Natural En-vironment and Labour Legislation and two language pairs: English–French, English--Greek. The average observed increase of BLEU is substantial at 49.5 % relative.
Differences in domains of language use between training data and test data have often been reported ...
Nous avons observé depuis plusieurs années l’émergence des approches statistiques pour la traduction...
In this paper, we introduce a simple technique for incorporating domain information into a statistic...
The Author(s) 2015. This article is published with open access at Springerlink.com Abstract In this ...
In this paper, we tackle the problem of domain adaptation of statistical machine translation (SMT) b...
This paper reports on the ongoing work focused on domain adaptation of statistical machine translati...
© Cambridge University Press, 2015.Statistical machine translation (SMT) is gaining interest given t...
Domain adaptation has recently gained interest in statistical machine translation to cope with the p...
Abstract tra Statistical machine translation systems are usually trained on large amounts of bilingu...
Globalization suddenly brings many people from different country to interact with each other, requir...
Abstract. Statistical Machine Translation (SMT) systems are usually trained on large amounts of bili...
This paper describes adapting statistical machine translation (SMT) systems to medical domain using ...
A data-driven approach to model translation suffers from the data mismatch problem and demands domai...
In this thesis we develop and evaluate a general framework for domain-adaptation of statistical mach...
Machine translation research has progressed in recent years thanks to statistical machine learning m...
Differences in domains of language use between training data and test data have often been reported ...
Nous avons observé depuis plusieurs années l’émergence des approches statistiques pour la traduction...
In this paper, we introduce a simple technique for incorporating domain information into a statistic...
The Author(s) 2015. This article is published with open access at Springerlink.com Abstract In this ...
In this paper, we tackle the problem of domain adaptation of statistical machine translation (SMT) b...
This paper reports on the ongoing work focused on domain adaptation of statistical machine translati...
© Cambridge University Press, 2015.Statistical machine translation (SMT) is gaining interest given t...
Domain adaptation has recently gained interest in statistical machine translation to cope with the p...
Abstract tra Statistical machine translation systems are usually trained on large amounts of bilingu...
Globalization suddenly brings many people from different country to interact with each other, requir...
Abstract. Statistical Machine Translation (SMT) systems are usually trained on large amounts of bili...
This paper describes adapting statistical machine translation (SMT) systems to medical domain using ...
A data-driven approach to model translation suffers from the data mismatch problem and demands domai...
In this thesis we develop and evaluate a general framework for domain-adaptation of statistical mach...
Machine translation research has progressed in recent years thanks to statistical machine learning m...
Differences in domains of language use between training data and test data have often been reported ...
Nous avons observé depuis plusieurs années l’émergence des approches statistiques pour la traduction...
In this paper, we introduce a simple technique for incorporating domain information into a statistic...