This thesis investigates how well machine learned classifiers can identify translated text, and the effect translationese may have in Statistical Machine Translation -- all in a Swedish-to-English, and reverse, context. Translationese is a term used to describe the dialect of a target language that is produced when a source text is translated. The systems trained for this thesis are SVM-based classifiers for identifying translationese, as well as translation and language models for Statistical Machine Translation. The classifiers successfully identified translationese in relation to non-translated text, and to some extent, also what source language the texts were translated from. In the SMT experiments, variation of the translation model wa...
Statistical Machine Translation (SMT) models learn how to translate by examining a bilingual paralle...
We present an extensive experimental study of Phrase-based Statistical Machine Translation, from the...
Machine translation outputs are not correct enough to be used as it is, except for the very simplest...
This thesis investigates how well machine learned classifiers can identify translated text, and the ...
This paper presents a machine learning approach to the study of translationese. The goal is to train...
As a contribution to the on-going discussions concerning what strategy to use when approaching a new...
In this paper we describe an approach to the identification of "translationese" based on monolingual...
In this paper we describe an approach to the identification of "translationese" based on monolingual...
In this paper we describe an approach to the identification of "translationese" based on monolingual...
Machine translation can be considered a highly interdisciplinary and multidisciplinary field because...
Machine translation can be considered a highly interdisciplinary and multidisciplinary field because...
One of the difficulties statistical machine translation (SMT) systems face are differences in word o...
Statistical machine translation (SMT) is an approach to Machine Translation (MT) that uses statistic...
Statistical Machine Translation (SMT) systems are based on bilingual sentence aligned data. The qual...
In this thesis, three possible aspects of using linguistic (i.e. morpho-syntactic) knowledge for sta...
Statistical Machine Translation (SMT) models learn how to translate by examining a bilingual paralle...
We present an extensive experimental study of Phrase-based Statistical Machine Translation, from the...
Machine translation outputs are not correct enough to be used as it is, except for the very simplest...
This thesis investigates how well machine learned classifiers can identify translated text, and the ...
This paper presents a machine learning approach to the study of translationese. The goal is to train...
As a contribution to the on-going discussions concerning what strategy to use when approaching a new...
In this paper we describe an approach to the identification of "translationese" based on monolingual...
In this paper we describe an approach to the identification of "translationese" based on monolingual...
In this paper we describe an approach to the identification of "translationese" based on monolingual...
Machine translation can be considered a highly interdisciplinary and multidisciplinary field because...
Machine translation can be considered a highly interdisciplinary and multidisciplinary field because...
One of the difficulties statistical machine translation (SMT) systems face are differences in word o...
Statistical machine translation (SMT) is an approach to Machine Translation (MT) that uses statistic...
Statistical Machine Translation (SMT) systems are based on bilingual sentence aligned data. The qual...
In this thesis, three possible aspects of using linguistic (i.e. morpho-syntactic) knowledge for sta...
Statistical Machine Translation (SMT) models learn how to translate by examining a bilingual paralle...
We present an extensive experimental study of Phrase-based Statistical Machine Translation, from the...
Machine translation outputs are not correct enough to be used as it is, except for the very simplest...