While the intuition that morphological preprocessing of languages in various applications can be beneficial appears to be often true, especially in the case of morphologically richer languages, it is not always the case. Previous work on translation between Nordic languages, including the morphologically rich Finnish, found that morphological analysis and preprocessing actually led to a decrease in translation quality below that of the unprocessed baseline. In this paper we investigate the proposition that the effect on translation quality depends on the kind of morphological preprocessing; and in particular that a specific kind of morphological preprocessing before translation could improve translation quality, a preprocessing that first t...
Abstract We propose a language-independent approach for improving statistical machine translation fo...
Morphological analysis is often used during preprocessing in Statistical Machine Trans-lation. Exist...
Most natural language applications have some degree of preprocessing of data: tokenisation, stemming...
Machine translation systems often incorporate modeling assumptions motivated by properties of the la...
In the framework of statistical machine translation (SMT), correspondences between the words in the ...
Various experiments from literature suggest that in statistical machine translation (SMT), applying ...
Abstract. This paper describes the integration of morpho-syntactic information in phrase-based and s...
Translating into morphologically rich languages is difficult. Although the coverage of lemmas may...
We present a novel morphological analysis technique which induces a morphological and syntactic sym...
In statistical machine translation, estimating word-to-word alignment probabilities for the translat...
We have investigated the potential for improvement in target language morphology when translating in...
We have investigated the potential for improvement in target language morphology when translating in...
We address the problem of translating from morphologically poor to morphologically rich languages by...
We propose a novel pipeline for translation into morphologically rich languages which consists of tw...
We tried to cope with the complex morphology of Turkish by applying different schemes of morphologic...
Abstract We propose a language-independent approach for improving statistical machine translation fo...
Morphological analysis is often used during preprocessing in Statistical Machine Trans-lation. Exist...
Most natural language applications have some degree of preprocessing of data: tokenisation, stemming...
Machine translation systems often incorporate modeling assumptions motivated by properties of the la...
In the framework of statistical machine translation (SMT), correspondences between the words in the ...
Various experiments from literature suggest that in statistical machine translation (SMT), applying ...
Abstract. This paper describes the integration of morpho-syntactic information in phrase-based and s...
Translating into morphologically rich languages is difficult. Although the coverage of lemmas may...
We present a novel morphological analysis technique which induces a morphological and syntactic sym...
In statistical machine translation, estimating word-to-word alignment probabilities for the translat...
We have investigated the potential for improvement in target language morphology when translating in...
We have investigated the potential for improvement in target language morphology when translating in...
We address the problem of translating from morphologically poor to morphologically rich languages by...
We propose a novel pipeline for translation into morphologically rich languages which consists of tw...
We tried to cope with the complex morphology of Turkish by applying different schemes of morphologic...
Abstract We propose a language-independent approach for improving statistical machine translation fo...
Morphological analysis is often used during preprocessing in Statistical Machine Trans-lation. Exist...
Most natural language applications have some degree of preprocessing of data: tokenisation, stemming...