Compound splitting is an important problem in many NLP applications which must be solved in order to address issues of data sparsity. Previous work has shown that linguistic approaches for German compound splitting produce a correct splitting more often, but corpus-driven approaches work best for phrase-based statistical machine translation from German to English, a worrisome contradiction. We address this situation by combining linguistic analysis with corpus-driven statistics, obtaining better results both in producing splittings according to a gold standard and in statistical machine translation performance.
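To make the corpus-driven side of this concrete, the following is a minimal sketch of frequency-based splitting, assuming the common heuristic of scoring each candidate split by the geometric mean of its parts' corpus frequencies. The abstract does not state which scoring function is used, so the heuristic, the toy `FREQ` table, and the function names `candidate_splits`, `score`, and `split_compound` are all illustrative assumptions. The dictionary lookup in `candidate_splits` is only a stand-in for the linguistic analysis, which in a combined approach would decide which splits are admissible.

```python
from math import prod

# Hypothetical corpus frequencies (toy values, not taken from any real corpus).
FREQ = {
    "apfel": 120,
    "baum": 300,
    "apfelbaum": 5,
    "haus": 500,
    "tür": 200,
    "haustür": 400,
}


def candidate_splits(word, min_len=3):
    """Yield the unsplit word plus every two-part split whose parts are known.

    The membership test against FREQ is a placeholder for a morphological
    analyzer that would license or reject candidate splits.
    """
    yield (word,)
    for i in range(min_len, len(word) - min_len + 1):
        left, right = word[:i], word[i:]
        if left in FREQ and right in FREQ:
            yield (left, right)


def score(parts):
    """Geometric mean of the parts' frequencies (assumed scoring heuristic)."""
    return prod(FREQ.get(p, 0) for p in parts) ** (1.0 / len(parts))


def split_compound(word):
    """Return the highest-scoring candidate as a tuple of lowercase parts."""
    return max(candidate_splits(word.lower()), key=score)


print(split_compound("Apfelbaum"))
print(split_compound("Haustür"))
```

With these toy counts the rare compound "Apfelbaum" is split because its parts are far more frequent than the whole, while the frequent compound "Haustür" is left intact. This sketch handles only two-part splits and ignores linking elements such as the German Fugen-s.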
Unlike the English language, languages such as German, Dutch, the Scandinavian languages or Greek fo...
Compounds pose a problem for applications that rely on precise word alignments such as bilingual ter...
Statistical Machine Translation from English to German is challenging due to the morphological rich...
In this article we investigate statistical machine translation (SMT) into Germanic languages, with a...
In this thesis I explore how compound processing can be used to improve phrase-based statistical mac...
The paper presents an approach to morphological compound splitting that takes the degree of composit...
In this work, we present a novel compound splitting method for German by capturing the compound prod...
Compounding is present in a large variety of languages in different proportions. Compound rate in th...
Traditionally, compound splitters are evaluated intrinsically on gold-standard data or extrinsically...
In this article, compound processing for translation into German in a factored statistical MT syste...
In this thesis I aim to improve phrase-based statistical machine translation (PBSMT) in a number of ...
Lexical resources like GermaNet offer compound lists of reasonable size. These lists can be used as ...