International audienceCompounding is present in a large variety of languages in different proportions. Compound rate in the text obviously depends on the language, but also on the genre and the domain. Scientific and technical texts are especially conducive to compounding, even in the languages that are not traditionally admitted as highly compounding ones. In this article we address compound splitting of specialized terms. We propose a multi-lingual method of compound recognition and splitting, which uses corpus frequencies, lexical data and optionally linguistic rules. This is a supervised method which requires a small amount of segmented compounds as input. We evaluate the method on two languages that rarely serve as a material for autom...
This article was supported by the German Research Foundation (DFG) and the Open Access Publication F...
“Compounding explosion” in modern Russian journalese discourseThe author of the p...
In this article we investigate statistical machine translation (SMT) into Germanic languages, with a...
International audienceCompounding is present in a large variety of languages in different proportion...
In this work, we present a novel compound splitting method for German by capturing the compound prod...
The number of specialized terms continuously grows in the documents, at a pace which is difficult t...
In this work, we present a novel compound splitting method for German by capturing the compound prod...
Unlike the English language, languages such as German, Dutch, the Skandinavian languages or Greek fo...
Abstract. We present an approach for knowledge-free and unsuper-vised recognition of compound nouns ...
Compounds pose a problem for applications that rely on precise word alignments such as bilingual ter...
National audienceCompounding is a common phenomenon for many languages, especially those with a rich...
In this article we investigate statistical machine translation (SMT) into Germanic languages, with a...
Finding a definition of compoundhood that is cross-lingually valid is a non-trivial task as shown by...
The paper presents an approach to morphological compound splitting that takes the degree of composit...
In Technical Report No. 75 I proposed a method for describing compound words in Finnish. The aim in ...
This article was supported by the German Research Foundation (DFG) and the Open Access Publication F...
“Compounding explosion” in modern Russian journalese discourseThe author of the p...
In this article we investigate statistical machine translation (SMT) into Germanic languages, with a...
International audienceCompounding is present in a large variety of languages in different proportion...
In this work, we present a novel compound splitting method for German by capturing the compound prod...
The number of specialized terms continuously grows in the documents, at a pace which is difficult t...
In this work, we present a novel compound splitting method for German by capturing the compound prod...
Unlike the English language, languages such as German, Dutch, the Skandinavian languages or Greek fo...
Abstract. We present an approach for knowledge-free and unsuper-vised recognition of compound nouns ...
Compounds pose a problem for applications that rely on precise word alignments such as bilingual ter...
National audienceCompounding is a common phenomenon for many languages, especially those with a rich...
In this article we investigate statistical machine translation (SMT) into Germanic languages, with a...
Finding a definition of compoundhood that is cross-lingually valid is a non-trivial task as shown by...
The paper presents an approach to morphological compound splitting that takes the degree of composit...
In Technical Report No. 75 I proposed a method for describing compound words in Finnish. The aim in ...
This article was supported by the German Research Foundation (DFG) and the Open Access Publication F...
“Compounding explosion” in modern Russian journalese discourseThe author of the p...
In this article we investigate statistical machine translation (SMT) into Germanic languages, with a...