Compounding occurs in a wide variety of languages, in differing proportions. The rate of compounds in a text depends on the language, but also on the genre and the domain. Scientific and technical texts are especially conducive to compounding, even in languages that are not traditionally regarded as highly compounding. In this article we address compound splitting of specialized terms. We propose a multilingual method for compound recognition and splitting that uses corpus frequencies, lexical data and, optionally, linguistic rules. The method is supervised and requires a small number of segmented compounds as input. We evaluate the method on two languages that rarely serve as material for automatic splitting systems...
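To give a concrete idea of what frequency-based compound splitting can look like, here is a minimal sketch. It is not the method proposed in the article: it is a generic baseline that enumerates segmentations whose parts are all attested in a corpus frequency table and keeps the one with the highest geometric mean of part frequencies. The function name `split_compound`, the parameters `min_len` and `linkers`, the German example and all counts in `toy_freq` are assumptions made up for illustration; the supervised component and the lexical data mentioned in the abstract are not modeled.

```python
from math import prod


def split_compound(word, freq, min_len=3, linkers=("", "s")):
    """Return the best-scoring segmentation of `word` as a list of parts.

    `freq` maps known lowercase words to corpus counts (hypothetical data).
    `linkers` lists linking elements that may follow a non-final part;
    the empty string means "no linking element".
    """
    word = word.lower()

    def candidates(rest):
        # The remainder itself is a known word: a valid final part.
        if rest in freq:
            yield [rest]
        # Try every split point that leaves at least `min_len` characters
        # on each side, optionally stripping a linker from the left part.
        for i in range(min_len, len(rest) - min_len + 1):
            head = rest[:i]
            for link in linkers:
                if not head.endswith(link):
                    continue
                stem = head[:len(head) - len(link)]
                if len(stem) >= min_len and stem in freq:
                    for tail in candidates(rest[i:]):
                        yield [stem] + tail

    def score(parts):
        # Geometric mean of the corpus frequencies of the parts.
        return prod(freq[p] for p in parts) ** (1.0 / len(parts))

    # Unknown words with no valid segmentation are returned unsplit.
    return max(candidates(word), key=score, default=[word])


# Made-up counts, for illustration only.
toy_freq = {"aktion": 960, "plan": 710, "akt": 224, "ion": 1, "aktionsplan": 20}
print(split_compound("Aktionsplan", toy_freq))
```

With these toy counts, the segmentation into "aktion" and "plan" outscores both the unsplit word and the over-segmented "akt" + "ion" + "plan". A rule component of the kind the abstract mentions could be approximated by adjusting `linkers` per language, though that is a design guess rather than something the source specifies.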
Compounds pose a problem for applications that rely on precise word alignments such as bilingual ter...
Multilingual terminology acquisition from comparable corpora has been attracti...
In this thesis I explore how compound processing can be used to improve phrase-based statistical mac...
Unlike the English language, languages such as German, Dutch, the Scandinavian languages or Greek fo...
We present an approach for knowledge-free and unsupervised recognition of compound nouns ...
In this article we investigate statistical machine translation (SMT) into Germanic languages, with a...
In this work, we present a novel compound splitting method for German by capturing the compound prod...
The paper presents an approach to morphological compound splitting that takes the degree of composit...
Finding a definition of compoundhood that is cross-lingually valid is a non-trivial task as shown by...
The number of specialized terms continuously grows in the documents, at a pace which is difficult t...
The terminology of any language and any domain continuously evolves and leads ...
Compound splitting is an important problem in many NLP applications which must be solved in order t...