Knowledge about derivational morphology has proven useful for a number of natural language processing (NLP) tasks. We describe the construction and evaluation of DERIVBASE.HR, a large-coverage morphological resource for Croatian. DERIVBASE.HR groups 100k lemmas from the web corpus hrWaC into 56k clusters of derivationally related lemmas, so-called derivational families. We focus on suffixal derivation between and within nouns, verbs, and adjectives. We propose two approaches: an unsupervised approach and a knowledge-based approach that relies on a hand-crafted morphology model but uses no additional lexico-semantic resources. The resource acquisition procedure consists of three steps: corpus preprocessing, acquisition of an inflectiona...
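To make the notion of a derivational family concrete, the following is a minimal sketch of how lemmas could be grouped once suffix-substitution rules are available. It is not the DERIVBASE.HR procedure: the rule list `RULES`, the function name `derivational_families`, and the toy lexicon are all assumptions made for illustration, and a real morphology model would also need part-of-speech constraints and stem alternations.

```python
from collections import defaultdict

# Hypothetical suffix-substitution rules (base suffix -> derived suffix).
# These two rules are illustrative only, not taken from the resource.
RULES = [("ati", "anje"), ("ati", "ac")]


def derivational_families(lemmas, rules=RULES):
    """Group lemmas into families by linking rule-related pairs (union-find)."""
    lemma_set = set(lemmas)
    parent = {lemma: lemma for lemma in lemma_set}

    def find(x):
        # Follow parent links to the representative, compressing the path.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    def union(a, b):
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_b] = root_a

    for lemma in lemma_set:
        for base_suffix, derived_suffix in rules:
            if lemma.endswith(base_suffix):
                candidate = lemma[: -len(base_suffix)] + derived_suffix
                # Only link the pair if the derived form is an attested lemma.
                if candidate in lemma_set:
                    union(lemma, candidate)

    families = defaultdict(set)
    for lemma in lemma_set:
        families[find(lemma)].add(lemma)
    return [sorted(family) for family in families.values()]


if __name__ == "__main__":
    toy_lexicon = ["pisati", "pisanje", "pisac", "čitati", "čitanje", "kuća"]
    for family in derivational_families(toy_lexicon):
        print(family)
```

In this sketch, a lemma that no rule connects to any other lemma ends up in a singleton family, which is one way a lexicon can yield many more clusters than a reader might expect from the number of rules.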
Derivational morphology proposes meaningful connections between words and is largely unrepresented i...
In this paper, the authors present NooJ morphological grammars for recognizing Croatian diminutive ...
The paper introduces the DeriNet lexical database, which includes more than 969,000 Czech words inte...
This paper presents experiments for enlarging the Croatian Morphological Lexicon by applying an auto...
In a morphological lexicon, each entry combines a lemma with a specific inflection class, often de...
The aim of this paper is to describe an efficient tool (I PAR) for a supervised and semi-automatic e...
Morphological analysis is a prerequisite for many natural language processing tasks. For inflectiona...
This paper describes a methodology for automatic morphological generation and analysis using...
Although it has long been an under-researched topic in the field of applied linguistics, morphologic...
Traditionally produced lexical resources for Serbo-Croatian are not suitable f...
This paper describes methods used for generating a morphological lexicon of organization entity name...
The main objective of this paper is to detect and describe major derivational processes and affixes ...
The computational linguistics world is gradually focusing its interest on researching and buildin...
This paper deals with semi-automatic extension of CroDeriV with verb valency frames. CroDeriV is a ...
Since Croatian is a highly flective language, there is a need for morphological normalization of natu...