Lemmatization is the process of finding the normalized form of words from surface word-forms as they appear in the running text. It is a useful pre-processing step for any number of language engineering tasks, esp. important for languages with rich inflection morphology. This paper presents two approaches to automated word lemmatization, which both use machine learning techniques to learn particular language models from pre-annotated data. One approach is based on Ripple Down Rules and the other on First-Order Decision Lists as learned by the CLog system. We have tested the two approaches on the Slovene language and set-up a generally accessible Web service for lemmatization using the generated models. 1
The model for lemmatisation of non-standard Slovenian was built with the CLASSLA-StanfordNLP tool (h...
This thesis is focused on lemmatizing of nouns and adjectives. It is based on morphology of Czech la...
The model for lemmatisation of non-standard Serbian was built with the CLASSLA-StanfordNLP tool (htt...
Lemmatization is the process of finding the normalized form of a word. It is the same as looking for...
Abstract: Lemmatisation is the process of finding the normalised forms of words appearing in text. I...
Lemmatization for languages with rich inflectional morphology is one of the basic, indispensable ste...
Abstract. Identifying the lemma of a Named Entity is important for many Natural Language Processing ...
The model for lemmatisation of non-standard Slovenian was built with the CLASSLA-StanfordNLP tool (h...
The model for lemmatisation of standard Slovenian was built with the CLASSLA-StanfordNLP tool (https...
We present a novel tool for morphological analysis of Serbian, which is a low-resource language with...
The paper presents the implementation and evaluation of a module for full lemma-tization of Croatian...
Aim of this bachelor thesis was to become familiar with the tools and methods for morphological anal...
Morphological analysis is used to study the internal structure words by reducing the number of vocab...
Abstract. This paper deals with the automatic construction of a lem-matizer from a Full Form- Lemma ...
In this paper we present the pipeline of recently developed language technology tools for Slovene, C...
The model for lemmatisation of non-standard Slovenian was built with the CLASSLA-StanfordNLP tool (h...
This thesis is focused on lemmatizing of nouns and adjectives. It is based on morphology of Czech la...
The model for lemmatisation of non-standard Serbian was built with the CLASSLA-StanfordNLP tool (htt...
Lemmatization is the process of finding the normalized form of a word. It is the same as looking for...
Abstract: Lemmatisation is the process of finding the normalised forms of words appearing in text. I...
Lemmatization for languages with rich inflectional morphology is one of the basic, indispensable ste...
Abstract. Identifying the lemma of a Named Entity is important for many Natural Language Processing ...
The model for lemmatisation of non-standard Slovenian was built with the CLASSLA-StanfordNLP tool (h...
The model for lemmatisation of standard Slovenian was built with the CLASSLA-StanfordNLP tool (https...
We present a novel tool for morphological analysis of Serbian, which is a low-resource language with...
The paper presents the implementation and evaluation of a module for full lemma-tization of Croatian...
Aim of this bachelor thesis was to become familiar with the tools and methods for morphological anal...
Morphological analysis is used to study the internal structure words by reducing the number of vocab...
Abstract. This paper deals with the automatic construction of a lem-matizer from a Full Form- Lemma ...
In this paper we present the pipeline of recently developed language technology tools for Slovene, C...
The model for lemmatisation of non-standard Slovenian was built with the CLASSLA-StanfordNLP tool (h...
This thesis is focused on lemmatizing of nouns and adjectives. It is based on morphology of Czech la...
The model for lemmatisation of non-standard Serbian was built with the CLASSLA-StanfordNLP tool (htt...