We consider the task of tagging Slovene words with morphosyntactic descriptions (MSDs). MSDs contain not only part-of-speech information but also attributes such as gender and case. In the case of Slovene there are 2,083 possible MSDs. P-Progol was used to learn morphosyntactic disambiguation rules from annotated data (consisting of 161,314 examples) produced by the MULTEXT-East project. P-Progol produced 1,148 rules taking 36 hours. Using simple grammatical background knowledge, e.g. looking for case disagreement, P-Progol induced 4,094 clauses in eight parallel runs. These rules have proved effective at detecting and explaining incorrect MSD annotations in an independent test set, but have not so far produced a tagger comparable to other ...
We introduce the use of various tools for Slovenian language processing and adapt them for NLTK lib...
This paper deals with the development of morphosyntactic taggers for spoken varieties of the Slavic ...
Sloleks is the reference morphological lexicon for the Slovenian language, developed to be used in NLP a...
The paper evaluates tagging techniques on a corpus of Slovene, where we are faced with a large numbe...
Part-of-speech tagging or, more accurately, morphosyntactic tagging, is a procedure that assigns to ...
The JOS morphosyntactic resources for Slovene consist of the specifications, lexicon, and two corpor...
This model for morphosyntactic annotation of non-standard Slovenian was built with the CLASSLA-Stanf...
In this paper, we provide detailed insight on properties of errors generated by a stochast...
This model for morphosyntactic annotation of standard Slovenian was built with the CLASSLA-StanfordN...
The model for morphosyntactic annotation of standard Slovenian was built with the CLASSLA-StanfordNL...
The use of computer tools has led to major advances in the study of spoken lan...
Sloleks is a reference morphological lexicon of Slovene that was developed to be used in various NLP...