The paper evaluates tagging techniques on a corpus of Slovene, where we are faced with a large number of possible word-class tags and only a small (hand-tagged) dataset. We report on training and testing of four different taggers on the Slovene MULTEXT-East corpus containing about 100.000 words and 1000 different morphosyntactic tags. Results show, first of all, that training times of the Maximu
Janes-Tag is a manually annotated corpus of Slovene Computer-Mediated Communication (CMC). It is mea...
Janes-Tag is a manually annotated corpus of Slovene Computer-Mediated Communication (CMC). It is mea...
The jos1M corpus contains 1 million words of sampled paragraphs from the FidaPLUS corpus. It is mean...
Part-of-speech tagging or, more accurately, morphosyntactic tagging, is a procedure that assigns to ...
We consider the task of tagging Slovene words with morphosyntactic descriptions (MSDs). MSDs contain...
We consider the task of tagging Slovene words with morphosyntactic descriptions (MSDs). MSDs contain...
Morphosyntactic tagging of Croatian texts is performed with stochastic taggersby using a language mo...
Abstract. We present results of an experiment dealing with combining outputs of five part-of-speech ...
This paper deals with the development of morphosyntactic taggers for spoken varieties of the Slavic ...
Abstract. In this paper, we provide detailed insight on properties of errors generated by a stochast...
his paper evaluates six commonly available parts-of-speech tagging tools over corpora other than tho...
This paper evaluates six commonly available parts-of-speech tagging tools over corpora other than th...
The JOS morphosyntactic resources for Slovene consist of the specifications, lexicon, and two corpor...
The jos1M corpus contains 1 million words of sampled paragraphs from the Gigafida corpus. It is mean...
Part-of-speech tagger for Slovene language implemented using convolutional and LSTM neural networks....
Janes-Tag is a manually annotated corpus of Slovene Computer-Mediated Communication (CMC). It is mea...
Janes-Tag is a manually annotated corpus of Slovene Computer-Mediated Communication (CMC). It is mea...
The jos1M corpus contains 1 million words of sampled paragraphs from the FidaPLUS corpus. It is mean...
Part-of-speech tagging or, more accurately, morphosyntactic tagging, is a procedure that assigns to ...
We consider the task of tagging Slovene words with morphosyntactic descriptions (MSDs). MSDs contain...
We consider the task of tagging Slovene words with morphosyntactic descriptions (MSDs). MSDs contain...
Morphosyntactic tagging of Croatian texts is performed with stochastic taggersby using a language mo...
Abstract. We present results of an experiment dealing with combining outputs of five part-of-speech ...
This paper deals with the development of morphosyntactic taggers for spoken varieties of the Slavic ...
Abstract. In this paper, we provide detailed insight on properties of errors generated by a stochast...
his paper evaluates six commonly available parts-of-speech tagging tools over corpora other than tho...
This paper evaluates six commonly available parts-of-speech tagging tools over corpora other than th...
The JOS morphosyntactic resources for Slovene consist of the specifications, lexicon, and two corpor...
The jos1M corpus contains 1 million words of sampled paragraphs from the Gigafida corpus. It is mean...
Part-of-speech tagger for Slovene language implemented using convolutional and LSTM neural networks....
Janes-Tag is a manually annotated corpus of Slovene Computer-Mediated Communication (CMC). It is mea...
Janes-Tag is a manually annotated corpus of Slovene Computer-Mediated Communication (CMC). It is mea...
The jos1M corpus contains 1 million words of sampled paragraphs from the FidaPLUS corpus. It is mean...