In this paper Brill's rule-based PoS tagger is tested and adapted for Hungarian. It is shown that the present system does not obtain as high accuracy for Hungarian as it does for English (and other Germanic languages) because of the structural difference between these languages. Hungarian, unlike English, has rich morphology, is agglutinative with some inflectional characteristics and has fairly free word order. The tagger has the greatest difficulties with parts-of-speech belonging to open classes because of their complicated morphological structure. It is shown that the accuracy of tagging can be increased from approximately 83 % to 97 % by simply changing the rule generating mechanisms, namely the lexical templates in the lexical tr...
The use of a corpus as a language resource is enhanced when it is part of speech (POS) tagged. There...
Part-of-speech (POS) tagging is one of the most basic and crucial tasks in Natural Language Processi...
This rule based Tibetan part-of-speech (POS) tagger was prepared in the course of the research proje...
. From the point of view of computational linguistics, Hungarian is a difficult language due to its...
Many of the methods developed for Western European languages and used widespread to produce annotate...
We have trained the rule-based Brill-Tagger for German. In this paper we show how the tagging perfor...
This paper explores the relationship between the tagset design and linguistic properties of inflecte...
Linguistically annotated text resources are still scarce for many languages and for many text types,...
Part-of-speech tagging is a fundamental task of natural language processing. For languages with a ve...
POS tagging is assignment a label or a tag to a word in an exceedingly sentence consistent with its ...
We present and evaluate the implementation of Part of Speech (POS) Tagging for the Kadazan language ...
This paper describes an approach to POS tagging based on the automatic refinement of manually writte...
Part-of-speech (POS) tagging is a well-established technology for most Western European languages an...
The aim of this article is to show how automatic morphological tools originally used to analyze nati...
This work presents a part of a more global study on the problem of parsing of Czech and on the knowl...
The use of a corpus as a language resource is enhanced when it is part of speech (POS) tagged. There...
Part-of-speech (POS) tagging is one of the most basic and crucial tasks in Natural Language Processi...
This rule based Tibetan part-of-speech (POS) tagger was prepared in the course of the research proje...
. From the point of view of computational linguistics, Hungarian is a difficult language due to its...
Many of the methods developed for Western European languages and used widespread to produce annotate...
We have trained the rule-based Brill-Tagger for German. In this paper we show how the tagging perfor...
This paper explores the relationship between the tagset design and linguistic properties of inflecte...
Linguistically annotated text resources are still scarce for many languages and for many text types,...
Part-of-speech tagging is a fundamental task of natural language processing. For languages with a ve...
POS tagging is assignment a label or a tag to a word in an exceedingly sentence consistent with its ...
We present and evaluate the implementation of Part of Speech (POS) Tagging for the Kadazan language ...
This paper describes an approach to POS tagging based on the automatic refinement of manually writte...
Part-of-speech (POS) tagging is a well-established technology for most Western European languages an...
The aim of this article is to show how automatic morphological tools originally used to analyze nati...
This work presents a part of a more global study on the problem of parsing of Czech and on the knowl...
The use of a corpus as a language resource is enhanced when it is part of speech (POS) tagged. There...
Part-of-speech (POS) tagging is one of the most basic and crucial tasks in Natural Language Processi...
This rule based Tibetan part-of-speech (POS) tagger was prepared in the course of the research proje...