This paper reports the ongoing work of producing a state of the art part of speech tagger for unedited Swedish text. Rules eliminating faulty tags have been induced using Progol. In previously reported experiments, almost no linguistically motivated background knowledge was used [5, 8]. Still, the result was rather promising (recall 97.7%, with a pending average ambiguity of 1.13 tags/word). Compared to the previous study, a much richer, more linguistically motivated, background knowledge has been supplied, consisting of examples of noun phrases, verb chains, auxiliary verbs, and sets of part of speech categories. The aim has been to create the background knowledge rapidly, without laborious hand-coding of linguistic knowledge. In addition ...
Thesis (M.Sc. Engineering Sciences (Electrical and Electronic Engineering))--North-West University, ...
Automatic part of speech tagging is an area of natural lan-guage processing where statistical techni...
Linguistically annotated text resources are still scarce for many languages and for many text types,...
This work presents Stagger, a new open-source part of speech tagger for Swedish based on the Average...
We present an implementation of a part-of-speech tagger based on a hidden Markov model. The methodol...
{In this paper we present an evolutionary approach to the part-of-speech tagging problem. The goal o...
This work presents Stagger, a new open-source part of speech tagger for Swedish based on the Average...
The Icelandic language is a morphologically complex language, for which a large tagset has been crea...
The field of Part of Speech (POS) tagging has made slow but steady progress during the last decade, ...
This paper reports on two experiments with a probabilistic part-of-speech tagger, trained on a tagge...
Part of speech (POS) tagging is a process of identifying the part of speech of a word in a text. It ...
Parts of speech (POS) tagging is the process of assigning a word in a text as corresponding to a par...
A system for ‘tagging’ words with their part-of-speech (POS) tags is constructed. The system has two...
© 2005 Andrew MacKinlayIn natural language processing (NLP), a crucial subsystem in a wide range of ...
Proceedings of the 16th Nordic Conference of Computational Linguistics NODALIDA-2007. Editors: Jo...
Thesis (M.Sc. Engineering Sciences (Electrical and Electronic Engineering))--North-West University, ...
Automatic part of speech tagging is an area of natural lan-guage processing where statistical techni...
Linguistically annotated text resources are still scarce for many languages and for many text types,...
This work presents Stagger, a new open-source part of speech tagger for Swedish based on the Average...
We present an implementation of a part-of-speech tagger based on a hidden Markov model. The methodol...
{In this paper we present an evolutionary approach to the part-of-speech tagging problem. The goal o...
This work presents Stagger, a new open-source part of speech tagger for Swedish based on the Average...
The Icelandic language is a morphologically complex language, for which a large tagset has been crea...
The field of Part of Speech (POS) tagging has made slow but steady progress during the last decade, ...
This paper reports on two experiments with a probabilistic part-of-speech tagger, trained on a tagge...
Part of speech (POS) tagging is a process of identifying the part of speech of a word in a text. It ...
Parts of speech (POS) tagging is the process of assigning a word in a text as corresponding to a par...
A system for ‘tagging’ words with their part-of-speech (POS) tags is constructed. The system has two...
© 2005 Andrew MacKinlayIn natural language processing (NLP), a crucial subsystem in a wide range of ...
Proceedings of the 16th Nordic Conference of Computational Linguistics NODALIDA-2007. Editors: Jo...
Thesis (M.Sc. Engineering Sciences (Electrical and Electronic Engineering))--North-West University, ...
Automatic part of speech tagging is an area of natural lan-guage processing where statistical techni...
Linguistically annotated text resources are still scarce for many languages and for many text types,...