The field of Part of Speech (POS) tagging has made slow but steady progress during the last decade, though many of the new methods developed have not previously been applied to Swedish. I present a new system, based on the Averaged Perceptron algorithm and semi-supervised learning, that is more accurate than previous Swedish POS taggers. Furthermore, a new version of the Stockholm-Umeå Corpus is presented, whose more consistent annotation leads to significantly lower error rates for the POS tagger. Finally, a new, freely available annotated corpus of Swedish blog posts is presented and used to evaluate the tagger’s accuracy on this increasingly important genre. Details of the evaluation are presented throughout, to ensure easy comparison wi...
Many parsers use a part-of-speech tagger as a first step in parsing. The accuracy of the tagger natur...
In this paper, we describe the development of a new tagged corpus of Icelandic, consisting of about ...
Part-of-speech tagging (POS-tagging) of spoken data requires different means of annotation than POS-...
This work presents Stagger, a new open-source part of speech tagger for Swedish based on the Average...
This work presents Stagger, a new open-source part of speech tagger for Swedish based on the Average...
In this paper a data-driven method for Part-of-Speech tagging not using any n-grams of tags is prese...
State-of-the-art statistical part-of-speech taggers mainly use information on tag bi- or trigrams, d...
HunPoS, a freely available open source part-of-speech tagger—a reimplementa-tion of one of the best ...
This paper reports the ongoing work of producing a state of the art part of speech tagger for unedit...
There is an increasing interest in the NLP community in developing tools for annotating historical d...
This paper reports on two experiments with a probabilistic part-of-speech tagger, trained on a tagge...
This paper explores the impact of inconsistencies stemming from human mistakes on the accuracy of pa...
ABSTRACT There is an increasing interest in the NLP community in developing tools for annotating his...
This thesis describes the work of providing separate morphological processing and part-of-speech tag...
Part-of-speech tagging (POS-tagging) of spoken data requires different means of annotation than POS-...
Many parsers use a part-of-speech tagger as a first step in parsing. The accuracy of the tagger natur...
In this paper, we describe the development of a new tagged corpus of Icelandic, consisting of about ...
Part-of-speech tagging (POS-tagging) of spoken data requires different means of annotation than POS-...
This work presents Stagger, a new open-source part of speech tagger for Swedish based on the Average...
This work presents Stagger, a new open-source part of speech tagger for Swedish based on the Average...
In this paper a data-driven method for Part-of-Speech tagging not using any n-grams of tags is prese...
State-of-the-art statistical part-of-speech taggers mainly use information on tag bi- or trigrams, d...
HunPoS, a freely available open source part-of-speech tagger—a reimplementa-tion of one of the best ...
This paper reports the ongoing work of producing a state of the art part of speech tagger for unedit...
There is an increasing interest in the NLP community in developing tools for annotating historical d...
This paper reports on two experiments with a probabilistic part-of-speech tagger, trained on a tagge...
This paper explores the impact of inconsistencies stemming from human mistakes on the accuracy of pa...
ABSTRACT There is an increasing interest in the NLP community in developing tools for annotating his...
This thesis describes the work of providing separate morphological processing and part-of-speech tag...
Part-of-speech tagging (POS-tagging) of spoken data requires different means of annotation than POS-...
Many parsers use a part-of-speech tagger as a first step in parsing. The accuracy of the tagger natur...
In this paper, we describe the development of a new tagged corpus of Icelandic, consisting of about ...
Part-of-speech tagging (POS-tagging) of spoken data requires different means of annotation than POS-...