We present IceMorph, a semi-supervised morphosyntactic analyzer of Old Icelandic. In addition to machine-read corpora and dictionaries, it applies a small set of declension prototypes to map corpus words to dictionary entries. A web-based GUI allows expert users to modify and augment data through an online process. A machine learning module incorporates prototype data, edit-distance metrics, and expert feedback to continuously update part-of-speech and morphosyntactic classification. An advantage of the analyzer is its ability to achieve competitive classification accuracy with minimal training data.
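The idea of combining declension prototypes with edit distance can be illustrated with a minimal sketch. It is not IceMorph's implementation: the prototype table, the tag labels, the stem rule, and the function names `levenshtein` and `best_match` are all assumptions made for illustration. The sketch expands each dictionary headword with the endings of one prototype (a strong masculine noun of the type *hestr*) and picks the generated form closest to the corpus word.

```python
from typing import NamedTuple

# Hypothetical prototype: endings of a strong masculine noun like "hestr".
# The tag labels are illustrative, not IceMorph's tagset.
PROTOTYPE_ENDINGS = {
    "r": "nom.sg",
    "": "acc.sg",
    "i": "dat.sg",
    "s": "gen.sg",
    "ar": "nom.pl",
    "a": "acc.pl",
    "um": "dat.pl",
}


def levenshtein(a: str, b: str) -> int:
    """Classic dynamic-programming edit distance (insert, delete, substitute)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        curr = [i]
        for j, cb in enumerate(b, 1):
            curr.append(min(prev[j] + 1,                # deletion
                            curr[j - 1] + 1,            # insertion
                            prev[j - 1] + (ca != cb)))  # substitution
        prev = curr
    return prev[-1]


class Match(NamedTuple):
    distance: int
    lemma: str
    tag: str


def best_match(word: str, headwords: list[str]) -> Match:
    """Return the generated form closest to `word` across all headwords."""
    candidates = []
    for lemma in headwords:
        # Assumed stem rule: drop the nominative ending "-r" from the headword.
        stem = lemma[:-1] if lemma.endswith("r") else lemma
        for ending, tag in PROTOTYPE_ENDINGS.items():
            candidates.append(Match(levenshtein(word, stem + ending), lemma, tag))
    # Tuples compare by distance first, so min() picks the closest form.
    return min(candidates)


if __name__ == "__main__":
    print(best_match("hestum", ["hestr", "armr"]))
    print(best_match("hestvm", ["hestr", "armr"]))  # spelling variant, distance 1
```

A nonzero distance lets spelling variants in the corpus still map to a dictionary entry. In a system like the one described, such distances would presumably serve as one feature among several rather than as the final decision.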
Foreign words, such as names, locations or sometimes entire phrases, are a problem for any system th...
In this paper we describe and evaluate a tool for paradigm induction and lexicon extraction that has...
The Icelandic language is a morphologically complex language, for which a large tagset has been crea...
This dataset consists of four main resources: a concatenated dictionary of Old Icelandic parsed for ...
We describe the Corpus of Spoken Icelandic (ÍS-TAL) which is made up of 15 hours of spontaneous natu...
Natural language processing (NLP) is a very young discipline in Iceland. Therefore, there is a lack of...
In this paper, we experiment with using Stagger, an open-source implementation of an Averaged Percep...
We present an overview of an ongoing project which has the aim of developing methods for building a ...
We describe experiments with morphosyntactic tagging of Old Norse narrative texts using different ta...
We describe the background for and building of IcePaHC, a one million word parsed historical corpus ...
This book showcases the state of the art in corpus-based linguistic analysis of Celtic languages (sp...
The transformations are specific to Old Icelandic. Their purpose is to improve classification per...
The new POS-tagged Icelandic corpus of the Leipzig Corpora Collection is an extensive resource for t...