<p>The transformations are specific to Old Icelandic. Their purpose is to improve classification performance by making the classifier more robust with respect to errors introduced earlier in the IceMorph system, such as OCR errors or differences in spelling convention between words in the corpus and dictionary sources.</p
We describe the background for and building of IcePaHC, a one million word parsed historical corpus ...
We present an overview of an ongoing project which has the aim of developing methods for building a ...
We model the edit distance as a function in a labelling space. A labelling space is an Euclidean spa...
We present IceMorph, a semi-supervised morphosyntactic analyzer of Old Icelandic. In addition to mac...
We present IceMorph, a semi-supervised morphosyntactic analyzer of Old Icelandic. In addition to mac...
Historical texts are an important resource for researchers in the humanities. However, standard NLP ...
<p>Three edit operations (i.e., two substitutions and one insertion) are required to transform “antl...
This dataset consists of four main resources: a concatenated dictionary of Old Icelandic parsed for ...
Uninhabited mistakes while writing happens are unstoppable. There are certain common errors that occ...
The edit distance (or Levenshtein distance) between two strings x, y is the minimum number of charac...
Abstract: The Levenshtein distance is an established metric to represent phono-logical distances bet...
Traditional spelling correction Looks for word forms which are not valid This is often insufficien...
The topic of this paper is the linking of two major lexicographic resources on Icelandic, the Dictio...
We consider the isolated spelling error correction problem as a specific subproblem of the more gene...
We consider the isolated spelling error correction problem as a specific subproblem of the more gene...
We describe the background for and building of IcePaHC, a one million word parsed historical corpus ...
We present an overview of an ongoing project which has the aim of developing methods for building a ...
We model the edit distance as a function in a labelling space. A labelling space is an Euclidean spa...
We present IceMorph, a semi-supervised morphosyntactic analyzer of Old Icelandic. In addition to mac...
We present IceMorph, a semi-supervised morphosyntactic analyzer of Old Icelandic. In addition to mac...
Historical texts are an important resource for researchers in the humanities. However, standard NLP ...
<p>Three edit operations (i.e., two substitutions and one insertion) are required to transform “antl...
This dataset consists of four main resources: a concatenated dictionary of Old Icelandic parsed for ...
Uninhabited mistakes while writing happens are unstoppable. There are certain common errors that occ...
The edit distance (or Levenshtein distance) between two strings x, y is the minimum number of charac...
Abstract: The Levenshtein distance is an established metric to represent phono-logical distances bet...
Traditional spelling correction Looks for word forms which are not valid This is often insufficien...
The topic of this paper is the linking of two major lexicographic resources on Icelandic, the Dictio...
We consider the isolated spelling error correction problem as a specific subproblem of the more gene...
We consider the isolated spelling error correction problem as a specific subproblem of the more gene...
We describe the background for and building of IcePaHC, a one million word parsed historical corpus ...
We present an overview of an ongoing project which has the aim of developing methods for building a ...
We model the edit distance as a function in a labelling space. A labelling space is an Euclidean spa...