We study how adequate current methods for POS-tagging historical Dutch texts are, and explore how different techniques can improve on current practice. The focus is on (unsupervised) methods that adapt easily to different domains without requiring extensive manual input. We found that modernising the spelling of corpora before tagging them with a tagger trained on contemporary Dutch yields a large increase in accuracy, but that spelling normalisation alone is not sufficient to obtain state-of-the-art results. The best results were achieved by training a POS-tagger on a corpus automatically annotated by projecting (automatically assig...
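The normalise-then-tag idea described above can be illustrated with a small sketch. This is not the pipeline used in the study: the spelling lexicon, the lookup "tagger", the example sentence, and the coarse tag labels are all hypothetical stand-ins, chosen only to show why tagging modernised forms with a tagger built for contemporary Dutch can recover tokens that would otherwise be unknown.

```python
# Toy sketch: modernise historical spelling, then tag with a "modern" tagger.
# Every word list and tag below is a hypothetical illustration, not real data.

# Hypothetical lookup from historical spellings to modern spellings.
NORMALISATION_LEXICON = {
    "ick": "ik",
    "huys": "huis",
    "ghesien": "gezien",
}

# Hypothetical stand-in for a tagger trained on contemporary Dutch:
# a plain dictionary from modern word forms to coarse POS tags.
MODERN_TAG_LEXICON = {
    "ik": "PRON",
    "heb": "VERB",
    "het": "DET",
    "huis": "NOUN",
    "gezien": "VERB",
}


def normalise(token):
    """Map a token to its modern spelling; unknown tokens are only lowercased."""
    lowered = token.lower()
    return NORMALISATION_LEXICON.get(lowered, lowered)


def tag_modern(forms):
    """Tag word forms with the toy modern lexicon; unseen forms get 'UNK'."""
    return [MODERN_TAG_LEXICON.get(form, "UNK") for form in forms]


def tag_historical(tokens, modernise=True):
    """Tag historical tokens, optionally modernising their spelling first."""
    forms = [normalise(t) if modernise else t.lower() for t in tokens]
    return list(zip(tokens, tag_modern(forms)))


def accuracy(tagged, gold_tags):
    """Fraction of tokens whose predicted tag equals the gold tag."""
    correct = sum(1 for (_, pred), gold in zip(tagged, gold_tags) if pred == gold)
    return correct / len(gold_tags)


if __name__ == "__main__":
    sentence = ["Ick", "heb", "het", "huys", "ghesien"]
    gold = ["PRON", "VERB", "DET", "NOUN", "VERB"]
    print("without modernisation:", accuracy(tag_historical(sentence, modernise=False), gold))
    print("with modernisation:   ", accuracy(tag_historical(sentence), gold))
```

In this toy setting the gain comes entirely from the lexicon lookup. A real system would replace both dictionaries with a learned normalisation model and a trained tagger, and, as the abstract notes, normalisation alone would not be expected to reach state-of-the-art accuracy.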
Syntactically annotated corpora are highly important for enabling large-scale diachronic and diatopi...
In this paper a Neural Network is designed for Part-of-Speech Tagging of Dutch text. Our approach us...
In this paper we focus on automatic part-of-speech (POS) annotation, in the context of historical En...
This paper reports on the construction of a tagged and parsed pilot corpus of the southern Dutch dia...
We are annotating the complete 20 million Dutch PAROLE corpus with PoS and lemma. The morphosyntacti...
In this paper, we present experiments on POS tagging historical texts that contain spelling variatio...
This paper describes the application of a part-of-speech tagger to a particular configuration of his...
The CLIN27 shared task evaluates the effect of translating historical text to modern text with the g...
This paper describes the lemmatisation and tagging guidelines developed for the “Spoken Dutch Corpus...
We present a part of speech tagger (morphosyntactic disambiguator) for Dutch, constructed by means o...
In this article, we present a new corpus spanning 163 years of written Dutch. This Dutch Corpus of C...
In this paper we investigate whether, for Dutch open class words, it is possible to generate the sur...