This paper reports on the construction of a tagged and parsed pilot corpus of the southern Dutch dialects. The corpus aims to facilitate diachronic research into the syntax of Dutch, as its dialects have retained many interesting (morpho)syntactic features which can often be traced back to changes starting in, or characteristics retained from, older stages of Dutch. The discussion mainly focuses on initial test results achieved by applying existing NLP tools which have been developed or optimised for POS tagging and parsing standard Dutch. We report on initial tests on our data with Frog, TreeTagger and Alpino. We discuss some of the challenges we have encountered working with spoken, unstandardised language in general on the one h...
The computational linguistics community in The Netherlands and Belgium has long recognized the dire ...
This paper discusses some of the advantages and disadvantages of the various choices we had to make ...
The computational linguistics community in The Netherlands and Belgium has long recognized the dire ...
In this paper, we report on the construction of a linguistically annotated pilot corpus of the south...
We present a study of the adequacy of current methods that are used for POS-tagging historical Dutch...
This paper describes the lemmatisation and tagging guidelines developed for the “Spoken Dutch Corpus...
This paper discusses the design, methodology and results of the project Syntactic Atlas of the Dutch...
In this paper the ANNO Project ("Een Geannoteerde Publieke Gegevensbank voor het Geschreven Ned...
After the successful completion of the Spoken Dutch Corpus (1998 – 2003) the time is ripe to take so...
The Spoken Dutch Corpus that is currently under construction will constitute a 10-million-word corpu...
The advent of Early Modern Dutch (starting ∼1550) marked significant developments in language use in...
The Spoken Dutch Corpus (CGN) is a major new resource for research into contemporary spoken Dutch. A...
We are annotating the complete 20 million Dutch PAROLE corpus with PoS and lemma. The morphosyntacti...
We describe the development of a Dutch memory-based shallow parser. The availability of large treeba...
For the first time in the history of Dutch dialectology, a detailed overview of the variation in the...