We outline the issues and decisions involved in creating a Penn-style treebank of Middle Low German (MLG, 1200–1650), which will form part of the Corpus of Historical Low German (CHLG). The attestation for MLG is rich, but the syntax of the language remains relatively understudied. The development of a syntactically annotated corpus for the language will facilitate future studies with a strong empirical basis, building on recent work which indicates that, syntactically, MLG occupies a position in its own right within West Germanic. In this paper, we describe the background for the corpus and the process by which texts were selected for inclusion. In particular, we focus on the decisions involved in the syntactic annotation of the corpus, s...
The project "Reference Corpus Middle Low German/Low Rhenish (1200–1650)", abbreviated as "ReN", is p...
Over the last few decades, the wide diffusion of digital technology and the growing ease of transfer...
Traditionally, deep, wide-coverage linguistic resources are hand-crafted and their creation is time-...
Syntactically annotated corpora are highly important for enabling large-scale diachronic and diatopi...
Our paper focuses on the one hand on the challenges posed by the structural variability, flexibility...
Manual development of deep linguistic resources is time-consuming and costly and therefore often des...
Constituency parsing plays a fundamental role in advancing natural language processing (NLP) tasks. ...
This paper describes the application of a part-of-speech tagger to a particular configuration of his...
We present data-driven methods for the acquisition of LFG resources from two German treebanks. We di...
We present a new comprehensive dataset for the unstandardised West-Germanic language Low Saxon cover...
The purpose of this paper is to describe the TüBa-D/Z treebank of written German and to compare it t...
Large-scale diachronic corpus studies covering longer time periods are difficult if more than one co...
This paper profiles significant differences in syntactic distribution and differences in word class ...