The paper discusses from various angles the morphosyntactic annotation of DeReKo, the Archive of General Reference Corpora of Contemporary Written German at the Institut für Deutsche Sprache (IDS), Mannheim. It is divided into two parts. The first part covers the practical and technical aspects of this endeavor. We present results from a recent evaluation of tools for the annotation of German text resources that have been applied to DeReKo. These tools include commercial products, especially Xerox's Finite State Tools and the Machinese products developed by the Finnish company Connexor Oy, as well as software that is available to academic institutions free of charge, e.g. Helmut Schmid's TreeTagger. The secon...
This paper addresses methodological issues of concern to the study of morphosyntactic variation. Whi...
The Stuttgart-Tübingen Tagset (STTS) is a widely used POS annotation scheme for German which provide...
A case study based on experience in linguistic investigations using annotated monolingual and multil...
This document and additional documentation for DEREKO can be found at the project web site: http://w...
This paper describes DEREKO, the archive of general reference corpora of contemporary written German...
This paper describes DeReKo (Deutsches Referenzkorpus), the Archive of General Reference Corpora of...
The parsing of natural language relies on the syntactic characteristics of words. The part of speec...
This article discusses questions concerning the creation, annotation and sharing of spoken language ...
Corpora, i.e. collections of linguistic data (texts or conversations), are a fundamental asset of di...
Our paper focuses on the one hand on the challenges posed by the structural variability, flexibility...
The purpose of this paper is to describe recent developments in the morphological, syntactic, and se...
This paper reports on the TIGER Treebank, a corpus of currently 35,000 syntactically annotated Germa...
The Prague Dependency Treebank has been conceived of as a semi-automatic three-layer annotation syst...
In recent years, the availability of large annotated and searchable corpora, together with a new int...