This article presents the corpus REDEWIEDERGABE, a German-language historical corpus with detailed annotations for speech, thought and writing representation (ST&WR). With approximately 490,000 tokens, it is the largest resource of its kind. It can be used to answer literary and linguistic research questions and can serve as training material for machine learning. The article describes the composition of the corpus and the annotation structure, discusses some methodological decisions and gives basic statistics about the forms of ST&WR found in the corpus.
The present German political speeches corpus follows from an initial release wh...
The "AAC - Austrian Academy Corpus" is a diachronic German-language digital text corpus of...
The research project “German Today” aims to determine the amount of regional variation in (near-)st...
This beta release contains the first subset of the "Redewiedergabe" corpus. It includes 619 text sam...
This article discusses questions concerning the creation, annotation and sharing of spoken language ...
This article presents the main results of a project, which explored ways to recognize and classify a...
This paper describes an approach to teaching German designed for students who wish to acquire a read...
This article presents the main results of a project, which explored ways to recognize and classify ...
This paper discusses the technological and methodological challenges in creating and sharing HAMATAC...
This corpus consists of 200 annotated German legal judgements. In all judgements the parts "conclusio...
A digital corpus on variation in German (1800-1950) The German Innsbruck Corpus (GermInnC) 1800-195...
Corpora, i.e. collections of linguistic data (texts or conversations), are a fundamental asset of di...
The ultimate aim of the project is to compile a representative historical corpus of written German f...
This paper gives an insight into the basic concepts for a corpus-based lexical resource of spoken Ge...
The ultimate aim of the project is to compile a representative historical corpus of written German f...