The 'Deutsche Referenzkorpus (DeReKo)' of the Mannheimer Institut für Deutsche Sprache currently contains over 28 billion words, and it is constantly being expanded. The sheer size of the corpus makes it impractical for researchers to analyze its entire content. On the other hand, the DeReKo offers the possibility of taking seriously the principle that every research project needs its own corpus - by acting as a 'reference corpus' that can be used in combination with special corpora. This paper addresses the question of whether a corpus should contain complete texts or only statistically relevant extracts; it also discusses the uses and necessity of 'small corpora'
Large linguistic corpora are becoming increasingly important for linguistic work. For several years ...
This paper gives an overview of recent developments in the German Reference Corpus DeReKo in terms o...
When analysing corpora with automatic and statistical means, one should remember that the raw materi...
^This paper describes DeReKo (Deutsches Referenzkorpus), the Archive of General Reference Corpora of...
This paper describes DEREKO, the archive of general reference corpora of contemporary written German...
This paper discusses current trends in DeReKo, the German Reference Corpus, concerning legal issues ...
A corpus is a collection of texts in electronic form that are the object of literary or linguistic s...
As corpus building is an activity that takes times and costs money, readers may wish to use ready-ma...
A corpus is a collection of authentic, non-elicited texts selected and assembled to study language. ...
A corpus is a collection of authentic, non-elicited texts selected and assembled to study language. ...
A corpus is a collection of texts in electronic form that are the object of literary or linguistic s...
A corpus is a collection of texts in electronic form that are the object of literary or linguistic s...
Nowadays a corpus is typically a large collection of text excerpts, representing a range of register...
Der Beitrag betrachtet das Deutsche Referenzkorpus DeReKo in Bezug auf Strategien für seinen Ausbau,...
Der Beitrag betrachtet das Deutsche Referenzkorpus DeReKo in Bezug auf Strategien für seinen Ausbau,...
Large linguistic corpora are becoming increasingly important for linguistic work. For several years ...
This paper gives an overview of recent developments in the German Reference Corpus DeReKo in terms o...
When analysing corpora with automatic and statistical means, one should remember that the raw materi...
^This paper describes DeReKo (Deutsches Referenzkorpus), the Archive of General Reference Corpora of...
This paper describes DEREKO, the archive of general reference corpora of contemporary written German...
This paper discusses current trends in DeReKo, the German Reference Corpus, concerning legal issues ...
A corpus is a collection of texts in electronic form that are the object of literary or linguistic s...
As corpus building is an activity that takes times and costs money, readers may wish to use ready-ma...
A corpus is a collection of authentic, non-elicited texts selected and assembled to study language. ...
A corpus is a collection of authentic, non-elicited texts selected and assembled to study language. ...
A corpus is a collection of texts in electronic form that are the object of literary or linguistic s...
A corpus is a collection of texts in electronic form that are the object of literary or linguistic s...
Nowadays a corpus is typically a large collection of text excerpts, representing a range of register...
Der Beitrag betrachtet das Deutsche Referenzkorpus DeReKo in Bezug auf Strategien für seinen Ausbau,...
Der Beitrag betrachtet das Deutsche Referenzkorpus DeReKo in Bezug auf Strategien für seinen Ausbau,...
Large linguistic corpora are becoming increasingly important for linguistic work. For several years ...
This paper gives an overview of recent developments in the German Reference Corpus DeReKo in terms o...
When analysing corpora with automatic and statistical means, one should remember that the raw materi...