The Prague family of annotated corpora has a new member, the Czech Academic Corpus version 2.0 (CAC 2.0). CAC 2.0 consists of 650,000 words from various 1970s and 1980s newspapers and magazines and from transcripts of radio and television broadcasts, all manually annotated for morphology and syntax.
This paper presents the final version of the Czech Broadcast Conversation Corpus that will shortly b...
Corpus of contemporary written (printed) Czech sized 3.6 GW (i.e. 4.3 billion tokens). It covers mos...
ORAL2013 is designed as a representation of authentic spoken Czech used in informal situations (priv...
Czech Named Entity Corpus 2.0 is a corpus of 8993 Czech sentences with manually annotated 35220 Czec...
This article introduces a very large Czech text corpus for language research – csTenTen17 compiled f...
Corpus of contemporary written (printed) Czech sized 4.7 GW (i.e. 5.7 billion tokens). It covers mos...
Web corpus of Czech, created in 2011. Contains newspapers+magazines, discussions, blogs. See http://...
The Prague Dependency Treebank of Spoken Czech 2.0 (PDTSC 2.0) is a corpus of spoken language, consi...
The presented Czech Named Entity Corpus 1.0 is the first publicly available corpus providing a large...
BASIC INFORMATION: Czech Text Document Corpus v 2.0 is a collection of text do...
In our paper, we present the main results of the Czech grant project Internet as a Language Corpus, whos...