BASIC INFORMATION -------------------- Czech Text Document Corpus v 2.0 is a collection of text documents for automatic document classification in Czech language. It is composed of the text documents provided by the Czech News Agency and is freely available for research purposes. This corpus was created in order to facilitate a straightforward comparison of the document classification approaches on Czech data. It is particularly dedicated to evaluation of multi-label document classification approaches, because one document is usually labelled with more than one label. Besides the information about the document classes, the corpus is also annotated at the morphological layer. The main part (for training and testing) is composed of 11,95...
The Prague Dependency Treebank 2.0 (PDT 2.0) contains a large amount of Czech texts with complex and...
The Prague Dependency Treebank 2.0 (PDT 2.0) contains a large amount of Czech texts with complex and...
This package comprises eight models of Czech word embeddings trained by applying word2vec (Mikolov e...
BASIC INFORMATION -------------------- Czech Text Document Corpus v 2.0 is a collection of text do...
Czech Named Entity Corpus 2.0 is a corpus of 8993 Czech sentences with manually annotated 35220 Czec...
The Prague family of annotated corpora has a new member, the Czech Academic Corpus version 2.0 (CAC ...
In our paper, we present main results of the Czech grant project Internet as a Language Corpus, whos...
Representative corpus of contemporary written Czech sized 100 MW. It was created as a representation...
Corpus of contemporary written (printed) Czech sized 3.6 GW (i.e. 4.3 billion tokens). It covers mos...
Corpus of contemporary written (printed) Czech sized 3.6 GW (i.e. 4.3 billion tokens). It covers mos...
The Czech Web Corpus 2017 (csTenTen17) is a Czech corpus made up of texts collected from the Interne...
In this paper the lexis of everyday spoken Czech is compared with literary Czech lexis. The data is ...
The presented Czech Named Entity Corpus 1.0 is the first publicly available corpus providing a large...
Corpus of contemporary written (printed) Czech sized 4.7 GW (i.e. 5.7 billion tokens). It covers mos...
The presented Czech Named Entity Corpus 1.0 is the first publicly available corpus providing a large...
The Prague Dependency Treebank 2.0 (PDT 2.0) contains a large amount of Czech texts with complex and...
The Prague Dependency Treebank 2.0 (PDT 2.0) contains a large amount of Czech texts with complex and...
This package comprises eight models of Czech word embeddings trained by applying word2vec (Mikolov e...
BASIC INFORMATION -------------------- Czech Text Document Corpus v 2.0 is a collection of text do...
Czech Named Entity Corpus 2.0 is a corpus of 8993 Czech sentences with manually annotated 35220 Czec...
The Prague family of annotated corpora has a new member, the Czech Academic Corpus version 2.0 (CAC ...
In our paper, we present main results of the Czech grant project Internet as a Language Corpus, whos...
Representative corpus of contemporary written Czech sized 100 MW. It was created as a representation...
Corpus of contemporary written (printed) Czech sized 3.6 GW (i.e. 4.3 billion tokens). It covers mos...
Corpus of contemporary written (printed) Czech sized 3.6 GW (i.e. 4.3 billion tokens). It covers mos...
The Czech Web Corpus 2017 (csTenTen17) is a Czech corpus made up of texts collected from the Interne...
In this paper the lexis of everyday spoken Czech is compared with literary Czech lexis. The data is ...
The presented Czech Named Entity Corpus 1.0 is the first publicly available corpus providing a large...
Corpus of contemporary written (printed) Czech sized 4.7 GW (i.e. 5.7 billion tokens). It covers mos...
The presented Czech Named Entity Corpus 1.0 is the first publicly available corpus providing a large...
The Prague Dependency Treebank 2.0 (PDT 2.0) contains a large amount of Czech texts with complex and...
The Prague Dependency Treebank 2.0 (PDT 2.0) contains a large amount of Czech texts with complex and...
This package comprises eight models of Czech word embeddings trained by applying word2vec (Mikolov e...