Corpus of informal spoken Czech sized 1 MW. It contains transcriptions of 221 recordings made in 2002–2006 in the whole of Bohemia. All the recordings were made in informal situations to ensure prototypically spontaneous spoken language. This means private environment, physical presence of speakers who know each other, unscripted speech and topic not given in advance. The total number of speakers is 754, the metadata include sociolinguistic information about them. The corpus is provided in a (semi-XML) vertical format used as an input to the Manatee query engine. The data thus exactly correspond to the corpus available via query interface to registered users of the CNC
This paper presents the final version of the Czech Broadcast Conversation Corpus that will shortly b...
Corpus of contemporary written (printed) Czech sized 3.6 GW (i.e. 4.3 billion tokens). It covers mos...
Corpus of contemporary written (printed) Czech sized 3.6 GW (i.e. 4.3 billion tokens). It covers mos...
Corpus of informal spoken Czech sized 1 MW. It contains transcriptions of 221 recordings made in 200...
Balanced corpus of informal spoken Czech sized 1 MW. It contains transcriptions of 297 recordings ma...
ORAL2013 is designed as a representation of authentic spoken Czech used in informal situations (priv...
ORAL2013 is designed as a representation of authentic spoken Czech used in informal situations (priv...
ORAL2013 is designed as a representation of authentic spoken Czech used in informal situations (priv...
ORAL2013 is designed as a representation of authentic spoken Czech used in informal situations (priv...
This article introduces a new speech corpus, the Nijmegen Corpus of Casual Czech (NCCCz), which cont...
This article introduces a new speech corpus, the Nijmegen Corpus of Casual Czech (NCCCz), which cont...
The paper presents a corpus of spontaneous spoken Czech called ORAL2013, its design principles and p...
ORTOFON v1 is designed as a representation of authentic spoken Czech used in informal situations (pr...
The corpus contains speech data of 2 Czech native speakers, male and female. The speech is very prec...
The Prague Dependency Treebank of Spoken Czech 2.0 (PDTSC 2.0) is a corpus of spoken language, consi...
This paper presents the final version of the Czech Broadcast Conversation Corpus that will shortly b...
Corpus of contemporary written (printed) Czech sized 3.6 GW (i.e. 4.3 billion tokens). It covers mos...
Corpus of contemporary written (printed) Czech sized 3.6 GW (i.e. 4.3 billion tokens). It covers mos...
Corpus of informal spoken Czech sized 1 MW. It contains transcriptions of 221 recordings made in 200...
Balanced corpus of informal spoken Czech sized 1 MW. It contains transcriptions of 297 recordings ma...
ORAL2013 is designed as a representation of authentic spoken Czech used in informal situations (priv...
ORAL2013 is designed as a representation of authentic spoken Czech used in informal situations (priv...
ORAL2013 is designed as a representation of authentic spoken Czech used in informal situations (priv...
ORAL2013 is designed as a representation of authentic spoken Czech used in informal situations (priv...
This article introduces a new speech corpus, the Nijmegen Corpus of Casual Czech (NCCCz), which cont...
This article introduces a new speech corpus, the Nijmegen Corpus of Casual Czech (NCCCz), which cont...
The paper presents a corpus of spontaneous spoken Czech called ORAL2013, its design principles and p...
ORTOFON v1 is designed as a representation of authentic spoken Czech used in informal situations (pr...
The corpus contains speech data of 2 Czech native speakers, male and female. The speech is very prec...
The Prague Dependency Treebank of Spoken Czech 2.0 (PDTSC 2.0) is a corpus of spoken language, consi...
This paper presents the final version of the Czech Broadcast Conversation Corpus that will shortly b...
Corpus of contemporary written (printed) Czech sized 3.6 GW (i.e. 4.3 billion tokens). It covers mos...
Corpus of contemporary written (printed) Czech sized 3.6 GW (i.e. 4.3 billion tokens). It covers mos...