Frequency lists of character-level n-grams were extracted from the Gigafida 2.0 Corpus of Written Standard Slovene (https://viri.cjvt.si/gigafida/) using the LIST corpus extraction tool (http://hdl.handle.net/11356/1227). The lists contain 1-5-gram combinations of characters occurring in the corpus along with their absolute and relative frequencies, percentages, and distribution across the text-types included in the corpus taxonomy. Character-level n-grams were extracted from lemmas (5 files) and lower-case word forms (5 files)
Frequency lists of words were extracted from the GOS 1.0 Corpus of Spoken Slovene (http://hdl.handle...
A collection of n-grams extracted from the IMP corpus of historical Slovene (cf. http://nl.ijs.si/im...
A collection of n-grams extracted from the Gos corpus of spoken Slovene (cf. http://eng.slovenscina....
Frequency lists of character-level n-grams were extracted from the GOS 1.0 Corpus of Spoken Slovene ...
Frequency lists of word-level n-grams (or word sets) were extracted from the Gigafida 2.0 Corpus of ...
Frequency lists of character-level n-grams were extracted from the GOS 1.0 Corpus of Spoken Slovene ...
Frequency lists of word-level n-grams (or word sets) were extracted from the Trendi Monitor Corpus o...
Frequency lists of word-level n-grams (or word sets) were extracted from the GOS 1.0 Corpus of Spoke...
Frequency lists of words were extracted from the Gigafida 2.0 Corpus of Written Standard Slovene (ht...
Frequency lists of word-level n-grams (or word sets) were extracted from the GOS 1.0 Corpus of Spoke...
Frequency lists of word-level n-grams (or word sets) were extracted from the Trendi Monitor Corpus o...
Frequency lists of word-level n-grams (or word sets) were extracted from the Trendi Monitor Corpus o...
Frequency lists of words were extracted from the GOS 1.0 Corpus of Spoken Slovene (http://hdl.handle...
A collection of n-grams extracted from the Kres corpus of written Slovene (cf. http://eng.slovenscin...
Frequency lists of words split into word parts were extracted from the GOS 1.0 Corpus of Spoken Slov...
Frequency lists of words were extracted from the GOS 1.0 Corpus of Spoken Slovene (http://hdl.handle...
A collection of n-grams extracted from the IMP corpus of historical Slovene (cf. http://nl.ijs.si/im...
A collection of n-grams extracted from the Gos corpus of spoken Slovene (cf. http://eng.slovenscina....
Frequency lists of character-level n-grams were extracted from the GOS 1.0 Corpus of Spoken Slovene ...
Frequency lists of word-level n-grams (or word sets) were extracted from the Gigafida 2.0 Corpus of ...
Frequency lists of character-level n-grams were extracted from the GOS 1.0 Corpus of Spoken Slovene ...
Frequency lists of word-level n-grams (or word sets) were extracted from the Trendi Monitor Corpus o...
Frequency lists of word-level n-grams (or word sets) were extracted from the GOS 1.0 Corpus of Spoke...
Frequency lists of words were extracted from the Gigafida 2.0 Corpus of Written Standard Slovene (ht...
Frequency lists of word-level n-grams (or word sets) were extracted from the GOS 1.0 Corpus of Spoke...
Frequency lists of word-level n-grams (or word sets) were extracted from the Trendi Monitor Corpus o...
Frequency lists of word-level n-grams (or word sets) were extracted from the Trendi Monitor Corpus o...
Frequency lists of words were extracted from the GOS 1.0 Corpus of Spoken Slovene (http://hdl.handle...
A collection of n-grams extracted from the Kres corpus of written Slovene (cf. http://eng.slovenscin...
Frequency lists of words split into word parts were extracted from the GOS 1.0 Corpus of Spoken Slov...
Frequency lists of words were extracted from the GOS 1.0 Corpus of Spoken Slovene (http://hdl.handle...
A collection of n-grams extracted from the IMP corpus of historical Slovene (cf. http://nl.ijs.si/im...
A collection of n-grams extracted from the Gos corpus of spoken Slovene (cf. http://eng.slovenscina....