This frequency list of words was prepared by extracting words (i.e. lemmas with their lexical features) from the Trendi Monitor Corpus of Slovene (http://hdl.handle.net/11356/1590) covering the period between 1 January 2020 and 31 December 2020 using the LIST corpus extraction tool (http://hdl.handle.net/11356/1227). The Trendi frequency list was then compared to the frequency list of words from the Gigafida 2.0 Corpus of Slovene (http://hdl.handle.net/11356/1320), which covers the period between 1991 and 2018, and the frequency list of words from Trendi for 2019. The words were compared using the simple maths formula implemented by SketchEngine (see https://www.sketchengine.eu/documentation/simple-maths/). The final list contains lemmas...
The Trendi corpus is a monitor corpus of Slovene. It contains news from 106 different media websites...
Frequency lists of character-level n-grams were extracted from the Gigafida 2.0 Corpus of Written St...
Frequency lists of word-level n-grams (or word sets) were extracted from the GOS 1.0 Corpus of Spoke...
This frequency list of words was prepared by extracting words (i.e. lemmas with their lexical featur...
This frequency list of words was prepared by extracting words (i.e. lemmas with their lexical featur...
The frequency list of words by source was prepared in the following manner: words (i.e. lemmas with ...
Frequency lists of word-level n-grams (or word sets) were extracted from the Trendi Monitor Corpus o...
Frequency lists of word-level n-grams (or word sets) were extracted from the Trendi Monitor Corpus o...
Frequency lists of word-level n-grams (or word sets) were extracted from the Trendi Monitor Corpus o...
Frequency lists of words were extracted from the Gigafida 2.0 Corpus of Written Standard Slovene (ht...
Frequency lists of words were extracted from the GOS 1.0 Corpus of Spoken Slovene (http://hdl.handle...
Frequency lists of words were extracted from the GOS 1.0 Corpus of Spoken Slovene (http://hdl.handle...
Frequency lists of words split into word parts were extracted from the GOS 1.0 Corpus of Spoken Slov...
Frequency lists of word-level n-grams (or word sets) were extracted from the Gigafida 2.0 Corpus of ...
The Trendi corpus is a monitor corpus of Slovene. It contains news from 107 different media websites...
The Trendi corpus is a monitor corpus of Slovene. It contains news from 106 different media websites...
Frequency lists of character-level n-grams were extracted from the Gigafida 2.0 Corpus of Written St...
Frequency lists of word-level n-grams (or word sets) were extracted from the GOS 1.0 Corpus of Spoke...
This frequency list of words was prepared by extracting words (i.e. lemmas with their lexical featur...
This frequency list of words was prepared by extracting words (i.e. lemmas with their lexical featur...
The frequency list of words by source was prepared in the following manner: words (i.e. lemmas with ...
Frequency lists of word-level n-grams (or word sets) were extracted from the Trendi Monitor Corpus o...
Frequency lists of word-level n-grams (or word sets) were extracted from the Trendi Monitor Corpus o...
Frequency lists of word-level n-grams (or word sets) were extracted from the Trendi Monitor Corpus o...
Frequency lists of words were extracted from the Gigafida 2.0 Corpus of Written Standard Slovene (ht...
Frequency lists of words were extracted from the GOS 1.0 Corpus of Spoken Slovene (http://hdl.handle...
Frequency lists of words were extracted from the GOS 1.0 Corpus of Spoken Slovene (http://hdl.handle...
Frequency lists of words split into word parts were extracted from the GOS 1.0 Corpus of Spoken Slov...
Frequency lists of word-level n-grams (or word sets) were extracted from the Gigafida 2.0 Corpus of ...
The Trendi corpus is a monitor corpus of Slovene. It contains news from 107 different media websites...
The Trendi corpus is a monitor corpus of Slovene. It contains news from 106 different media websites...
Frequency lists of character-level n-grams were extracted from the Gigafida 2.0 Corpus of Written St...
Frequency lists of word-level n-grams (or word sets) were extracted from the GOS 1.0 Corpus of Spoke...