A comprehensive corpus of news articles on the topic of language, published in major Slovenian daily newspapers and news portals in the five-year period of January 1, 2015 - January 1, 2020. The corpus is designed to facilitate research on metalanguage (‘language about language’), linguistic ideologies, language policy and planning, as well as the specific contemporary debates on language defining, naming, and standardisation, ongoing in post-Yugoslav societies. The corpus has been tagged using the CLASSLA-StanfordNLP models for morphosyntactic annotation and lemmatisation of standard Slovenian. The corpus is available in plain text version, XML with full metadata, and tagged CONLL-U format. MetaLangNEWS-Sl is complemented with a separate...
The SlovParl corpus contains minutes of the Assembly of the Republic of Slovenia for the legislative...
The Trendi corpus is a monitor corpus of Slovene. It contains news from 106 different media websites...
In this paper we present Trendi, a monitor corpus of written Slovene, which has been compiled recent...
A comprehensive corpus of news articles on the topic of language, published in major Serbian daily n...
A comprehensive corpus of user comments on online news articles on the topic of language from major ...
A comprehensive corpus of news articles on the topic of language, published in major Macedonian dail...
A comprehensive corpus of user comments on online news articles on the topic of language from major ...
A comprehensive corpus of news articles on the topic of language, published in major Montenegrin dai...
A comprehensive corpus of news articles on the topic of language, published in major daily newspaper...
A comprehensive corpus of user comments on online news articles on the topic of language from major ...
A comprehensive corpus of user comments on online news articles on the topic of language from major ...
Growing interest in meta-language, in linguistics and other disciplines, has highlighted a gap in me...
The Trendi corpus is a monitor corpus of Slovene. It contains news from 107 different media websites...
In the last decade, corpus linguistics has finally established itself as a separate research startin...
The siParl corpus contains minutes of the Assembly of the Republic of Slovenia for 11th legislative ...
The SlovParl corpus contains minutes of the Assembly of the Republic of Slovenia for the legislative...
The Trendi corpus is a monitor corpus of Slovene. It contains news from 106 different media websites...
In this paper we present Trendi, a monitor corpus of written Slovene, which has been compiled recent...
A comprehensive corpus of news articles on the topic of language, published in major Serbian daily n...
A comprehensive corpus of user comments on online news articles on the topic of language from major ...
A comprehensive corpus of news articles on the topic of language, published in major Macedonian dail...
A comprehensive corpus of user comments on online news articles on the topic of language from major ...
A comprehensive corpus of news articles on the topic of language, published in major Montenegrin dai...
A comprehensive corpus of news articles on the topic of language, published in major daily newspaper...
A comprehensive corpus of user comments on online news articles on the topic of language from major ...
A comprehensive corpus of user comments on online news articles on the topic of language from major ...
Growing interest in meta-language, in linguistics and other disciplines, has highlighted a gap in me...
The Trendi corpus is a monitor corpus of Slovene. It contains news from 107 different media websites...
In the last decade, corpus linguistics has finally established itself as a separate research startin...
The siParl corpus contains minutes of the Assembly of the Republic of Slovenia for 11th legislative ...
The SlovParl corpus contains minutes of the Assembly of the Republic of Slovenia for the legislative...
The Trendi corpus is a monitor corpus of Slovene. It contains news from 106 different media websites...
In this paper we present Trendi, a monitor corpus of written Slovene, which has been compiled recent...