The paper presents the quite long-standing tradition of Romanian corpus acquisition and processing, which reaches its peak with the reference corpus of contemporary Romanian language (CoRoLa). The paper describes decisions behind the kinds of texts collected, as well as processing and annotation steps, highlighting the structure and importance of metadata to the corpus. The reader is also introduced to the three ways in which (s)he can plunge into the rich linguistic data of the corpus, waiting to be discovered. Besides querying the corpus, word embeddings extracted from it are useful to various natural language processing applications and for linguists, when user-friendly interfaces offer them the possibility to exploit the data
The article briefly reviews bilingual Slovak-Bulgarian/Bulgarian-Slovak parallel and aligned corpus....
The paper relates about our ongoing work on the creation of a corpus of Bulgarian and Ukrainian para...
Contemporary information technologies and mathematical modelling has made creating corpora of natura...
This article reports on the on-going CoRoLa project, aiming at creating a reference corpus of contem...
This paper presents the almost final results of a priority project of the Romanian Academy – the Cor...
The present paper examines a variety of ways in which the Corpus of Contemporary Romanian Language (...
DRuKoLA, the accompanying project in the making of the Corpus of Romanian Language, is a cooperation...
The corpus contains academic papers from eight disciplines, written by the Romanian students in nati...
This paper presents the project “The first Romanian bilingual dictionaries (17th century). Digitally...
In its first part, the lecture outlines the challenges of corpus building for the Romanian ELTeC col...
The extraordinary growth of computer applications, particularly over the last two decades, has enabl...
The paper discusses several key concepts related to the development of corpora and reconsiders them ...
International audienceIn this paper we describe our work on a treebank for Serbian, which aims to pr...
Proceedings of the First PhD Symposium on Sustainable Ultrascale Computing Systems (NESUS PhD 2016)...
The wordnet-style lexicography has imposed itself as one of the most representative and useful metho...
The article briefly reviews bilingual Slovak-Bulgarian/Bulgarian-Slovak parallel and aligned corpus....
The paper relates about our ongoing work on the creation of a corpus of Bulgarian and Ukrainian para...
Contemporary information technologies and mathematical modelling has made creating corpora of natura...
This article reports on the on-going CoRoLa project, aiming at creating a reference corpus of contem...
This paper presents the almost final results of a priority project of the Romanian Academy – the Cor...
The present paper examines a variety of ways in which the Corpus of Contemporary Romanian Language (...
DRuKoLA, the accompanying project in the making of the Corpus of Romanian Language, is a cooperation...
The corpus contains academic papers from eight disciplines, written by the Romanian students in nati...
This paper presents the project “The first Romanian bilingual dictionaries (17th century). Digitally...
In its first part, the lecture outlines the challenges of corpus building for the Romanian ELTeC col...
The extraordinary growth of computer applications, particularly over the last two decades, has enabl...
The paper discusses several key concepts related to the development of corpora and reconsiders them ...
International audienceIn this paper we describe our work on a treebank for Serbian, which aims to pr...
Proceedings of the First PhD Symposium on Sustainable Ultrascale Computing Systems (NESUS PhD 2016)...
The wordnet-style lexicography has imposed itself as one of the most representative and useful metho...
The article briefly reviews bilingual Slovak-Bulgarian/Bulgarian-Slovak parallel and aligned corpus....
The paper relates about our ongoing work on the creation of a corpus of Bulgarian and Ukrainian para...
Contemporary information technologies and mathematical modelling has made creating corpora of natura...