The paper describes two interrelated language resources: a database of 13,000 Estonian multi-word verbs (MWV) and a 300,000 word corpus with annotated MWVs. Both resources have been manually post-edited, and are meant to be used by a wide audience, from corpus linguists to language engineers. The paper gives a short overview of the types of MWVs in Estonian, followed by a description of some grammatical features – word order and inflection – of Estonian and their manifestation in the MWVs. The database is a table that has 13,000 rows and 11 columns and contains information about the source (dictionary or corpus) of the MWV, its linguistic category, frequency in the text corpus, and morphological description. The text corpus contains the mor...
Corpus of texts written fully or partly in Estonian, from 13.-19. century; 1,5 million word
The Estonian Collocations Dictionary will be a monolingual online, corpus-driven, scholarly dictiona...
Multi-word expressions are a frequent phenomenon having some specific properties with the result tha...
This paper describes automatic treatment of multi-word expressions in a morphologically complex flec...
The paper describes extraction of Estonian multi-word verbs from text corpora, using a language- and...
The article expounds on some trends witnessed in the use of object cases in Estonian. It is a synchr...
The paper describes a morphological analyser for Estonian and how using a text corpus influenced the...
Recordings of different Estonian dialects, 900000 words, transcribed and partly (400000 words) morph...
This paper discusses the need for a modern Estonian reference grammar for learners and lays the basi...
The paper describes an experiment of finding Estonian multi-word verbs in a text corpus. After descr...
The creation of syntactically annotated corpora of Estonian started at the end of 1990s with the tra...
Traditional Estonian dialect classifications are based on the phonology, morphology, and lexis, and ...
This dataset makes available the sample of clauses used in the study "A corpus study of grammatical ...
The experimental two-level morphology of Estonian is under development at the University of Tartu. T...
This paper introduces our work for adapting a rule based parser of spoken Estonian to the morphologi...
Corpus of texts written fully or partly in Estonian, from 13.-19. century; 1,5 million word
The Estonian Collocations Dictionary will be a monolingual online, corpus-driven, scholarly dictiona...
Multi-word expressions are a frequent phenomenon having some specific properties with the result tha...
This paper describes automatic treatment of multi-word expressions in a morphologically complex flec...
The paper describes extraction of Estonian multi-word verbs from text corpora, using a language- and...
The article expounds on some trends witnessed in the use of object cases in Estonian. It is a synchr...
The paper describes a morphological analyser for Estonian and how using a text corpus influenced the...
Recordings of different Estonian dialects, 900000 words, transcribed and partly (400000 words) morph...
This paper discusses the need for a modern Estonian reference grammar for learners and lays the basi...
The paper describes an experiment of finding Estonian multi-word verbs in a text corpus. After descr...
The creation of syntactically annotated corpora of Estonian started at the end of 1990s with the tra...
Traditional Estonian dialect classifications are based on the phonology, morphology, and lexis, and ...
This dataset makes available the sample of clauses used in the study "A corpus study of grammatical ...
The experimental two-level morphology of Estonian is under development at the University of Tartu. T...
This paper introduces our work for adapting a rule based parser of spoken Estonian to the morphologi...
Corpus of texts written fully or partly in Estonian, from 13.-19. century; 1,5 million word
The Estonian Collocations Dictionary will be a monolingual online, corpus-driven, scholarly dictiona...
Multi-word expressions are a frequent phenomenon having some specific properties with the result tha...