LAGT is a dataset of lemmatized ancient Greek texts, combining works from the Perseus Digital Library and the First 1000 Years of Greek. The scripts used to produce this version of the dataset are available from https://github.com/sdam-au/LAGT. The dataset stores lemmatized sentences as a list of lists: each sublist is one sentence, and each element of a sublist is an individual lemma. Only nouns, proper names, verbs and adjectives are included. Whenever available, the lemmata are based on the GLAUx corpus: https://github.com/perseids-publications/glaux-trees
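The list-of-lists layout described above can be illustrated with a minimal Python sketch. It is an assumption-laden example, not the dataset's own code: the variable name `lemmatized_sentences`, the helper `lemma_counts`, and the sample lemmata are all hypothetical and are not copied from LAGT.

```python
from collections import Counter
from itertools import chain

# Hypothetical sample of one work's lemmatized sentences.
# Outer list = sentences; inner lists = lemmata (nouns, proper names,
# verbs and adjectives only). Illustrative values, not taken from LAGT.
lemmatized_sentences = [
    ["μῆνις", "ἀείδω", "θεά", "Ἀχιλλεύς"],
    ["θεά", "ἀείδω"],
]


def lemma_counts(sentences):
    """Flatten the list of lists and count how often each lemma occurs."""
    return Counter(chain.from_iterable(sentences))


counts = lemma_counts(lemmatized_sentences)

# Number of lemma tokens in the work (sum of sentence lengths).
n_tokens = sum(len(sentence) for sentence in lemmatized_sentences)

# Number of distinct lemmata in the work.
n_types = len(counts)
```

Keeping sentence boundaries in the outer list means the same structure can serve both sentence-level analyses (for example, co-occurrence within a sentence) and work-level frequency counts obtained by flattening.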
The task of corpus-dictionary linkage (CDL) is to annotate each word in a corpus with a link to an a...
Building a lemmatised corpus of German Sign Language (DGS) using iLex; lemmatisation as top-down and...
The Research Project in Greek Lexicology (UCL: Prof. B. Coulie, Dr. B. Kindt) pursues a twin goal: o...
This article presents the result of accuracy tests for currently available Ancient Gree...
English summary: Towards an on-line Software of Concordancing-lemmatising for Ancient Greek. One o...
Contains fulltext: 176616.pdf (publisher's version) (Open Access). DATeCH: Digital...
To facilitate corpus searches by classicists as well as to reduce data sparsity when training models...
The GIST dataset (Greek Inscriptions in Time & Space) represents a comprehensive collection of ancien...
This poster presentation describes the addition of an index of 1,763 Ancient Greek loanwords to the ...
This paper describes the addition of an index of 1,763 Ancient Greek loanwords to the collection of...
Data set contains glossary with lemmas and domain. Lemmas are in Macedonian, while domains are in En...
The Diorisis Ancient Greek Corpus is a digital collection of ancient Greek texts (from Homer to the ...
Lemmatization and Morphological Tagging: their Application to Authorship Attribution. - Traditional...
This paper presents the structure of the LiLa Knowledge Base, i.e. a collection of multifarious lin...