This release contains Patrick Burns's ( @diyclassics ) statistical lemmatizer (Greek and Latin). Docs: http://docs.cltk.org/en/latest/latin.html#lemmatization-backoff-method About: cltk/lemmatize/readme.md:
elexiko is an online information system ("dictionary") on contemporary German language (mainly post ...
Abstract: Lemmatisation is the process of finding the normalised forms of words appearing in text. I...
LeQua 2022 is a new lab for the evaluation of methods for “learning to quantify” in textual datasets...
Abstract This article presents the result of accuracy tests for currently available Ancient Gree...
1) Fully automatic rule based lemmatization of inflected languages 2) Fully automatic training of le...
1) Fully automatic rule based lemmatization of inflected languages 2) Fully automatic training of le...
Contains fulltext : 176616.pdf (publisher's version ) (Open Access)DATeCH: Digital...
LAGT is s a dataset of lemmatized ancient Greek texts, combining works from Perseus Digital Library ...
The model for lemmatisation of standard Macedonian was built with the CLASSLA-StanfordNLP tool (http...
Including @diyclassics 's LineTokenizer ( #530 ) and lots of small additions done in the lead-up to ...
Data set contains glossary with lemmas and domain. Lemmas are in Macedonian, while domains are in En...
English summary: Towards an on-line Software of Concordancing-lemmatising for Ancient Greek. One o...
The model for lemmatisation of standard Croatian was built with the CLASSLA-StanfordNLP tool (https:...
breaking change: language data pre-loading now occurs internally, language codes are now directly pr...
The task of corpus-dictionary linkage (CDL) is to annotate each word in a corpus with a link to an a...
elexiko is an online information system ("dictionary") on contemporary German language (mainly post ...
Abstract: Lemmatisation is the process of finding the normalised forms of words appearing in text. I...
LeQua 2022 is a new lab for the evaluation of methods for “learning to quantify” in textual datasets...
Abstract This article presents the result of accuracy tests for currently available Ancient Gree...
1) Fully automatic rule based lemmatization of inflected languages 2) Fully automatic training of le...
1) Fully automatic rule based lemmatization of inflected languages 2) Fully automatic training of le...
Contains fulltext : 176616.pdf (publisher's version ) (Open Access)DATeCH: Digital...
LAGT is s a dataset of lemmatized ancient Greek texts, combining works from Perseus Digital Library ...
The model for lemmatisation of standard Macedonian was built with the CLASSLA-StanfordNLP tool (http...
Including @diyclassics 's LineTokenizer ( #530 ) and lots of small additions done in the lead-up to ...
Data set contains glossary with lemmas and domain. Lemmas are in Macedonian, while domains are in En...
English summary: Towards an on-line Software of Concordancing-lemmatising for Ancient Greek. One o...
The model for lemmatisation of standard Croatian was built with the CLASSLA-StanfordNLP tool (https:...
breaking change: language data pre-loading now occurs internally, language codes are now directly pr...
The task of corpus-dictionary linkage (CDL) is to annotate each word in a corpus with a link to an a...
elexiko is an online information system ("dictionary") on contemporary German language (mainly post ...
Abstract: Lemmatisation is the process of finding the normalised forms of words appearing in text. I...
LeQua 2022 is a new lab for the evaluation of methods for “learning to quantify” in textual datasets...