A Multi-Word Expression (MWE) is a sequence of two or more words that functions as a single unit at some level of linguistic analysis, e.g. syntactic or morphological. Identifying MWEs is one of the most challenging problems in NLP. Many techniques address this problem; however, not all of them can be transferred to Lithuanian and Latvian because of their rich morphology. At this stage, we use a raw corpus (LT and LV, 9 million words for each language) and a combination of lexical association measures (LAMs) and supervised machine learning (ML) to look for bi-gram MWEs. EuroVoc, the Multilingual Thesaurus of the European Union, is used to evaluate MWE candidates. The candidate MWE bi-grams were extracted from raw text and 5 LAMs (Maximum Likelihood Estimation,...
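As a rough illustration of scoring bi-gram candidates with lexical association measures, the following minimal sketch computes two scores per adjacent word pair. Only Maximum Likelihood Estimation is named in the abstract; pointwise mutual information (PMI) is added here purely as an assumed second example and may not be among the five measures the authors used. The function name `bigram_scores` and the toy input are hypothetical.

```python
from collections import Counter
import math


def bigram_scores(tokens):
    """Return {(w1, w2): (mle, pmi)} for every adjacent token pair.

    mle: relative frequency of the bi-gram among all bi-grams.
    pmi: log2 of the bi-gram probability over the product of the
         unigram probabilities (an assumed extra measure, not taken
         from the abstract).
    """
    unigrams = Counter(tokens)
    bigrams = Counter(zip(tokens, tokens[1:]))
    n_tokens = len(tokens)
    n_bigrams = max(n_tokens - 1, 1)  # avoid division by zero on tiny input

    scores = {}
    for (w1, w2), count in bigrams.items():
        mle = count / n_bigrams
        p1 = unigrams[w1] / n_tokens
        p2 = unigrams[w2] / n_tokens
        pmi = math.log2(mle / (p1 * p2))
        scores[(w1, w2)] = (mle, pmi)
    return scores


if __name__ == "__main__":
    # Toy input; a real run would use the tokenised raw corpus.
    toy = "europos sajunga ir europos sajunga".split()
    for pair, (mle, pmi) in sorted(bigram_scores(toy).items()):
        print(pair, round(mle, 3), round(pmi, 3))
```

In a setup like the one described, such scores would typically serve as features for the supervised classifier, with the EuroVoc term list supplying the positive labels; that wiring is not shown here.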
The research subject of this doctoral thesis is methods of automatic text analysis, covering all natural...
Abstract. This paper describes our research on statistical language modeling of Lithuanian. The idea...
Treatment of Multiword Expressions (MWEs) is one of the most complicated issues in natural language ...
This research was funded by a grant (No. LIP-027/2016) from the Research Council of Lithuania. We dis...
ISSN: 1819-9224 (online version). Manuscript received October 10, 2017. This research was partly fun...
Book ISBN 978-1-61499-701-6 (online). Identification of Multiword Expressions (MWE) is one of the mo...
ISSN: 2078-0958 (Print); ISSN: 2078-0966 (Online). We discuss an experiment on automatic identificatio...
Identification of Multiword Expressions is an important problem in Natural Language Processing, espe...
eISSN 2071-2987. This research was funded by the Research Council of Lithuania (No. LIP-027/2016). ...
We describe an approach for morphological analysis combining a rule-based word level morphological a...
eISSN 1650-3740. This article presents a study of lemmatisation of flexible multiword expressions in L...
As the development of information technologies makes progress, large morphologically annotated corpo...
This paper describes our research on statistical language modeling of Lithuanian. The idea of improv...
This article analyses two-word fixed expressions that are given in two or more form...
This paper reports on a specific problem of automatic terminology extraction in Lithuanian – base fo...