A lemmatiser-tagger must not only lemmatise word-forms consisting of a single lexical element but it must also be able to detect complex units. In this paper we try to delimit linguistically which complex words deserve to be lemmatised as a unit. Then, we propose a formal description for MultiWord Lexical Units (MWLU) in Basque ---resulting of a conscientious analysis of their syntactic and morphological behaviour. Based on that formal description we propose a simple logical formalism to represent those MWLUs so that they can be automatically processed. 1. Introduction A lemmatiser-tagger is a computational tool used for assigning the correct lemma and grammatical category to each token of a corpus. It is a basic device for corpus analysi...
This paper analyses morphological multiword units of inflective or uninflective parts of speech. It ...
International audienceThe morphosyntactic treatment of multi- word units is particularly challenging...
The correct interpretation of Multiword Units (MWUs) is crucial to many applications in Natural Lang...
This paper describes the representation of Basque Multiword Lexical Units and the automatic processi...
Multi-word Lexical Units (MWLU) are of great importance in language in general, and in Natural Langu...
This paper describes the components of a robust and wide-coverage morphological analyser for Basque....
Abstract. The selection of appropriate Lexical Units (LUs) is an important issue in the development ...
EDBL (Euskararen Datu-Base Lexikala) is a general-purpose lexical database used in Basque text-proce...
Tiek lietuvių, tiek ir kitos kalbos yra fraziškos. Tai reiškia, kad kalbėdami renkamės ne atskirus ž...
Tiek lietuvių, tiek ir kitos kalbos yra fraziškos. Tai reiškia, kad kalbėdami renkamės ne atskirus ž...
eISSN 1650-3740This article presents a study of lemmatisation of flexible multiword expressions in L...
This paper describes the work carried out to improve the robustness of the morphological analyser/ge...
This paper presents the methodology followed in the construction of a surfacebased morphosyntactic p...
Collections of Basque proverbs and idioms have been compiled since the 16th century, but it was no...
The selection of appropriate Lexical Units (LUs) is an important issue in the development of Continu...
This paper analyses morphological multiword units of inflective or uninflective parts of speech. It ...
International audienceThe morphosyntactic treatment of multi- word units is particularly challenging...
The correct interpretation of Multiword Units (MWUs) is crucial to many applications in Natural Lang...
This paper describes the representation of Basque Multiword Lexical Units and the automatic processi...
Multi-word Lexical Units (MWLU) are of great importance in language in general, and in Natural Langu...
This paper describes the components of a robust and wide-coverage morphological analyser for Basque....
Abstract. The selection of appropriate Lexical Units (LUs) is an important issue in the development ...
EDBL (Euskararen Datu-Base Lexikala) is a general-purpose lexical database used in Basque text-proce...
Tiek lietuvių, tiek ir kitos kalbos yra fraziškos. Tai reiškia, kad kalbėdami renkamės ne atskirus ž...
Tiek lietuvių, tiek ir kitos kalbos yra fraziškos. Tai reiškia, kad kalbėdami renkamės ne atskirus ž...
eISSN 1650-3740This article presents a study of lemmatisation of flexible multiword expressions in L...
This paper describes the work carried out to improve the robustness of the morphological analyser/ge...
This paper presents the methodology followed in the construction of a surfacebased morphosyntactic p...
Collections of Basque proverbs and idioms have been compiled since the 16th century, but it was no...
The selection of appropriate Lexical Units (LUs) is an important issue in the development of Continu...
This paper analyses morphological multiword units of inflective or uninflective parts of speech. It ...
International audienceThe morphosyntactic treatment of multi- word units is particularly challenging...
The correct interpretation of Multiword Units (MWUs) is crucial to many applications in Natural Lang...