A core issue that hampers the development and use of language technology for under-resourced and morphologically rich languages is data sparsity. In this work, we consider unsupervised morphological analysis and lemmatization, two linguistically motivated ways to combat problems with sparse data. Morphological analysis aims to represent words in terms of morphemes, the smallest meaningful units of language (e.g., acid +ify +ed), while lemmatization concerns relationships among individual word forms (e.g., walks, walking and walked are all different forms of the lexeme walk). In this thesis, we focus on morphology learning in low-resource scenarios: we propose algorithms and methods that learn unsupervised morphological analysis and lemmatizatio...
We propose to cast the task of morphological inflection—mapping a lemma to an indicated inflected fo...
Computational morphology is a core component in many different types of natural language processing,...
We present a novel way of generating unseen words, which is useful for certain applications such a...
This article surveys resource-light monolingual approaches to morphological analysis and tagging. Wh...
This article surveys work on Unsupervised Learning of Morphology. We define Unsupervised Learning of...
The morphology of a language is a knowledge of the ways in which the language’s words can change in ...
Many Uralic languages have a rich morphological structure, but lack tools of morphological analysis ...
Morfette is a modular, data-driven, probabilistic system which learns to perform joint morphological...
Morphological analysis is an important subtask in text-to-speech conversion, hyphenation, and other ...
Many Uralic languages have a rich morphological structure, but lack morphological analysis tools nee...
We show how to express the problem of finding an optimal morpheme segmentation from a set of labelle...
The development of rich, multi-lingual corpora is essential for enabling new types of large-scale in...
Morphological analysis is used to study the internal structure of words by reducing the number of vocab...
We present a novel method of statistical morphological generation, i.e. the prediction of inflecte...
Morphological analysis provides a decomposition of words into smaller constituents. It is an importa...