Morph length is one of the indicative feature that helps learning the morphology of languages, in particular agglutinative languages. In this paper, we introduce a simple unsupervised model for morphological segmentation and study how the knowledge of morph length affect the performance of the segmentation task under the Bayesian framework. The model is based on (Goldwater et al., 2006) unigram word segmentation model and assumes a simple prior distribution over morph length. We experiment this model on two highly related and agglutinative languages namely Tamil and Telugu, and compare our results with the state of the art Morfessor system. We show that, knowledge of morph length has a positive impact and provides competitive results in ter...
<p>We present a morphology-aware nonparametric Bayesian model of language whose prior distribution u...
Many Uralic languages have a rich morphological structure, but lack morphological analysis tools nee...
This article presents an unsupervised morphological analysis algorithm to segment words into roots a...
We present a language-independent and unsupervised algorithm for the segmenta-tion of words into mor...
In this paper we describe a method to morphologically segment highly agglutinating and inflectional ...
We present two methods for unsupervised segmentation of words into morpheme-like units. The model ...
We present two methods for unsupervised segmentation of words into morpheme-like units. The model ut...
In this paper, we introduce a trie-structured Bayesian model for unsupervised morphological segmenta...
This thesis work introduces an approach to unsupervised learning of morphological structure of human...
In this paper an automatic morphology learning system for complex and agglutinative languages is pre...
In this paper a methodology for learning the complex agglutinative morphology of some Indian languag...
Unsupervised learning of morphology is used for automatic affix identification, morphological segmen...
Many Uralic languages have a rich morphological structure, but lack tools of morphological analysis ...
How can infants detect where words or morphemes start and end in the continuous stream of speech? Pr...
This work presents an algorithm for the unsupervised learning, or induction, of a simple morphology ...
<p>We present a morphology-aware nonparametric Bayesian model of language whose prior distribution u...
Many Uralic languages have a rich morphological structure, but lack morphological analysis tools nee...
This article presents an unsupervised morphological analysis algorithm to segment words into roots a...
We present a language-independent and unsupervised algorithm for the segmenta-tion of words into mor...
In this paper we describe a method to morphologically segment highly agglutinating and inflectional ...
We present two methods for unsupervised segmentation of words into morpheme-like units. The model ...
We present two methods for unsupervised segmentation of words into morpheme-like units. The model ut...
In this paper, we introduce a trie-structured Bayesian model for unsupervised morphological segmenta...
This thesis work introduces an approach to unsupervised learning of morphological structure of human...
In this paper an automatic morphology learning system for complex and agglutinative languages is pre...
In this paper a methodology for learning the complex agglutinative morphology of some Indian languag...
Unsupervised learning of morphology is used for automatic affix identification, morphological segmen...
Many Uralic languages have a rich morphological structure, but lack tools of morphological analysis ...
How can infants detect where words or morphemes start and end in the continuous stream of speech? Pr...
This work presents an algorithm for the unsupervised learning, or induction, of a simple morphology ...
<p>We present a morphology-aware nonparametric Bayesian model of language whose prior distribution u...
Many Uralic languages have a rich morphological structure, but lack morphological analysis tools nee...
This article presents an unsupervised morphological analysis algorithm to segment words into roots a...