The observed pronunciations or spellings of words are often explained as arising from the “underlying forms” of their mor- phemes. These forms are latent strings that linguists try to reconstruct by hand. We propose to reconstruct them automatically at scale, enabling generalization to new words. Given some surface word types of a concatenative language along with the abstract morpheme sequences that they ex- press, we show how to recover consistent underlying forms for these morphemes, together with the (stochastic) phonology that maps each concatenation of underly- ing forms to a surface form. Our technique involves loopy belief propagation in a nat- ural directed graphical model whose vari- ables are unknown strings and whose con- dition...
This paper examines implications for morpho-phonology of a model that minimizes the role of an innat...
Traditional phonology presupposes abstract underlying representations (UR) and a set of rules to exp...
The field of statistical natural language processing has been turning toward morpholog-ically rich l...
A language independent model for recognition and production of word forms is presented. This "two-le...
We present a probabilistic model of phonotactics, the set of well-formed phoneme sequences in a lang...
This paper describes a system for the unsupervised learning of morpho-logical suffixes and stems fro...
Translation into morphologically-rich languages challenges neural machine translation (NMT) models w...
This paper presents an algorithm for the unsuper-vised learning of a simple morphology of a nat-ural...
Neural architectures are prominent in the construction of language models (LMs). However, word-leve...
In this paper we propose a computational model for the representation and processing of morpho-phono...
We address the problem of predicting a word from previous words in a sample of text. In particular, ...
The techniques of using neural networks to learn distributed word representations (i.e., word embedd...
We present a generative model of phono-tactics, the set of well-formed phoneme sequences in a langua...
This work presents an algorithm for the unsupervised learning, or induction, of a simple morphology ...
Zellig Harris proposed a method for grouping phonemes in an utterance into morphemes by simply usin...
This paper examines implications for morpho-phonology of a model that minimizes the role of an innat...
Traditional phonology presupposes abstract underlying representations (UR) and a set of rules to exp...
The field of statistical natural language processing has been turning toward morpholog-ically rich l...
A language independent model for recognition and production of word forms is presented. This "two-le...
We present a probabilistic model of phonotactics, the set of well-formed phoneme sequences in a lang...
This paper describes a system for the unsupervised learning of morpho-logical suffixes and stems fro...
Translation into morphologically-rich languages challenges neural machine translation (NMT) models w...
This paper presents an algorithm for the unsuper-vised learning of a simple morphology of a nat-ural...
Neural architectures are prominent in the construction of language models (LMs). However, word-leve...
In this paper we propose a computational model for the representation and processing of morpho-phono...
We address the problem of predicting a word from previous words in a sample of text. In particular, ...
The techniques of using neural networks to learn distributed word representations (i.e., word embedd...
We present a generative model of phono-tactics, the set of well-formed phoneme sequences in a langua...
This work presents an algorithm for the unsupervised learning, or induction, of a simple morphology ...
Zellig Harris proposed a method for grouping phonemes in an utterance into morphemes by simply usin...
This paper examines implications for morpho-phonology of a model that minimizes the role of an innat...
Traditional phonology presupposes abstract underlying representations (UR) and a set of rules to exp...
The field of statistical natural language processing has been turning toward morpholog-ically rich l...