For languages with fast vocabulary growth and limited resources, data sparsity leads to challenges in training a language model. One strategy for addressing this problem is to leverage morphological structure as features in the model. This paper explores different uses of unsupervised morphological features in both the history and prediction space for three word-based exponential models (maximum entropy, logbilinear, and recurrent neural net (RNN)). Multi-task training is introduced as a regularizing mechanism to improve performance in the continuous-space approaches. The models are compared to non-parametric baselines. From using the RNN with morphological features and multi-task learning, experiments with conversational speech from four l...
We introduce an exponential language model which mod-els a whole sentence or utterance as a single u...
Maximum entropy models are considered by many to be one of the most promising avenues of language mo...
Virtually any modern speech recognition system relies on count-based language models. In this thesis...
For languages with fast vocabulary growth and limited resources, data sparsity leads to challenges i...
Neural architectures are prominent in the construction of language models (LMs). However, word-leve...
Abstract Models of morphologically rich languages suffer from data sparsity when words are treated a...
In recent years neural language models (LMs) have set state-of-the-art performance for several bench...
Determining optimal units of representing morphologically complex words in the mental lexicon is a c...
Determining optimal units of representing morphologically complex words in the mental lexicon is a c...
This thesis focuses on unsupervised morphological seg- mentation, the fundamental task in NLP which ...
Determining optimal units of representing morphologically complex words in the mental lexicon is a c...
We present a new morphological analy-sis model that considers semantic plausi-bility of word sequenc...
Recent research has shown promise in multilingual modeling, demonstrating how a single model is capa...
This thesis focuses on unsupervised morphological seg- mentation, the fundamental task in NLP which ...
In this paper we present a survey on the application of recurrent neural networks to the task of sta...
We introduce an exponential language model which mod-els a whole sentence or utterance as a single u...
Maximum entropy models are considered by many to be one of the most promising avenues of language mo...
Virtually any modern speech recognition system relies on count-based language models. In this thesis...
For languages with fast vocabulary growth and limited resources, data sparsity leads to challenges i...
Neural architectures are prominent in the construction of language models (LMs). However, word-leve...
Abstract Models of morphologically rich languages suffer from data sparsity when words are treated a...
In recent years neural language models (LMs) have set state-of-the-art performance for several bench...
Determining optimal units of representing morphologically complex words in the mental lexicon is a c...
Determining optimal units of representing morphologically complex words in the mental lexicon is a c...
This thesis focuses on unsupervised morphological seg- mentation, the fundamental task in NLP which ...
Determining optimal units of representing morphologically complex words in the mental lexicon is a c...
We present a new morphological analy-sis model that considers semantic plausi-bility of word sequenc...
Recent research has shown promise in multilingual modeling, demonstrating how a single model is capa...
This thesis focuses on unsupervised morphological seg- mentation, the fundamental task in NLP which ...
In this paper we present a survey on the application of recurrent neural networks to the task of sta...
We introduce an exponential language model which mod-els a whole sentence or utterance as a single u...
Maximum entropy models are considered by many to be one of the most promising avenues of language mo...
Virtually any modern speech recognition system relies on count-based language models. In this thesis...