This paper presents an extensive empirical study on two language modeling techniques, linguistically-motivated word skipping and predictive clustering, both of which are used in capturing long distance word dependencies that are beyond the scope of a word trigram model. We compare the techniques to others that were proposed previously for the same purpose. We evaluate the resulting models on the task of Japanese Kana-Kanji conversion. We show that the two techniques, while simple, outperform existing methods studied in this paper, and lead to language models that perform significantly better than a word trigram model. We also investigate how factors such as training corpus size and genre affect the performance of the models.
A new language model is presented which incorporates local N-gram dependencies with two important so...
Language models (LMs) are essential components of many applications such as speech recognition or ma...
This paper deals with the issue of language model selection based on the analysis of data collection...
This paper presents several practical ways of incorporating linguistic structure into language model...
This thesis investigates an approach to exploiting the long context based on the information about t...
Abstract In studies of human cognition, Bayesian models are increasingly popular tools for understan...
This paper presents an empirical study on four techniques of language model adaptation, including a ...
This thesis investigates the various syntactic sources of difficulty or ease in processing Japanese ...
This paper presents a hybrid model for han-dling out-of-vocabulary words in Japanese-to-English stat...
Abstract. This paper presents an empirical study on four techniques of language model adaptation, in...
Ngram modeling is simple in language modeling and has been widely used in many applications. However...
A new scheme of N-gram language modeling was pro-posed for Japanese, where word N-grams were calcula...
ABSTRACT Recurrent neural networks (RNNs) are a very recent technique to model long range dependenci...
This paper is concerned with capturing long-distance dependencies in sequence models. We propose a t...
We select three word segmentation models with psycholinguistic foundations - transitional probabilit...
A new language model is presented which incorporates local N-gram dependencies with two important so...
Language models (LMs) are essential components of many applications such as speech recognition or ma...
This paper deals with the issue of language model selection based on the analysis of data collection...
This paper presents several practical ways of incorporating linguistic structure into language model...
This thesis investigates an approach to exploiting the long context based on the information about t...
Abstract In studies of human cognition, Bayesian models are increasingly popular tools for understan...
This paper presents an empirical study on four techniques of language model adaptation, including a ...
This thesis investigates the various syntactic sources of difficulty or ease in processing Japanese ...
This paper presents a hybrid model for han-dling out-of-vocabulary words in Japanese-to-English stat...
Abstract. This paper presents an empirical study on four techniques of language model adaptation, in...
Ngram modeling is simple in language modeling and has been widely used in many applications. However...
A new scheme of N-gram language modeling was pro-posed for Japanese, where word N-grams were calcula...
ABSTRACT Recurrent neural networks (RNNs) are a very recent technique to model long range dependenci...
This paper is concerned with capturing long-distance dependencies in sequence models. We propose a t...
We select three word segmentation models with psycholinguistic foundations - transitional probabilit...
A new language model is presented which incorporates local N-gram dependencies with two important so...
Language models (LMs) are essential components of many applications such as speech recognition or ma...
This paper deals with the issue of language model selection based on the analysis of data collection...