This paper presents several practical ways of incorporating linguistic structure into language models. A headword detector is first applied to detect the headword of each phrase in a sentence. A permuted headword trigram model (PHTM) is then generated from the annotated corpus. Finally, PHTM is extended to a cluster PHTM (C-PHTM) by defining clusters for similar words in the corpus. We evaluated the proposed models on the realistic application of Japanese Kana-Kanji conversion. Experiments show that C-PHTM achieves 15 % error rate reduction over the word trigram model. Thi
A new scheme of N-gram language modeling was pro-posed for Japanese, where word N-grams were calcula...
In this article, we propose a new postprocessing strategy, word suggestion, based on a multiple word...
© 2015 IEEE. Compounding is one of the most productive word formation processes in many languages an...
This paper presents an extensive empirical study on two language modeling techniques, linguistically...
Language models are an important component of speech recognition. They aim to predict the probabilit...
Copyright c©1998 by The Association for Computational Linguistics The paper presents a language mode...
Abstract. In this paper, we present a language model based on clusters obtained by applying regular ...
It has been established that incorporating word cluster features derived from large unlabeled corpor...
Pelemans J., Van hamme H., Wambacq P., ''Translation-based word clustering for language models'', Bo...
Many of the kinds of language model used in speech understanding suffer from imperfect modeling of i...
In this paper we describe a word clustering method for class-based n-gram model. The measurement for...
Neural architectures are prominent in the construction of language models (LMs). However, word-leve...
Croft (2001) argues that distributional analysis of word classes is doomed to failure because there ...
In the field of natural language processing (NLP), recent research has shown that deep neural networ...
in this paper we present a novel clustering technique for compound words. By mapping compounds onto ...
A new scheme of N-gram language modeling was pro-posed for Japanese, where word N-grams were calcula...
In this article, we propose a new postprocessing strategy, word suggestion, based on a multiple word...
© 2015 IEEE. Compounding is one of the most productive word formation processes in many languages an...
This paper presents an extensive empirical study on two language modeling techniques, linguistically...
Language models are an important component of speech recognition. They aim to predict the probabilit...
Copyright c©1998 by The Association for Computational Linguistics The paper presents a language mode...
Abstract. In this paper, we present a language model based on clusters obtained by applying regular ...
It has been established that incorporating word cluster features derived from large unlabeled corpor...
Pelemans J., Van hamme H., Wambacq P., ''Translation-based word clustering for language models'', Bo...
Many of the kinds of language model used in speech understanding suffer from imperfect modeling of i...
In this paper we describe a word clustering method for class-based n-gram model. The measurement for...
Neural architectures are prominent in the construction of language models (LMs). However, word-leve...
Croft (2001) argues that distributional analysis of word classes is doomed to failure because there ...
In the field of natural language processing (NLP), recent research has shown that deep neural networ...
in this paper we present a novel clustering technique for compound words. By mapping compounds onto ...
A new scheme of N-gram language modeling was pro-posed for Japanese, where word N-grams were calcula...
In this article, we propose a new postprocessing strategy, word suggestion, based on a multiple word...
© 2015 IEEE. Compounding is one of the most productive word formation processes in many languages an...