This paper presents a tagging approach to Chinese unknown word identification based on lexicalized hidden Markov models (LHMMs). In this work, Chinese unknown word identification is represented as a tagging task on a sequence of known words by introducing word-formation patterns and part-of-speech information. Based on the lexicalized HMMs, a statistical tagger is then developed to assign each known word an appropriate tag that indicates its pattern in forming a word and the part-of-speech of the formed word. The experimental results on the Peking University corpus indicate that both the lexicalization technique and the introduction of part-of-speech information are helpful for unknown word identification. The experiment on the SIGHAN-PK open test data also sh...
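To make the tagging formulation concrete, the following is a minimal sketch of how a word built from several known words could be encoded as per-piece tags and then decoded back. The pattern labels `B`, `M`, `E`, `S` (begin, middle, end, standalone), the `pattern-POS` tag format, the POS labels, and the function names are assumptions made for illustration; the abstract does not specify the actual tag inventory, and the statistical tagger itself is not shown.

```python
# Illustrative sketch only: the pattern labels (B/M/E/S), the "pattern-POS"
# tag format, and both function names are hypothetical, not taken from the paper.

def tag_known_words(pieces, pos):
    """Give each known-word piece a tag combining its position pattern
    inside the formed word with the POS of that formed word."""
    if not pieces:
        return []
    if len(pieces) == 1:
        # A known word that is a word by itself.
        return [(pieces[0], f"S-{pos}")]
    tagged = []
    last = len(pieces) - 1
    for i, piece in enumerate(pieces):
        if i == 0:
            pattern = "B"      # first piece of the formed word
        elif i == last:
            pattern = "E"      # last piece of the formed word
        else:
            pattern = "M"      # any piece in between
        tagged.append((piece, f"{pattern}-{pos}"))
    return tagged


def merge_tagged(tagged):
    """Rebuild (word, POS) pairs from a tagged sequence of known words."""
    words = []
    buffer = []
    for piece, tag in tagged:
        pattern, pos = tag.split("-", 1)
        if pattern == "S":
            words.append((piece, pos))
        elif pattern == "B":
            buffer = [piece]
        elif pattern == "M":
            buffer.append(piece)
        elif pattern == "E":
            buffer.append(piece)
            words.append(("".join(buffer), pos))
            buffer = []
    return words


# Example: three single-character known words forming one longer word.
tagged = tag_known_words(["李", "小", "明"], "nr")
print(tagged)
print(merge_tagged(tagged))
```

In this framing, identifying an unknown word reduces to predicting the tag sequence; a tagger such as the lexicalized HMM described above would choose the tags, and a decoding step like `merge_tagged` would turn them back into words.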
A fast statistical method for Chinese unknown word detection is proposed. It is based on association...
We define unknown words to be those not included in the core lexicon and unable to be generated by...
In this paper, an effective approach for Chinese speech recognition on small vocabulary size is prop...
This paper presents a modified class-based LM approach to Chinese unknown word identification. In th...
Word segmentation, part-of-speech (POS) tagging, and sense tagging are important steps in various Ch...
This paper presents a unified approach for Chinese lexical analysis using hierarchical hidden Markov...
This paper presents a lexicalized HMM-based approach to Chinese named entity recognition (NER). To t...
This paper presents a unified solution, which is based on the idea of “roles tagging”, to the compli...
Word segmentation is a basic step in Chinese text processing. The identification of unknown words is...
This paper investigates the recognition of unknown words in Chinese parsing. Two methods are propose...
According to a survey in a corpus, the majority of the unknown words in Chinese texts are numbers, time ...
Copyright © 2014 Qiuping Huang et al. This is an open access article distributed under the Creative C...
There is no blank to mark word boundaries in Chinese text. As a result, identifying words is difficu...