Chinese word structure annotation is potential-ly useful for many NLP tasks, especially for Chinese word segmentation. Li and Zhou (2012) have presented an annotation for word structures in the Penn Chinese Treebank. But they only consider words that have productive affixes, which covers 35 % of word types in that corpus. In this paper, we propose a lin-guistically inspired annotation that covers var-ious morphological derivations of Chinese in a more general way, such that almost all multi-ple-character words can be structurally ana-lyzed. As manual annotation is expensive, we propose a semi-supervised approach to auto-matic annotation, which combines the maxi-mum entropy learning and the EM iteration for the Gaussian mixture model. The pr...
(PKU). Based on a maximum entropy approach, our word segmenter achieved the highest F measure for AS...
Currently, the best performing models for Chinese word segmentation (CWS) are extremely re-source in...
A morphological family in Chinese is the set of compound words embedding a common morpheme. Self-org...
Annotated word structures are useful for various Chinese NLP tasks, such as word segmentation, POS t...
With growing interest in Chinese Language Processing, numerous NLP tools (e.g., word segmenters, par...
With growing interest in Chinese Language Processing, numerous NLP tools (e.g., word segmenters, par...
Parsing, the task of identifying syntactic components, e.g., noun and verb phrases, in a sentence, i...
Human labeled corpus is indispensable for the training of supervised word segmenters. However, it is...
This paper presents a Chinese word segmentation system that uses improved source-channel models of C...
This is a pilot study which aims at the design of a Chinese morphological analyzer which is in state...
Nowadays supervised sequence labeling models can reach competitive performance on the task of Chines...
This paper presents a Chinese word segmentation system that uses improved source-channel models of C...
We address the issue of consuming heterogeneous annotation data for Chinese word segmentation and pa...
The Sentence-based Grammar System which was created by linguist Jinxi Li, is one of the most represe...
With growing interest in Chinese Language Processing, numerous NLP tools (e.g. word segmenters, part...
(PKU). Based on a maximum entropy approach, our word segmenter achieved the highest F measure for AS...
Currently, the best performing models for Chinese word segmentation (CWS) are extremely re-source in...
A morphological family in Chinese is the set of compound words embedding a common morpheme. Self-org...
Annotated word structures are useful for various Chinese NLP tasks, such as word segmentation, POS t...
With growing interest in Chinese Language Processing, numerous NLP tools (e.g., word segmenters, par...
With growing interest in Chinese Language Processing, numerous NLP tools (e.g., word segmenters, par...
Parsing, the task of identifying syntactic components, e.g., noun and verb phrases, in a sentence, i...
Human labeled corpus is indispensable for the training of supervised word segmenters. However, it is...
This paper presents a Chinese word segmentation system that uses improved source-channel models of C...
This is a pilot study which aims at the design of a Chinese morphological analyzer which is in state...
Nowadays supervised sequence labeling models can reach competitive performance on the task of Chines...
This paper presents a Chinese word segmentation system that uses improved source-channel models of C...
We address the issue of consuming heterogeneous annotation data for Chinese word segmentation and pa...
The Sentence-based Grammar System which was created by linguist Jinxi Li, is one of the most represe...
With growing interest in Chinese Language Processing, numerous NLP tools (e.g. word segmenters, part...
(PKU). Based on a maximum entropy approach, our word segmenter achieved the highest F measure for AS...
Currently, the best performing models for Chinese word segmentation (CWS) are extremely re-source in...
A morphological family in Chinese is the set of compound words embedding a common morpheme. Self-org...