To automatically extract Chinese collocations and build a large-scale collocation bank, we are developing a one-million-word Chinese shallow parsed treebank. The treebank can be used not only as a training set for our shallow parser, but also as processed data from which collocations are extracted. This paper presents several issues related to this ongoing project, such as our definition of shallow parsing used in Chinese collocation extraction, guideline preparation, and quality control.
Subject categories: Computer Science, Artificial Intelligence; Computer Science, Theory & Methods; Language & Linguistics. Indexed in: SCI(E); CPCI-S (ISTP); CPCI-SSH (ISSHP).
Preparation of a knowledge bank is a very difficult task. In this paper, we discuss the knowledge extr...
Word segmentation, part-of-speech (POS) tagging, and syntactic parsing are three fundamental Chinese...
This paper suggests a methodology that aims to extract the terminologically relevant collocatio...
xvi, 196 p. : ill. ; 30 cm. PolyU Library Call No.: [THS] LG51 .H577P COMP 2006 Xu. Collocation is a le...
xiii, 172 p. : ill. ; 30 cm. PolyU Library Call No.: [THS] LG51 .H577P COMP 2007 Li. The traditional a...
Collocation extraction systems based on pure statistical methods suffer from two major problems. The...
We present a system which extracts word-based bigram and n-gram collocation information from a 60MB ...
In this paper, we present two machine-learning algorithms, namely, transformation-based error-driven...
With growing interest in Chinese Language Processing, numerous NLP tools (e.g., word segmenters, par...
This document describes the segmentation guidelines for the Penn Chinese Treebank Project. The goal ...
DOI: 10.1109/WIIAT.2008.72. Proceedings - 2008 IEEE/WIC/ACM International Conference on Web Intelligence an...
The increasingly widespread application of natural language processing technology leads parsing to p...