Chinese texts are character-based, not word-based, and there is no boundary mark between words in Chinese sentences. Each Chinese character stands for one phonological syllable and, in most cases, represents a morpheme. This raises a problem because, in Chinese, less than 10% of the word types (and less than 50% of the tokens in a text) are composed of a single character. In most Chinese IR tasks, identifying keywords is difficult because of segmentation ambiguities and the occurrence of unknown words. As a result, a great deal of research has focused on extracting words from raw Chinese texts (i.e., sentences without text segmentation). In this dissertation, we have proposed two different approaches to deal with Chinese natural language pr...
Chinese word segmentation is a prerequisite process in Chinese information retrieval (IR) to divide ...
[[abstract]]In this paper, we use a lexical method to do sentence alignment for an English-Chinese c...
In the processing of Chinese documents and queries in information retrieval (IR), one has to identi...
Automatic indexing is the automatic creation of a text surrogate, normally keywords or phrases, to r...
In this fast growing information age, information retrieval (IR) systems and their related fields h...
With the widespread of the Internet, great research in-terests are being shown in Chinese language i...
Automatic Chinese Word Segmentation is one of the basic research issues on text categorization, auto...
Query expansion has long been suggested as a technique for dealing with word mismatch problem in inf...
This paper reveals some important properties of CFSs and applications in Chinese natural language pr...
A distinctive feature of Chinese test is that a Chinese document is a sequence of Chinese with no sp...
Abstract:- In this paper, we use a lexical method to do sentence alignment for an English-Chinese co...
Abstract—The continually and high-rate growth of China's economy has attracted more and more in...
xvi, 196 p. : ill. ; 30 cm.PolyU Library Call No.: [THS] LG51 .H577P COMP 2006 XuCollocation is a le...
In this paper we present results of experiments with Chinese word segmentation and information retri...
Textual information written in Chinese now represents a huge knowledge repository. The first step of...
Chinese word segmentation is a prerequisite process in Chinese information retrieval (IR) to divide ...
[[abstract]]In this paper, we use a lexical method to do sentence alignment for an English-Chinese c...
In the processing of Chinese documents and queries in information retrieval (IR), one has to identi...
Automatic indexing is the automatic creation of a text surrogate, normally keywords or phrases, to r...
In this fast growing information age, information retrieval (IR) systems and their related fields h...
With the widespread of the Internet, great research in-terests are being shown in Chinese language i...
Automatic Chinese Word Segmentation is one of the basic research issues on text categorization, auto...
Query expansion has long been suggested as a technique for dealing with word mismatch problem in inf...
This paper reveals some important properties of CFSs and applications in Chinese natural language pr...
A distinctive feature of Chinese test is that a Chinese document is a sequence of Chinese with no sp...
Abstract:- In this paper, we use a lexical method to do sentence alignment for an English-Chinese co...
Abstract—The continually and high-rate growth of China's economy has attracted more and more in...
xvi, 196 p. : ill. ; 30 cm.PolyU Library Call No.: [THS] LG51 .H577P COMP 2006 XuCollocation is a le...
In this paper we present results of experiments with Chinese word segmentation and information retri...
Textual information written in Chinese now represents a huge knowledge repository. The first step of...
Chinese word segmentation is a prerequisite process in Chinese information retrieval (IR) to divide ...
[[abstract]]In this paper, we use a lexical method to do sentence alignment for an English-Chinese c...
In the processing of Chinese documents and queries in information retrieval (IR), one has to identi...