With the spread of the Internet, Chinese-language information retrieval has attracted great research interest in recent years. The absence of word boundaries in Chinese makes Chinese information retrieval (IR) different from European IR. To apply traditional IR approaches to Chinese, sentences must first be segmented into words, so word segmentation plays a key role in Chinese IR. Because word segmentation is not straightforward and its results are sometimes ambiguous, n-grams are used as an alternative. Several experimental studies have compared words and n-grams [5, 6] and examined word segmentation and its effect on information retrieval [3]. These studies show that using either words or n-gram...
In order to analyze security and terrorism related content in Chinese, it is important to perform wo...
Chinese texts are character-based, not word-based, and there is no boundary mark between words in Ch...
This paper presents a Chinese word segmentation system that uses improved source-channel models of C...
In the processing of Chinese documents and queries in information retrieval (IR), one has to identi...
In this paper we present results of experiments with Chinese word segmentation and information retri...
Chinese word segmentation is a prerequisite process in Chinese information retrieval (IR) to divide ...
In many languages there are no word delimiters in the text. It is very difficult to index...
We investigate whether suffix related features can significantly improve the performance of characte...
A distinctive feature of Chinese text is that a Chinese document is a sequence of Chinese with no sp...
Chinese information search engines always encounter a difficulty in segmentation of Chinese words fr...
As the amount of online Chinese content grows, there is a critical need for effective Chinese word ...
In this fast growing information age, information retrieval (IR) systems and their related fields h...
The majority of recent Cross-Language Information Retrieval (CLIR) research has focused on European ...
In the processing of Chinese documents and queries in information retrieval (IR), one has to identif...
We investigate the effects of lexicon size and stopwords on Chinese information retrieval using our ...