In this paper we present results of experiments with Chinese word segmentation and information retrieval. Our experiments with three different word segmentation algorithms indicate that accurate segmentation measurably improves retrieval performance. We discuss the evaluation of word segmentation algorithms for the purpose of better indexing segmented texts for retrieval.

Introduction

The increased interest in crosslingual and multilingual information retrieval has revealed the new challenges inherent in retrieval in multiple languages. English IR has been extensively engineered for 30 years, with the development of stop lists, stemming, etc., but such resources are not available for many languages. Recent Text Retrieval Conferences (TREC-...
Abstract. Word segmentation has been shown helpful for Chinese-to-English machine translation (MT), ...
Chinese word segmentation is the first step for Chinese text processing. The accuracy of Chinese wor...
We conducted a preliminary study to examine whether Chinese readers' spontaneous word segmentati...
Chinese word segmentation is a prerequisite process in Chinese information retrieval (IR) to divide ...
The majority of recent Cross-Language Information Retrieval (CLIR) research has focused on European ...
With the spread of the Internet, great research interest is being shown in Chinese language i...
We investigate the effects of lexicon size and stopwords on Chinese information retrieval using our ...
A distinctive feature of Chinese text is that a Chinese document is a sequence of Chinese with no sp...
In this fast growing information age, information retrieval (IR) systems and their related fields h...
In the processing of Chinese documents and queries in information retrieval (IR), one has to identif...
We participated in the CLEF 2001 monolingual, bilingual, and multilingual tasks. Our interests in th...
Most Chinese word segmentation systems utilize a monolingual dictionary and are used for monol...
In the processing of Chinese documents and queries in information retrieval (IR), one has to identi...
Automatic segmentation and overlapping bigrams are the most common methods for overcoming the lack o...
Word segmentation is the first step in Chinese information processing, and the performance of the se...