We investigate the effects of lexicon size and stopwords on Chinese information retrieval using our method of short-word segmentation based on simple language usage rules and statistics. These rules allow us to employ a small lexicon of only 2,175 entries and provide quite admirable retrieval results. It is noticed that accurate segmentation is not essential for good retrieval. Larger lexicons can lead to incremental improvements. The presence of stopwords do not contribute much noise to IR. Their removal risks elimination of crucial words in a query and adversely affect retrieval, especially when the queries are short. Short queries of a few words perform more than 10% worse than paragraph-size queries
We conducted a preliminary study to examine whether Chinese readers' spontaneous word segmentation p...
[[abstract]]Chinese word segmentation is an essential step in a processing of Chinese natural langua...
[[abstract]]In this paper, we use a lexical method to do sentence alignment for an English-Chinese c...
In this paper we present results of experiments with Chinese word segmentation and information retri...
Chinese word segmentation is a prerequisite process in Chinese information retrieval (IR) to divide ...
The majority of recent Cross-Language Information Retrieval (CLIR) research has focused on European ...
With the widespread of the Internet, great research in-terests are being shown in Chinese language i...
In the processing of Chinese documents and queries in information retrieval (IR), one has to identi...
In the processing of Chinese documents and queries in information retrieval (IR), one has to identif...
A distinctive feature of Chinese test is that a Chinese document is a sequence of Chinese with no sp...
Automatic segmentation and overlapping bigrams are the most common methods for overcoming the lack o...
In this fast growing information age, information retrieval (IR) systems and their related fields h...
We conducted a preliminary study to examine whether Chinese readers' spontaneous word segmentati...
Abstract:- In this paper, we use a lexical method to do sentence alignment for an English-Chinese co...
This experiment tests the effectiveness of Chinese information retrieval using a segmenter that is d...
We conducted a preliminary study to examine whether Chinese readers' spontaneous word segmentation p...
[[abstract]]Chinese word segmentation is an essential step in a processing of Chinese natural langua...
[[abstract]]In this paper, we use a lexical method to do sentence alignment for an English-Chinese c...
In this paper we present results of experiments with Chinese word segmentation and information retri...
Chinese word segmentation is a prerequisite process in Chinese information retrieval (IR) to divide ...
The majority of recent Cross-Language Information Retrieval (CLIR) research has focused on European ...
With the widespread of the Internet, great research in-terests are being shown in Chinese language i...
In the processing of Chinese documents and queries in information retrieval (IR), one has to identi...
In the processing of Chinese documents and queries in information retrieval (IR), one has to identif...
A distinctive feature of Chinese test is that a Chinese document is a sequence of Chinese with no sp...
Automatic segmentation and overlapping bigrams are the most common methods for overcoming the lack o...
In this fast growing information age, information retrieval (IR) systems and their related fields h...
We conducted a preliminary study to examine whether Chinese readers' spontaneous word segmentati...
Abstract:- In this paper, we use a lexical method to do sentence alignment for an English-Chinese co...
This experiment tests the effectiveness of Chinese information retrieval using a segmenter that is d...
We conducted a preliminary study to examine whether Chinese readers' spontaneous word segmentation p...
[[abstract]]Chinese word segmentation is an essential step in a processing of Chinese natural langua...
[[abstract]]In this paper, we use a lexical method to do sentence alignment for an English-Chinese c...