A distinctive feature of Chinese test is that a Chinese document is a sequence of Chinese with no space or boundary between Chinese words. This feature makes Chinese information retrieval more difficult since a retrieved document which contains the query term as a sequence of Chinese characters may not be really relevant to the query since the query term (as a sequence Chinese characters) may not be a valid Chinese word in that documents. On the other hand, a document that is actually relevant may not be retrieved because it does not contain the query sequence but contains other relevant words. In this research, we propose a hybrid Chinese information retrieval model by incorporating word-based techniques with the traditional character-base...
Automatic indexing is the automatic creation of a text surrogate, normally keywords or phrases, to r...
This paper presents the results of experiments in which the authors tested different types of featur...
This paper describes a hybrid Chinese word segmenter that is being developed as part of a larger Chi...
In this paper we present results of experiments with Chinese word segmentation and information retri...
In the processing of Chinese documents and queries in information retrieval (IR), one has to identi...
In this fast growing information age, information retrieval (IR) systems and their related fields h...
Chinese word segmentation is a prerequisite process in Chinese information retrieval (IR) to divide ...
Different models retrieve the documents based on different approaches of extracting the underlying c...
With the widespread of the Internet, great research in-terests are being shown in Chinese language i...
In the processing of Chinese documents and queries in information retrieval (IR), one has to identif...
The majority of recent Cross-Language Information Retrieval (CLIR) research has focused on European ...
With the advent of the Internet and intranets, substantial interest is being shown in Asian language...
We investigate the effects of lexicon size and stopwords on Chinese information retrieval using our ...
Automatic segmentation and overlapping bigrams are the most com-mon methods for overcoming the lack ...
Chinese texts are character-based, not word-based, and there is no boundary mark between words in Ch...
Automatic indexing is the automatic creation of a text surrogate, normally keywords or phrases, to r...
This paper presents the results of experiments in which the authors tested different types of featur...
This paper describes a hybrid Chinese word segmenter that is being developed as part of a larger Chi...
In this paper we present results of experiments with Chinese word segmentation and information retri...
In the processing of Chinese documents and queries in information retrieval (IR), one has to identi...
In this fast growing information age, information retrieval (IR) systems and their related fields h...
Chinese word segmentation is a prerequisite process in Chinese information retrieval (IR) to divide ...
Different models retrieve the documents based on different approaches of extracting the underlying c...
With the widespread of the Internet, great research in-terests are being shown in Chinese language i...
In the processing of Chinese documents and queries in information retrieval (IR), one has to identif...
The majority of recent Cross-Language Information Retrieval (CLIR) research has focused on European ...
With the advent of the Internet and intranets, substantial interest is being shown in Asian language...
We investigate the effects of lexicon size and stopwords on Chinese information retrieval using our ...
Automatic segmentation and overlapping bigrams are the most com-mon methods for overcoming the lack ...
Chinese texts are character-based, not word-based, and there is no boundary mark between words in Ch...
Automatic indexing is the automatic creation of a text surrogate, normally keywords or phrases, to r...
This paper presents the results of experiments in which the authors tested different types of featur...
This paper describes a hybrid Chinese word segmenter that is being developed as part of a larger Chi...