Text categorization requires some understanding of the documents' content and/or prior knowledge of the categories. For document content, our Chinese text categorization system uses a filtering measure for feature selection, and we modify the TFIDF formula to strengthen the weights of important keywords and weaken those of unimportant ones. For category knowledge, we use association rules to improve classification precision and use category priority to represent the relationship between two different categories. The experimental results show that our method effectively not only reduces noisy text but also increases the ratio of precision and ...
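For orientation, the baseline that the abstract says is modified can be sketched as follows. This is a minimal sketch of the standard TF-IDF weight only (term frequency times log inverse document frequency); the abstract does not give the modified formula, so it is not reproduced here. The function name `tfidf`, the sample documents, and the assumption that the Chinese text has already been segmented into words are all illustrative.

```python
import math
from collections import Counter


def tfidf(docs):
    """Standard TF-IDF (not the modified formula from the abstract).

    docs: list of token lists, assumed to be already word-segmented.
    Returns one {term: weight} dict per document.
    """
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in docs for term in set(doc))
    weights = []
    for doc in docs:
        counts = Counter(doc)
        total = len(doc)
        weights.append({
            # Relative term frequency times log inverse document frequency.
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return weights


# Hypothetical, pre-segmented sample documents.
docs = [
    ["股票", "市场", "上涨"],
    ["足球", "比赛", "市场"],
    ["股票", "基金", "股票"],
]
w = tfidf(docs)
```

In this sketch a term that occurs in fewer documents receives a larger weight than an equally frequent term that occurs in more documents, and a term that occurs in every document receives weight zero.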
Text pre-processing is an important component of Chinese text classification. At present, however,...
A filter feature selection process for a text categorization system consists of two main stages: the t...
Automatic text categorization is the task of assigning natural language text documents to predefined...
Giving further consideration to linguistic features, this study proposes an algorithm of Ch...
Recently, research on text mining has attracted lots of attention from both industrial an...
Words and n-grams are commonly used units for representing Chinese text and have proved to be good featur...
The text categorization task always suffers from a high-dimensionality problem, which leads the learning syst...
In this paper, we propose and evaluate approaches to categorizing Chinese texts, which c...
This paper is a comparative study of representation units in Chinese text categorization. Several kind...
A good text classifier efficiently categorizes large sets of text documents in ...
Considering the explosive growth of data, the increased amount of text data’s effect on the performa...