Abstract: Taking linguistic features into further consideration, this study proposes a Chinese text categorization algorithm based on sense groups. The algorithm extracts sense groups by analyzing the syntactic and semantic properties of Chinese texts and builds a category sense group library. An SVM is used as the classifier in the text categorization experiments. The experimental results show that the precision and recall of the new sense-group-based algorithm are better than those of traditional algorithms.
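The abstract compares algorithms by precision and recall. As a reminder of what those two figures measure in a single-label categorization setting, here is a minimal sketch in Python. The category names and the toy label lists are hypothetical and are not taken from the study; the function names are likewise illustrative.

```python
from collections import Counter


def per_class_precision_recall(gold, predicted):
    """Return {label: (precision, recall)} for single-label classification."""
    true_positives = Counter()
    predicted_count = Counter(predicted)  # how often each label was predicted
    gold_count = Counter(gold)            # how often each label is correct
    for g, p in zip(gold, predicted):
        if g == p:
            true_positives[g] += 1
    scores = {}
    for label in sorted(set(gold) | set(predicted)):
        # Precision: share of documents assigned to the label that belong to it.
        precision = (true_positives[label] / predicted_count[label]
                     if predicted_count[label] else 0.0)
        # Recall: share of documents belonging to the label that were found.
        recall = (true_positives[label] / gold_count[label]
                  if gold_count[label] else 0.0)
        scores[label] = (precision, recall)
    return scores


def macro_average(scores):
    """Average precision and recall over categories, weighting each equally."""
    n = len(scores)
    avg_precision = sum(p for p, _ in scores.values()) / n
    avg_recall = sum(r for _, r in scores.values()) / n
    return avg_precision, avg_recall


# Hypothetical toy data: five documents, three categories.
gold = ["sports", "sports", "finance", "finance", "health"]
predicted = ["sports", "finance", "finance", "finance", "health"]

scores = per_class_precision_recall(gold, predicted)
macro_precision, macro_recall = macro_average(scores)
```

In this toy data, one "sports" document is misfiled under "finance", so "sports" keeps perfect precision but loses recall, while "finance" keeps perfect recall but loses precision. Whether the study reports macro-averaged or micro-averaged figures is not stated in the abstract; macro averaging is used here only as one common choice.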
Mie University, Graduate School of Engineering, Master's Program, Division of Information Engineering. Automatic text classification (ATC) is the task to automatically assign one ...
This paper is a comparative study on representing units in Chinese text categorization. Several kind...
Abstract. The paper describes an application of classification algorithms to the text categorization...
The process of text categorization involves some understanding of the content of the doc...
Considering the explosive growth of data, the increased amount of text data’s effect on the performa...
Recently research on text mining has attracted lots of attention from both industrial an...
Words and n-grams are commonly used Chinese text representing units and are proved to be good featur...
Text categorization is defined as the task of assigning pre-defined category labels to new documents...
Text categorization task always suffers from a high dimension problem, which leads the learning syst...
Continuous expansion of digital libraries and online news, the huge amount of text documents is exis...
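This work studies Chinese-language Web text classification. It introduces the basic process of Web text classification and the methods for Web text preprocessing and text feature selection, with a focus on KNN, a commonly used content-based classification algorithm. Finally, experiments are used to test the KNN-based Chinese We...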
The feature selection is an important part in automatic classification. In this paper, we use the Ho...