Text classification is an important task of data mining. Existing algorithms, which based on vector space models, does not considered concept similarities among words, so the accuracy of traditional text classification cannot guarantee. To solve the problem, this paper proposes a new text classification algorithm in Chinese text processing based on concept similarity. The contributions of the paper include: (1) proposing a new similarity-computing model between words or sentences based on concept similarity; (2) applying the algorithm successfully in the text classification of WEB news; (3).analyzing the similarity computing formulas systematically in theory; (4).proving that the algorithm has much more accurate than traditional k-NN algori...
Classification plays a vital role in many information management and retrieval tasks . Text classifi...
Abstract. The feature selection is an important part in automatic text classification. In this paper...
We present a comprehensive study of computing similarity between texts. We start from the observatio...
In recent years, there has been an increasing interest in data clustering of short documents. Existi...
Intelligent communication processing in English aims to obtain effective information from unstructur...
This paper focuses on the high dimensional text problems encountered in text classification.Document...
Abstract—Text Classification is a challenging and a red hot field in the current scenario and has gr...
The similarity between objects is the core research area of data mining. In order to reduce the inte...
Abstract:Most of the common techniques of text mining are based on the statistical analysis of the t...
The exploitation of syntactic structures and semantic background knowledge has always been an appeal...
Measuring document similarity has shown its fundamental utilization in various text mining applicati...
Abstract—In order to improve the accuracy of short text similarity calculation, this paper presents ...
The massive amount of information from the internet has revolutionized the field of natural language...
Based on the strong classification feature recognition algorithm, the calculation algorithm of a tex...
Now-a-days, the documents similarity measuring plays an important role in text related researches. T...
Classification plays a vital role in many information management and retrieval tasks . Text classifi...
Abstract. The feature selection is an important part in automatic text classification. In this paper...
We present a comprehensive study of computing similarity between texts. We start from the observatio...
In recent years, there has been an increasing interest in data clustering of short documents. Existi...
Intelligent communication processing in English aims to obtain effective information from unstructur...
This paper focuses on the high dimensional text problems encountered in text classification.Document...
Abstract—Text Classification is a challenging and a red hot field in the current scenario and has gr...
The similarity between objects is the core research area of data mining. In order to reduce the inte...
Abstract:Most of the common techniques of text mining are based on the statistical analysis of the t...
The exploitation of syntactic structures and semantic background knowledge has always been an appeal...
Measuring document similarity has shown its fundamental utilization in various text mining applicati...
Abstract—In order to improve the accuracy of short text similarity calculation, this paper presents ...
The massive amount of information from the internet has revolutionized the field of natural language...
Based on the strong classification feature recognition algorithm, the calculation algorithm of a tex...
Now-a-days, the documents similarity measuring plays an important role in text related researches. T...
Classification plays a vital role in many information management and retrieval tasks . Text classifi...
Abstract. The feature selection is an important part in automatic text classification. In this paper...
We present a comprehensive study of computing similarity between texts. We start from the observatio...