Selecting features that represent a specific class is important to achieve a high text classification performance. The core and critical part of any text feature selection method is the weighting function. Most term weighting methods only consider document level when calculating a term weight and do not consider the distribution of features among different classes. Such an approach does not accurately reflect the specificity of each individual term that can discriminate between the positive and negative documents in the document collection because of the numerous uncertainties in text documents. To address this problem, we propose an innovative and effective feature-weighing method based on three-way decisions to reduce uncertainties in sel...
With the rapid growth of textual content on the Internet, automatic text categorization is a compara...
Many feature selection methods have been proposed for text categorization. However, their performanc...
Text classification is widely used in applications rang-ing from e-mail filtering to review classifi...
Selecting features that represent a specific class is important to achieve a high text classificatio...
In the automated text classification, tfidf is often considered as the default term weighting scheme...
The natural distribution of textual data used in text classification is often imbalanced. Categories...
Text classification is a core technique for text mining and information retrieval. It has been appli...
This paper describes a new approach for estimating term weights in a document, and shows how the new...
Feature selection is an important stage in any text mining classification techniques. In this disser...
This paper proposes a term weighting scheme, categorical term descriptor (CTD), for feature selectio...
This paper proposes a local feature selection (FS) measure namely, Categorical Descriptor Term (CTD)...
Within text categorization and other data mining tasks, the use of suitable methods for term weighti...
Text categorization is an important application of machine learning to the field of document informa...
Text categorization is the task of discovering the category or class text documents belongs to, or i...
Abstract. In this paper, we propose a probabilistic approach to fea-ture selection for multi-class t...
With the rapid growth of textual content on the Internet, automatic text categorization is a compara...
Many feature selection methods have been proposed for text categorization. However, their performanc...
Text classification is widely used in applications rang-ing from e-mail filtering to review classifi...
Selecting features that represent a specific class is important to achieve a high text classificatio...
In the automated text classification, tfidf is often considered as the default term weighting scheme...
The natural distribution of textual data used in text classification is often imbalanced. Categories...
Text classification is a core technique for text mining and information retrieval. It has been appli...
This paper describes a new approach for estimating term weights in a document, and shows how the new...
Feature selection is an important stage in any text mining classification techniques. In this disser...
This paper proposes a term weighting scheme, categorical term descriptor (CTD), for feature selectio...
This paper proposes a local feature selection (FS) measure namely, Categorical Descriptor Term (CTD)...
Within text categorization and other data mining tasks, the use of suitable methods for term weighti...
Text categorization is an important application of machine learning to the field of document informa...
Text categorization is the task of discovering the category or class text documents belongs to, or i...
Abstract. In this paper, we propose a probabilistic approach to fea-ture selection for multi-class t...
With the rapid growth of textual content on the Internet, automatic text categorization is a compara...
Many feature selection methods have been proposed for text categorization. However, their performanc...
Text classification is widely used in applications rang-ing from e-mail filtering to review classifi...