A new approach to the Text Categorization problem is here presented. It is called Gaussian Weighting and it is a supervised learning algorithm that, during the training phase, estimates two very simple and easily computable statistics which are: the Presence \emph{P}, how much a term \emph{t} is present in a category \emph{c}; the Expressiveness \emph{E}, how much \emph{t} is present outside \emph{c} in the rest of the domain. Once the system has learned this information, a Gaussian function is shaped for each term of a category, in order to assign the term a weight that estimates the level of its importance for that particular category. We tested our learning method on the task of single-label classification using the Reu...
Supervised text categorization is a machine learning task where a predefined category label is autom...
Automatic text categorization is the task of assigning natural language text documents to predefined...
We propose a text categorization bootstrapping algorithm in which categories are described by releva...
Within text categorization and other data mining tasks, the use of suitable methods for term weighti...
The natural distribution of textual data used in text classification is often imbalanced. Categories...
This paper introduces a term weighting method for text categorization based on smoothing ideas borro...
This paper discusses a new weighting method for text analyzing from the view point of supervised lea...
We present an approach to text categorization using machine learning techniques. The approach is dev...
In text analysis tasks like text classification and sentiment analysis, the careful choice of term w...
Text categorization is an important application of machine learning to the field of document informa...
Abstract- In this paper, various term weighting methods for text categorization has been discussed. ...
This paper focuses on a comparative evaluation of a wide-range of text categorization methods, inclu...
The increasing availability of digital documents in the last decade has prompted the development of ...
Automatic text categorization is the task of assigning natural language text documents to predefined...
This paper examines the use of inductive learning to categorize natural language documents into pred...
Supervised text categorization is a machine learning task where a predefined category label is autom...
Automatic text categorization is the task of assigning natural language text documents to predefined...
We propose a text categorization bootstrapping algorithm in which categories are described by releva...
Within text categorization and other data mining tasks, the use of suitable methods for term weighti...
The natural distribution of textual data used in text classification is often imbalanced. Categories...
This paper introduces a term weighting method for text categorization based on smoothing ideas borro...
This paper discusses a new weighting method for text analyzing from the view point of supervised lea...
We present an approach to text categorization using machine learning techniques. The approach is dev...
In text analysis tasks like text classification and sentiment analysis, the careful choice of term w...
Text categorization is an important application of machine learning to the field of document informa...
Abstract- In this paper, various term weighting methods for text categorization has been discussed. ...
This paper focuses on a comparative evaluation of a wide-range of text categorization methods, inclu...
The increasing availability of digital documents in the last decade has prompted the development of ...
Automatic text categorization is the task of assigning natural language text documents to predefined...
This paper examines the use of inductive learning to categorize natural language documents into pred...
Supervised text categorization is a machine learning task where a predefined category label is autom...
Automatic text categorization is the task of assigning natural language text documents to predefined...
We propose a text categorization bootstrapping algorithm in which categories are described by releva...