Abstract. In text classification, providing an efficient classifier even if the num-ber of documents involved in the learning step is small remains an important is-sue. In this paper we evaluate the performance of traditional classification meth-ods to better evaluate their limitation in the learning phase when dealing with small amount of documents. We thus propose a new way for weighting features which are used for classifying. These features have been integrated in two well known classifiers: Class-Feature-Centroid and Naı̈ve Bayes, and evaluations have been performed on two real datasets. We have also investigated the influence on parameters such as number of classes, documents or words in the classification. Experiments have shown the ...
Au quotidien, le réflexe de classifier est omniprésent et inconscient. Par exemple dans le processus...
Text classification is a wide research field with existing ready-to-use solutions for supervised tra...
As the digital age pushes forward, data and document size have been increasing rapidly. A more effic...
In text classification, providing an efficient classifier even if the number of documents involved i...
International audienceIn text classification, providing an efficient classifier even if the number o...
The natural distribution of textual data used in text classification is often imbalanced. Categories...
Feature selection methods are often applied in the context of document classification. They are part...
In this paper we presented a lot of experiments that examine how the particular parts of the documen...
This work deals with document classification. It is a supervised learning method (it needs a labeled...
In recent years we have seen a tremendous growth in the volume of online text documents available on...
Document classification methods for a small number of training documents are discussed using mathema...
Abstract. In recent years we have seen a tremendous growth in the volume of text documents available...
Selecting features that represent a specific class is important to achieve a high text classificatio...
Text categorization is the task of discovering the category or class text documents belongs to, or i...
Text classification is the process in which text document is assigned to one or more predefined cate...
Au quotidien, le réflexe de classifier est omniprésent et inconscient. Par exemple dans le processus...
Text classification is a wide research field with existing ready-to-use solutions for supervised tra...
As the digital age pushes forward, data and document size have been increasing rapidly. A more effic...
In text classification, providing an efficient classifier even if the number of documents involved i...
International audienceIn text classification, providing an efficient classifier even if the number o...
The natural distribution of textual data used in text classification is often imbalanced. Categories...
Feature selection methods are often applied in the context of document classification. They are part...
In this paper we presented a lot of experiments that examine how the particular parts of the documen...
This work deals with document classification. It is a supervised learning method (it needs a labeled...
In recent years we have seen a tremendous growth in the volume of online text documents available on...
Document classification methods for a small number of training documents are discussed using mathema...
Abstract. In recent years we have seen a tremendous growth in the volume of text documents available...
Selecting features that represent a specific class is important to achieve a high text classificatio...
Text categorization is the task of discovering the category or class text documents belongs to, or i...
Text classification is the process in which text document is assigned to one or more predefined cate...
Au quotidien, le réflexe de classifier est omniprésent et inconscient. Par exemple dans le processus...
Text classification is a wide research field with existing ready-to-use solutions for supervised tra...
As the digital age pushes forward, data and document size have been increasing rapidly. A more effic...