This paper studies the pre-processing phase of automated text classification. We apply linear Support Vector Machines to datasets written in English and in European Portuguese: the Reuters and the Portuguese Attorney General’s Office datasets, respectively. The study can be seen as a search for the best document representation along three axes: feature reduction (using linguistic information), feature selection (using word frequencies), and term weighting (using information retrieval measures).
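The three pre-processing axes named above can be illustrated with a minimal sketch. It is not the paper's implementation: stop-word removal stands in for the linguistic feature reduction, a corpus-frequency threshold stands in for feature selection, and tf-idf with L2 normalisation is assumed as the information retrieval weighting. The function names, the stop-word list, the `min_count` parameter, and the toy corpus are all hypothetical, and the SVM training step is omitted.

```python
import math
from collections import Counter

# Hypothetical stop-word list; an assumed stand-in for linguistic feature reduction.
STOP_WORDS = {"the", "a", "of", "and", "in", "to", "is"}


def reduce_features(text):
    """Feature reduction: lowercase, split on whitespace, drop stop words."""
    return [tok for tok in text.lower().split() if tok not in STOP_WORDS]


def select_features(docs, min_count=2):
    """Feature selection: keep words whose corpus frequency is >= min_count."""
    counts = Counter(tok for doc in docs for tok in doc)
    return sorted(word for word, count in counts.items() if count >= min_count)


def weight_terms(docs, vocab):
    """Term weighting: tf-idf per kept word, then L2-normalise each document."""
    n_docs = len(docs)
    doc_freq = {w: sum(1 for doc in docs if w in doc) for w in vocab}
    vectors = []
    for doc in docs:
        tf = Counter(doc)
        vec = [tf[w] * math.log(n_docs / doc_freq[w]) for w in vocab]
        norm = math.sqrt(sum(x * x for x in vec))
        # Documents with no kept words stay as all-zero vectors.
        vectors.append([x / norm for x in vec] if norm > 0 else vec)
    return vectors


# Toy corpus, invented for illustration only.
corpus = [
    "The court ruled in favour of the appeal",
    "The appeal court rejected the claim",
    "Oil prices rose in the market",
]
docs = [reduce_features(text) for text in corpus]
vocab = select_features(docs, min_count=2)
vectors = weight_terms(docs, vocab)
```

The resulting `vectors` could then be passed to any linear SVM implementation. Varying the stop-word list, `min_count`, or the weighting formula corresponds to moving along one of the three axes while holding the others fixed.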
Text mining is drawing enormous attention in this era as there is a huge amount of text data getting...
One of the several benefits of text classification is to automatically assign document in predefined ca...
This thesis presents the application of various classification techniques on text documents. Since t...
Support Vector Machines have been used successfully to classify text documents into sets of concepts...
Support Vector Machines have been applied to text classification with great success. In this paper, ...
This paper examines the role of various linguistic structures on text classification applying the st...
Abstract — Text Classification, also known as text categorization, is the task of automatically allo...
Abstract. This paper proposes and evaluates the use of linguistic information in the pre-processing...
Text categorization is the process of sorting text documents into one or more predefined categories ...
With the massive growth of the use of computers and the internet in the past decade, there has been ...
Over the course of the previous two decades, there has been a rise in the quantity of text documents...
Text classification is an important task in the legal domain. In fact, most of the legal information...
The Text mining and Data mining supports different kinds of algorithms for classification of large d...
Support Vector Machines (SVM) can classify objects described by an effectively infinite-dimensional ...
Abstract. A universal problem with text classification has a problem due to the high dimensionality ...