Support Vector Machines have been used successfully to classify text documents into sets of concepts. However, typically, linguistic information is not being used in the classification process or its use has not been fully evaluated. We apply and evaluate two basic linguistic procedures (stop-word removal and stemming/lemmatization) to the multilabel text classification problem. These procedures are applied to the Reuters dataset and to the Portuguese juridical documents from Supreme Courts and Attorney General’s Office
We carried out a series of experiments on text classification using multi-word features. A hand-craf...
Classification plays a vital role in many information management and retrieval tasks . Text classifi...
In this paper, we address the problem of dealing with a large collection of data and propose a met...
Portuguese juridical documents from Supreme Courts and the Attorney General’s Office are manually cl...
Text classification is an important task in the legal domain. In fact, most of the legal information...
This paper examines the role of various linguistic structures on text classification applying the st...
This paper performs a study on the pre-processing phase of the automated text classification problem...
Support Vector Machines have been applied to text classification with great success. In this paper, ...
With the massive growth of the use of computers and the internet in the past decade, there has been ...
We created and analyzed a text classification dataset from freely-available web documents from the U...
Support Vector Machines (SVM) can classify objects described by an effectively infinite-dimensional ...
Abstract. This paper proposes and evaluates the use of linguistic in-formation in the pre-processing...
Modern information society is facing the challenge of handling massive volume of online documents, n...
Text categorization is the process of sorting text documents into one or more predefined categories ...
This paper explores the use of Support Vector Machines (SVMs) for learning text classifiers from ex...
We carried out a series of experiments on text classification using multi-word features. A hand-craf...
Classification plays a vital role in many information management and retrieval tasks . Text classifi...
In this paper, we address the problem of dealing with a large collection of data and propose a met...
Portuguese juridical documents from Supreme Courts and the Attorney General’s Office are manually cl...
Text classification is an important task in the legal domain. In fact, most of the legal information...
This paper examines the role of various linguistic structures on text classification applying the st...
This paper performs a study on the pre-processing phase of the automated text classification problem...
Support Vector Machines have been applied to text classification with great success. In this paper, ...
With the massive growth of the use of computers and the internet in the past decade, there has been ...
We created and analyzed a text classification dataset from freely-available web documents from the U...
Support Vector Machines (SVM) can classify objects described by an effectively infinite-dimensional ...
Abstract. This paper proposes and evaluates the use of linguistic in-formation in the pre-processing...
Modern information society is facing the challenge of handling massive volume of online documents, n...
Text categorization is the process of sorting text documents into one or more predefined categories ...
This paper explores the use of Support Vector Machines (SVMs) for learning text classifiers from ex...
We carried out a series of experiments on text classification using multi-word features. A hand-craf...
Classification plays a vital role in many information management and retrieval tasks . Text classifi...
In this paper, we address the problem of dealing with a large collection of data and propose a met...