This paper describes an algorithm for document representation in a reduced vectorial space by a process of feature extraction. The algorithm is applied and evaluated in the context of the supervised classification of news articles from the collection of Le Monde newspaper issued in the years 2003 and 2004. We are generating a document representation (or profile), in a space of 800 dimensions, represented by semantic tags from a machine-readable dictionary. We are dealing with two issues: the synonymy handled by thematic conflation and polysemy for which we have developed a statistical method for word-sense disambiguation. We propose four variants for the profile generation (of a document) depending on whether a recursive system is used or ...
In this note, we present results concerning the theory and practice of determining for a given docum...
(Automatic) document classification is generally defined as content-based assignment of one or more ...
This work deals with document classification. It is a supervised learning method (it needs a labeled...
This paper describes an algorithm for document representation in a reduced vectorial space by a proc...
Exploiting multimedia documents leads to representation problems of the textual and visual informati...
Automatic text classification is the process of automatically classifying text documents into pre-de...
Conventionally, document classification researches focus on improving the learning capabilities of c...
This paper investigates the problem of text classification. The task of text classification is to as...
(Automatic) document classification is generally defined as content-based assignment of one or more ...
This paper presents an extension of prior work by Michael D. Lee on psychologically plausible text c...
Dimensionality reduction (DR) through feature extraction (FE) is desirable for efficient and effecti...
In this paper we perform a comparative analysis of three models for a feature representation of text...
This paper presents an extension of prior work by Michael D. Lee on psychologically plausible text c...
This thesis follows up text categorization. In the first part are described several chosen algorithm...
In practical text classification tasks, the ability to interpret the classification result is as imp...
In this note, we present results concerning the theory and practice of determining for a given docum...
(Automatic) document classification is generally defined as content-based assignment of one or more ...
This work deals with document classification. It is a supervised learning method (it needs a labeled...
This paper describes an algorithm for document representation in a reduced vectorial space by a proc...
Exploiting multimedia documents leads to representation problems of the textual and visual informati...
Automatic text classification is the process of automatically classifying text documents into pre-de...
Conventionally, document classification researches focus on improving the learning capabilities of c...
This paper investigates the problem of text classification. The task of text classification is to as...
(Automatic) document classification is generally defined as content-based assignment of one or more ...
This paper presents an extension of prior work by Michael D. Lee on psychologically plausible text c...
Dimensionality reduction (DR) through feature extraction (FE) is desirable for efficient and effecti...
In this paper we perform a comparative analysis of three models for a feature representation of text...
This paper presents an extension of prior work by Michael D. Lee on psychologically plausible text c...
This thesis follows up text categorization. In the first part are described several chosen algorithm...
In practical text classification tasks, the ability to interpret the classification result is as imp...
In this note, we present results concerning the theory and practice of determining for a given docum...
(Automatic) document classification is generally defined as content-based assignment of one or more ...
This work deals with document classification. It is a supervised learning method (it needs a labeled...