Abstract: This paper provides an analysis of multi-class e-mail categorization per-formance. In order to investigate this issue, the quality of various classification al-gorithms based on two distinct document representation formalisms is compared. In particular, both a standard word-based document representation as well as a character n-gram document representation is used. The latter is regarded as highly noise-tolerant and was originally proposed for automatic language identification and as a convenient means for producing compact document indices. Furthermore the impact of using available e-mail specific meta-information on classification performance is explored and the findings are presented.
In this note, we present results concerning the theory and practice of determining for a given docum...
Thesis (M.S.)--University of Hawaii at Manoa, 2008.Includes bibliographical references (leaves 56-58...
Conventionally, document classification researches focus on improving the learning capabilities of c...
This paper is about using existing directory structures on the file system as models for e-mail clas...
Abstract In this paper we study the effectiveness of using a phrase-based representation in e-mail c...
My project is an implementation of 'Naive Based Algorithm' to classify E-mails. The main idea of the...
In literature, many feature types and learning algorithms are proposed for document classification. ...
With the growing of the Web, we have huge amounts of texts that could be analysed. Unfortunately, mo...
Many important application areas of text classifiers demand high precision and it is common to compa...
{orjeong, thlee, sglee} @ europa.snu.ac.kr Abstract. Electronic documents such as e-catalogs, e-mai...
Two methods for learning text classifiers are compared on classification problems that might arise i...
In this paper we perform a comparative analysis of three models for a feature representation of text...
Classifying e-mails into distinct labels can have a great impact on customer support. By using machi...
The increasing volume of unsolicited mass e-mail (otherwise called spam) has generated a need for re...
Text categorization is a fundamental task in document processing, allowing the automated handling of...
In this note, we present results concerning the theory and practice of determining for a given docum...
Thesis (M.S.)--University of Hawaii at Manoa, 2008.Includes bibliographical references (leaves 56-58...
Conventionally, document classification researches focus on improving the learning capabilities of c...
This paper is about using existing directory structures on the file system as models for e-mail clas...
Abstract In this paper we study the effectiveness of using a phrase-based representation in e-mail c...
My project is an implementation of 'Naive Based Algorithm' to classify E-mails. The main idea of the...
In literature, many feature types and learning algorithms are proposed for document classification. ...
With the growing of the Web, we have huge amounts of texts that could be analysed. Unfortunately, mo...
Many important application areas of text classifiers demand high precision and it is common to compa...
{orjeong, thlee, sglee} @ europa.snu.ac.kr Abstract. Electronic documents such as e-catalogs, e-mai...
Two methods for learning text classifiers are compared on classification problems that might arise i...
In this paper we perform a comparative analysis of three models for a feature representation of text...
Classifying e-mails into distinct labels can have a great impact on customer support. By using machi...
The increasing volume of unsolicited mass e-mail (otherwise called spam) has generated a need for re...
Text categorization is a fundamental task in document processing, allowing the automated handling of...
In this note, we present results concerning the theory and practice of determining for a given docum...
Thesis (M.S.)--University of Hawaii at Manoa, 2008.Includes bibliographical references (leaves 56-58...
Conventionally, document classification researches focus on improving the learning capabilities of c...