Real-time email classification is a challenging task because of its online nature, which makes it subject to concept drift. Spam identification, where only two labels exist, has received great attention in the literature. We are nevertheless interested in classification involving multiple folders, which is an additional source of complexity. Moreover, neither cross-validation nor other sampling procedures are suitable for evaluating data streams. Therefore, other metrics, such as the prequential error, have been proposed. However, the prequential error poses some problems, which can be alleviated by using mechanisms such as fading factors. In this paper we present GNUsmail, an open-source extensible framework for email classification, and focus on its ability to ...
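To make the idea of a prequential error with fading factors concrete, the following is a minimal sketch of one common formulation: each example is tested before the model is trained on it, and older losses are discounted by a factor alpha. The function name, the default alpha, and the 0/1 losses are illustrative assumptions; this is not taken from GNUsmail's code.

```python
from typing import Iterable, List


def faded_prequential_error(losses: Iterable[float], alpha: float = 0.995) -> List[float]:
    """Return the faded prequential error after each example.

    losses: per-example loss (e.g. 0 for a correct prediction, 1 for a mistake),
            recorded before the model is updated on that example.
    alpha:  fading factor in (0, 1]; alpha = 1 gives the plain prequential error.
    Both the name and the default value are hypothetical choices for this sketch.
    """
    if not 0.0 < alpha <= 1.0:
        raise ValueError("alpha must be in (0, 1]")
    faded_loss = 0.0   # S_i = L_i + alpha * S_{i-1}
    faded_count = 0.0  # N_i = 1 + alpha * N_{i-1}
    errors: List[float] = []
    for loss in losses:
        faded_loss = loss + alpha * faded_loss
        faded_count = 1.0 + alpha * faded_count
        errors.append(faded_loss / faded_count)
    return errors


# A classifier that errs on the first 50 emails and is correct on the next 50.
stream = [1.0] * 50 + [0.0] * 50
plain = faded_prequential_error(stream, alpha=1.0)
faded = faded_prequential_error(stream, alpha=0.9)
```

With alpha = 1 the early mistakes weigh on the estimate indefinitely, whereas with alpha < 1 the estimate follows recent performance more closely, which is the behaviour one would want under concept drift.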
As we know, email is an effective tool for communication and it is the fastest way to send informatio...
The paper elaborates on how text analysis influences classification—a key part of the spam-filtering...
Since the 1990s, different machine learning methods have been investigated and applied to the email classi...
Real-time classification of massive email data is a challenging task that presents its own particula...
Real-time classification of massive email data is a challenging task that presents its ow...
Email has become one of the fastest and most economical forms of communication. However, the increas...
Machine learning and data mining can be effectively used to model, classify and discover interesting...
The performance of two online linear classifiers, the Perceptron and Littlestone’s Winnow, is expl...
Email is one of the most ubiquitous and pervasive applications used on a daily basis by millions of p...
The increasing volume of unsolicited mass e-mail (otherwise called spam) has generated a need for re...
In the last decade, Internet email has become one of the primary methods of communication used by...
Spam has been studied and dealt with extensively in the email, web and, recently, the blog domain. R...
Spam identification is crucial in implementing an effective email filtering system, while spam recog...
The problem of spam overflow has not been solved completely. Many anti-spam techniques h...