Spam filtering is a text classification task to which Case-Based Reasoning (CBR) has been successfully applied. We describe the ECUE system, which classifies emails using a feature-based form of textual CBR. Then, we describe an alternative way to compute the distances between cases in a feature-free fashion, using a distance measure based on text compression. This distance measure has the advantages of having no set-up costs and being resilient to concept drift. We report an empirical comparison, which shows the feature-free approach to be more accurate than the feature-based system. These results are fairly robust over different compression algorithms in that we find that the accuracy when using a Lempel-Ziv compressor (GZip) is approxima...
Recently the number of undesirable messages coming to e-mail has strongly increased. As spam has cha...
Emails are a popular and preferred way of written communication in our daily life. The problem with ...
The Internet has touched every part of our lives, including our interactions and communications. Pri...
Spam filtering is a text classification task to which Case-Based Reasoning (CBR) has been successful...
Spam filtering is a text classification task to which Case-Based Reasoning (CBR) has been successful...
n this paper, we compare case-based spam filters, focusing on their resilience to concept drift. In ...
Because of the changing nature of spam, a spam filtering system that uses machine learning will need ...
Because of the changing nature of spam, a spam filtering system that uses machine learning will need...
This paper presents a comparison between two alternative strategies for addressing feature selection...
In this paper we propose a novel feature selection method able to handle concept drift problems in s...
Spam filtering is a particularly challenging machine learning task as the data distribution and conc...
While text classification has been identified for some time as a promising application area for Arti...
As the vast increases of the electronic mail (email) usages continue, spam (unsolicited bulk mail) h...
In this paper we analyse the strengths and weaknesses of the mainly used feature selection methods i...
n this paper we show an instance-based reasoning e-mail filtering model that outperforms classical m...
Recently the number of undesirable messages coming to e-mail has strongly increased. As spam has cha...
Emails are a popular and preferred way of written communication in our daily life. The problem with ...
The Internet has touched every part of our lives, including our interactions and communications. Pri...
Spam filtering is a text classification task to which Case-Based Reasoning (CBR) has been successful...
Spam filtering is a text classification task to which Case-Based Reasoning (CBR) has been successful...
n this paper, we compare case-based spam filters, focusing on their resilience to concept drift. In ...
Because of the changing nature of spam, a spam filtering system that uses machine learning will need ...
Because of the changing nature of spam, a spam filtering system that uses machine learning will need...
This paper presents a comparison between two alternative strategies for addressing feature selection...
In this paper we propose a novel feature selection method able to handle concept drift problems in s...
Spam filtering is a particularly challenging machine learning task as the data distribution and conc...
While text classification has been identified for some time as a promising application area for Arti...
As the vast increases of the electronic mail (email) usages continue, spam (unsolicited bulk mail) h...
In this paper we analyse the strengths and weaknesses of the mainly used feature selection methods i...
n this paper we show an instance-based reasoning e-mail filtering model that outperforms classical m...
Recently the number of undesirable messages coming to e-mail has strongly increased. As spam has cha...
Emails are a popular and preferred way of written communication in our daily life. The problem with ...
The Internet has touched every part of our lives, including our interactions and communications. Pri...