Funded for open-access publication: Universidade de Vigo/CISUG. This study addresses the use of different features to complement synset-based and bag-of-words representations of texts when applying classical ML approaches to spam filtering (Ferrara, 2019). Although many complementary features exist, to improve the applicability of this study we have selected only those that can be computed regardless of the communication channel used to distribute content. Feature evaluation has been performed using content distributed through different channels (social networks and email) and classifiers (AdaBoost, Flexible Bayes, Naïve Bayes, Random Forests, and SVMs). The results have revealed the use...
Master's thesis in information and communication technology, 2010, Universitetet i Agder, Grimstad. Tradi...
The steady growth and popularization of the Web has led spammers to develop techniques to circumvent...
We present a classification model for semi-structured documents based on statistical language modell...
Nowadays, e-mail spam is not a novelty, but it is still an important problem with a high impact on t...
In this paper we analyse the strengths and weaknesses of the most commonly used feature selection methods i...
The Internet emerged as a powerful infrastructure for the worldwide communication and interaction of...
In this paper, we study the usability of linguistic features in the context of statistical-based mac...
The paper elaborates on how text analysis influences classification—a key part of the spam-filtering...
The rapid growth of unsolicited and unwanted messages has inspired the development of many anti-spam...
A solution to spam emails remains elusive despite more than a decade of research efforts on spam filter...
Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP); Conselho Nacional de Desenvolvimento Ci...
Spam is a serious problem that affects email users (e.g. phishing attacks, viruses and time spent read...
Spam filtering poses a special problem in text categorization, of which the defining characteristic ...
In the modern world, email has established itself as the most used technology for exchanging m...