Abstract. Web spam is a serious problem for search engines because the quality of their results can be severely degraded by the presence of this kind of page. In this paper, we present an efficient spam detection system based on a classifier that combines new link-based features with language-model (LM)-based ones. These features are not only related to quantitative data extracted from the Web pages, but also to qualitative properties, mainly of the page links. We consider, for instance, the ability of a search engine to find, using information provided by the page for a given link, the page that the link actually points at. This can be regarded as indicative of the link reliability. We also check the coherence between a page and another ...
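The coherence check mentioned in the abstract above can be illustrated with a small sketch. The visible text does not say which measure the authors use, so the following is an assumption: it compares two smoothed unigram language models with the Kullback-Leibler divergence, a common choice for LM-based features. The function names `unigram_lm` and `kl_divergence`, the whitespace tokenizer, and the smoothing constant `alpha` are all hypothetical and chosen only for illustration.

```python
import math
from collections import Counter


def unigram_lm(text, vocab, alpha=1.0):
    """Additively smoothed unigram language model over a fixed vocabulary.

    Hypothetical helper: whitespace tokenization and add-alpha smoothing
    are simplifying assumptions, not taken from the abstract.
    """
    counts = Counter(text.lower().split())
    total = sum(counts[w] for w in vocab) + alpha * len(vocab)
    return {w: (counts[w] + alpha) / total for w in vocab}


def kl_divergence(source_text, target_text, alpha=1.0):
    """KL(source || target), in bits, between two smoothed unigram models.

    A low value suggests the two texts use similar vocabulary; a high
    value suggests they diverge.
    """
    vocab = set(source_text.lower().split()) | set(target_text.lower().split())
    p = unigram_lm(source_text, vocab, alpha)
    q = unigram_lm(target_text, vocab, alpha)
    return sum(p[w] * math.log2(p[w] / q[w]) for w in vocab)


# Illustrative inputs (made up): text describing a link and two candidate targets.
anchor = "cheap flights to madrid"
related = "book cheap flights to madrid and barcelona"
unrelated = "casino poker bonus free spins"

print(kl_divergence(anchor, related))
print(kl_divergence(anchor, unrelated))
```

Under these assumptions, the divergence between the link text and the related target comes out lower than the divergence to the unrelated one, which is the kind of signal a classifier could take as one feature among many.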
High ranking of a Web site in search engines can be directly correlated to high revenues. This ampli...
How to effectively protect against spam on search ranking results is an important issue for contempo...
The steady growth and popularization of the Web has led spammers to develop techniques to circumvent...
This paper applies a language model approach to different sources of information extracted from a We...
We propose link-based techniques for automatic detection of Web spam, a term referring to pages whic...
We perform a statistical analysis of a large collection of Web pages, focusing on spam detection. We...
Search engines are critical in people’s daily lives because they determine the information quality peopl...
We perform a statistical analysis of a large collection of Web pages, focusing on spam detection. We...
Link spam is created with the intention of boosting one target’s rank in exchange for business profit...
Web spam detection is a critical issue in today’s rapidly growing usage of the Internet and the Worl...
In this paper, we study the usability of linguistic features in the context of statistical-based mac...
Web spammers aim to obtain higher ranks for their web pages by including spam contents that deceive ...
Abstract. The page rank of a commercial web site has an enormous economic impact because it directly...
We study the usability of linguistic features in the Web spam classification task. The features were ...
Traditional content-based e-mail spam filtering takes into account content of e-mail messages and a...