Splog is the key challenge in the access of blogosphere. Existing splog-filtering methods are restricted to the way for traditional web spam filtering, without considering the characteristics of blogs. Inspired by the observation that fake writers (writers of splogs) have striking higher consistent writing behavior than real writers (writers of legitimate blogs), we propose to detect splogs by distinguishing fake writers from real writers. To measure how consistent the writing behavior is, we propose the consistency-based features derived from writing interval, writing structure and writing topic. Then we designed a splog-filtering system which can use the consistency-based features effectively and flexibly. The experimental results on Blog...
Session 5Understanding customers is crucial to companies’ decision-making. With the advent of Web 2....
Blogs are personal online diaries, and a relatively recent form of computer-mediated communication. ...
Part 2: Classification – Pattern Recognition (CLASPR)International audienceAuthorship attribution is...
Spam blogs (splogs) have become a major problem in the increasingly popular blogosphere. Splogs are ...
Weblogs, or blogs are an important new way to publish information, engage in discussions, and form c...
This paper focuses on spam blog (splog) detection. Blogs are highly popular, new media social commun...
Spam blog is a subset of blog which contains nothing more than stolen materials and inauthentic text...
Weblogs, or blogs have become an important new way to publish information, engage in discussions and...
Topical noise in blogs arises when bloggers digress from the central topical thrust of their blogs. ...
People use weblogs to express thoughts, present ideas and share knowledge. However, weblogs can also...
This paper studies how to reduce the amount of human su-pervision for identifying splogs / authentic...
The ease of posting comments and links in blogs has attracted spammers as an alternative venue to co...
A blog, or weblog, is an online diary whose writer is known as a blogger. Many bloggers choose to pu...
Abstract—Current ranking algorithms, such as PageRank, Technorati authority, and BI-Impact, favor bl...
The explosion of blogs on the Web in recent years has fostered research interest in the Information...
Session 5Understanding customers is crucial to companies’ decision-making. With the advent of Web 2....
Blogs are personal online diaries, and a relatively recent form of computer-mediated communication. ...
Part 2: Classification – Pattern Recognition (CLASPR)International audienceAuthorship attribution is...
Spam blogs (splogs) have become a major problem in the increasingly popular blogosphere. Splogs are ...
Weblogs, or blogs are an important new way to publish information, engage in discussions, and form c...
This paper focuses on spam blog (splog) detection. Blogs are highly popular, new media social commun...
Spam blog is a subset of blog which contains nothing more than stolen materials and inauthentic text...
Weblogs, or blogs have become an important new way to publish information, engage in discussions and...
Topical noise in blogs arises when bloggers digress from the central topical thrust of their blogs. ...
People use weblogs to express thoughts, present ideas and share knowledge. However, weblogs can also...
This paper studies how to reduce the amount of human su-pervision for identifying splogs / authentic...
The ease of posting comments and links in blogs has attracted spammers as an alternative venue to co...
A blog, or weblog, is an online diary whose writer is known as a blogger. Many bloggers choose to pu...
Abstract—Current ranking algorithms, such as PageRank, Technorati authority, and BI-Impact, favor bl...
The explosion of blogs on the Web in recent years has fostered research interest in the Information...
Session 5Understanding customers is crucial to companies’ decision-making. With the advent of Web 2....
Blogs are personal online diaries, and a relatively recent form of computer-mediated communication. ...
Part 2: Classification – Pattern Recognition (CLASPR)International audienceAuthorship attribution is...