Weblogs, or blogs, have become an important new way to publish information, engage in discussions and form communities. The increasing popularity of blogs has given rise to search and analysis engines focusing on the “blogosphere”. A key requirement of such systems is to identify blogs as they crawl the Web. While this ensures that only blogs are indexed, blog search engines are also often overwhelmed by spam blogs (splogs). Splogs not only incur computational overheads but also reduce user satisfaction. In this paper we first describe experimental results of blog identification using Support Vector Machines (SVM). We compare results of using different feature sets and introduce new features for blog identification. We then report prelimin...
In this paper, we propose the architecture for a weblog data mining system. Our objective is to allo...
Abstract: The number of blogs is increasing at a rapid pace and many potential applications for opinion d...
Over the past few years, weblogs have emerged as a new communication and publication medium on the ...
This paper focuses on spam blog (splog) detection. Blogs are highly popular, new media social commun...
Weblogs, or blogs, are an important new way to publish information, engage in discussions, and form c...
This paper studies how to reduce the amount of human supervision for identifying splogs / authentic...
Spam blogs are a subset of blogs that contain nothing more than stolen materials and inauthentic text...
Spam blogs (splogs) have become a major problem in the increasingly popular blogosphere. Splogs are ...
The ease of posting comments and links in blogs has attracted spammers as an alternative venue to co...
Weblogs, or blogs, are becoming more and more interesting for a wide audience. Millions of personal,...
Splogs are the key challenge in accessing the blogosphere. Existing splog-filtering methods are restri...
Abstract: Problem statement: Information search, collection and categorization from the blogosphere ...
Over the last few years, blogs (web logs) have gained massive popularity and have become one of the ...
Blogs are a new medium emerging on the Internet, representing a new source of information. However, the ...
Purpose – The purpose of this article is to explore the capabilities and limitations of weblog searc...