This chapter introduces information extraction from blog texts. It argues that the classical techniques for information extraction that are commonly used for mining well-formed texts lose some of their validity in the context of blogs. This finding is demonstrated by considering each step in the information extraction process and by illustrating this problem in different applications. In order to tackle the problem of mining content from blogs, algorithms are developed that combine different sources of evidence in the most flexible way. The chapter concludes with ideas for future research.Written for a general publicstatus: publishe
User generated content forms an important domain for mining knowledge. In this paper, we address the...
This research study aims at detecting topics and extracting themes(subtopics) from the blogosphere’s...
In this chapter we define information extraction from text, describe common information extraction t...
Web logs, or blogs, have become more widely-used in recent years. Many people are now documenting th...
This report outlines an inquiry into the area of web data extraction, conducted within the context o...
This book offers a comprehensive overview of the various concepts and research issues about blogs or...
Weblogs, or blogs, are becoming more and more interesting for a wide audience. Millions of personal,...
As the web keeps growing, identifying and retrieving useful in-formation from this huge amount of da...
This paper presents BlogBuster, a tool for extracting a corpus from the blogosphere. The topic of cl...
Blogs are a dynamic communication medium which has been widely established on the web. The BlogForev...
We study the problem of automatically extracting information networks formed by recognizable entitie...
Information extraction regards the processes of structuring and combining content that is explicitly...
AbstractEven though blog contents vary a lot in quality, the disclosure of personal opinions and the...
Blogs represent an important new arena for knowledge discovery in open source intelligence gathering...
User generated content in general, and blogs in particular, form an interesting and relatively littl...
User generated content forms an important domain for mining knowledge. In this paper, we address the...
This research study aims at detecting topics and extracting themes(subtopics) from the blogosphere’s...
In this chapter we define information extraction from text, describe common information extraction t...
Web logs, or blogs, have become more widely-used in recent years. Many people are now documenting th...
This report outlines an inquiry into the area of web data extraction, conducted within the context o...
This book offers a comprehensive overview of the various concepts and research issues about blogs or...
Weblogs, or blogs, are becoming more and more interesting for a wide audience. Millions of personal,...
As the web keeps growing, identifying and retrieving useful in-formation from this huge amount of da...
This paper presents BlogBuster, a tool for extracting a corpus from the blogosphere. The topic of cl...
Blogs are a dynamic communication medium which has been widely established on the web. The BlogForev...
We study the problem of automatically extracting information networks formed by recognizable entitie...
Information extraction regards the processes of structuring and combining content that is explicitly...
AbstractEven though blog contents vary a lot in quality, the disclosure of personal opinions and the...
Blogs represent an important new arena for knowledge discovery in open source intelligence gathering...
User generated content in general, and blogs in particular, form an interesting and relatively littl...
User generated content forms an important domain for mining knowledge. In this paper, we address the...
This research study aims at detecting topics and extracting themes(subtopics) from the blogosphere’s...
In this chapter we define information extraction from text, describe common information extraction t...