The paper is focused on blogosphere research based on the TREC blog distillation task, and aims to explore unbiased and significant features automatically and efficiently. Feedback from faceted feeds is introduced to harvest relevant features and information gain is used to select discriminative features, including the unigrams as well as the patterns of unigram associations. Meanwhile facing the terabyte blog dataset, some flexible processing is adopted in our approach. The evaluation result shows that the selected feedback features can greatly improve the performance and adapt well to the terabyte data
This paper presents the work done for the TREC 2008 blog distillation task. We introduce two new met...
Blogs have grown explosively nowadays and this makes the study of information retrieval (IR) in blog...
We describe the participation of the University of Amsterdam's ILPS group in the web, blog, web, ent...
This paper presents our system and results for the Feed Distillation task in the Blog track at TREC ...
Abstract. This paper outlines our experiments carried out at TREC 2009 Blog Distillation Task. Our s...
User generated content in general, and blogs in particular, form an interesting and relatively littl...
This paper systematically exploited various lexical features for opinion analysis on blog data using...
Topical noise in blogs arises when bloggers digress from the central topical thrust of their blogs. ...
We describe the participation of the University of Amsterdam's ILPS group in the blog, enterprise an...
doi:10.4156/jdcta.vol4. issue8.9 With the increasing of blog users, the traditional blog search can ...
This paper describes the PKUTM participation in the TREC 2010 Blog Track. We only concentrated on th...
Abstract. User generated content in general, and blogs in particu-lar, form an interesting and relat...
We address the task of (blog) feed distillation: to find blogs that are principally devoted to a giv...
We describe our participation in the TREC 2007 Blog track. In the opinion task we looked at the diff...
For opinion finding task our method of the combination of 5 Windows method and Pseudo Relevance Feed...
This paper presents the work done for the TREC 2008 blog distillation task. We introduce two new met...
Blogs have grown explosively nowadays and this makes the study of information retrieval (IR) in blog...
We describe the participation of the University of Amsterdam's ILPS group in the web, blog, web, ent...
This paper presents our system and results for the Feed Distillation task in the Blog track at TREC ...
Abstract. This paper outlines our experiments carried out at TREC 2009 Blog Distillation Task. Our s...
User generated content in general, and blogs in particular, form an interesting and relatively littl...
This paper systematically exploited various lexical features for opinion analysis on blog data using...
Topical noise in blogs arises when bloggers digress from the central topical thrust of their blogs. ...
We describe the participation of the University of Amsterdam's ILPS group in the blog, enterprise an...
doi:10.4156/jdcta.vol4. issue8.9 With the increasing of blog users, the traditional blog search can ...
This paper describes the PKUTM participation in the TREC 2010 Blog Track. We only concentrated on th...
Abstract. User generated content in general, and blogs in particu-lar, form an interesting and relat...
We address the task of (blog) feed distillation: to find blogs that are principally devoted to a giv...
We describe our participation in the TREC 2007 Blog track. In the opinion task we looked at the diff...
For opinion finding task our method of the combination of 5 Windows method and Pseudo Relevance Feed...
This paper presents the work done for the TREC 2008 blog distillation task. We introduce two new met...
Blogs have grown explosively nowadays and this makes the study of information retrieval (IR) in blog...
We describe the participation of the University of Amsterdam's ILPS group in the web, blog, web, ent...