We measure the WT10g test collection, used in the TREC-9 and TREC 2001 Web Tracks, and the .GOV test collection, used in the TREC 2002 Web and Interactive Tracks, with common measures used in the web topology community, in order to see if these collections “look like” the web. This is not an idle question; characteristics of the web, such as power law relationships, diameter, and connected components, have all been observed within the scope of general web crawls, constructed by blindly following links. The .GOV collection is a fairly straightforward 18GB crawl of sites in the .gov domain. In contrast, WT10g was carved out from a much larger crawl specifically to be a web search test collection within the reach of university researchers. Do such...
Although at first sight, the web track might seem a copy of the ad hoc track, we discovered that som...
The understanding of the immense and intricate topological structure of the World Wide Web (WWW) is ...
The study of the Web as a graph is not only fascinating in its own right, but also yields valuable i...
Experiments using TREC-style topic descriptions and relevance judgments have recently been carried o...
Abstract. Due to the popularity of Web search engines, a large proportion of real text retrieval que...
Linkage analysis as an aid to web search has been assumed to be of significant benefit and we know t...
In line with the wishes of last year's participants, this year's VLC track was essentially...
The goal of the TREC Web track is to explore and evaluate retrieval approaches over large-scale sub...
The goal of the TREC Web track over the past few years has been to explore and evaluate innovative r...
Abstract. The lack of a large scale Chinese test collection is an obstacle to the Chinese informatio...
Abstract: We describe our experiments with the .GOV collection in both the topic distillation and nam...
The lack of a large scale Chinese test collection is an obstacle to the Chinese information retrieva...
A frozen 18.5 million page snapshot of part of the Web has been created to enable and encourage mean...
Past research into text retrieval methods for the Web has been restricted by the lack of a test coll...
Anchor text has been proven efficient in earlier TREC experiments on the homepage finding task [1] and s...