The aims of this paper are twofold. Our first aim is to compare results of the earlier Terabyte tracks to the Million Query track. We submitted a number of runs using different document representations (such as full-text, title-fields, or incoming anchor-texts) to increase pool diversity. The initial results show broad agreement in system rankings over various measures on topic sets judged at both Terabyte and Million Query tracks, with runs using the full-text index giving superior results on all measures, but also some noteworthy upsets. Our second aim is to explore the use of parsimonious language models for retrieval on terabyte-scale collections. These models are smaller thus more efficient than the standard language models when used a...
Information retrieval algorithms leverage various collection statistics to improve performance. Beca...
While there are many studies on information retrieval models using full-text, there are presently no...
Contains fulltext : 73393.pdf (publisher's version ) (Open Access)Language models ...
In this paper we explore the use of parsimonious language models for web retrieval. These models are...
In this paper we explore the use of parsimonious language models for web retrieval. These models are...
We systematically investigate a new approach to estimating the parameters of language models for inf...
Abstract: IR group of Tsinghua University this year has used its TMiner text retrieval system for in...
Because of the world wide web, information retrieval systems are now used by millions of untrained u...
We describe our participation in the TREC 2004 Web and Terabyte tracks. For the web track, we employ...
Topic models help make sense of large text collections. Automatically evaluating their output and de...
International audienceMost existing Information Retrieval model including probabilistic and vector s...
A retrieval system is a very important part in a question answering framework. It reduces the number...
The main obstacle for providing focused search is the relative opaqueness of search request—searcher...
The main obstacle for providing focused search is the relative opaqueness of search request -- searc...
In the KL divergence framework, the extended language modeling approach has a critical problem of es...
Information retrieval algorithms leverage various collection statistics to improve performance. Beca...
While there are many studies on information retrieval models using full-text, there are presently no...
Contains fulltext : 73393.pdf (publisher's version ) (Open Access)Language models ...
In this paper we explore the use of parsimonious language models for web retrieval. These models are...
In this paper we explore the use of parsimonious language models for web retrieval. These models are...
We systematically investigate a new approach to estimating the parameters of language models for inf...
Abstract: IR group of Tsinghua University this year has used its TMiner text retrieval system for in...
Because of the world wide web, information retrieval systems are now used by millions of untrained u...
We describe our participation in the TREC 2004 Web and Terabyte tracks. For the web track, we employ...
Topic models help make sense of large text collections. Automatically evaluating their output and de...
International audienceMost existing Information Retrieval model including probabilistic and vector s...
A retrieval system is a very important part in a question answering framework. It reduces the number...
The main obstacle for providing focused search is the relative opaqueness of search request—searcher...
The main obstacle for providing focused search is the relative opaqueness of search request -- searc...
In the KL divergence framework, the extended language modeling approach has a critical problem of es...
Information retrieval algorithms leverage various collection statistics to improve performance. Beca...
While there are many studies on information retrieval models using full-text, there are presently no...
Contains fulltext : 73393.pdf (publisher's version ) (Open Access)Language models ...