Simulation and analysis have shown that selective search can reduce the cost of large-scale distributed information retrieval. By partitioning the collection into small topical shards, and then using a resource ranking algorithm to choose a subset of shards to search for each query, fewer postings are evaluated. Here we extend the study of selective search using a fine-grained simulation investigating: selective search efficiency in a parallel query processing environment; the difference in efficiency when term-based and sample-based resource selection algorithms are used; and the effect of two policies for assigning index shards to machines. Results obtained for two large datasets and four large query logs confirm that selective search is ...
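The mechanism described above (topical shards, a resource ranking step, and a search restricted to the selected shards) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the toy shards, the function names, and in particular the scoring rule, which ranks a shard by the total length of its postings lists for the query terms. That rule is a simple term-based stand-in, not one of the resource selection algorithms evaluated in the paper.

```python
from collections import defaultdict

# Hypothetical toy collection, already partitioned into topical shards.
TOY_SHARDS = {
    "sports": {"d1": "football match score", "d2": "tennis match final"},
    "cooking": {"d3": "pasta recipe sauce", "d4": "bread recipe"},
    "finance": {"d5": "stock market score", "d6": "bond market"},
}


def build_shard_index(docs):
    """Build a tiny inverted index for one shard: term -> list of doc ids."""
    index = defaultdict(list)
    for doc_id, text in docs.items():
        for term in set(text.lower().split()):
            index[term].append(doc_id)
    return index


def rank_shards(shard_indexes, query):
    """Rank shards by summed postings-list length for the query terms.

    Illustrative term-based scoring only; ties are broken by shard name.
    """
    terms = query.lower().split()
    scores = {
        shard_id: sum(len(index.get(t, [])) for t in terms)
        for shard_id, index in shard_indexes.items()
    }
    return sorted(scores, key=lambda s: (-scores[s], s))


def search(shard_indexes, query, shard_ids):
    """Search only the given shards; return matching docs and postings evaluated."""
    terms = query.lower().split()
    hits = set()
    postings_evaluated = 0
    for shard_id in shard_ids:
        index = shard_indexes[shard_id]
        for t in terms:
            postings = index.get(t, [])
            postings_evaluated += len(postings)
            hits.update(postings)
    return hits, postings_evaluated


def selective_search(shard_indexes, query, k):
    """Search only the k highest-ranked shards for this query."""
    selected = rank_shards(shard_indexes, query)[:k]
    return search(shard_indexes, query, selected)


if __name__ == "__main__":
    indexes = {sid: build_shard_index(docs) for sid, docs in TOY_SHARDS.items()}
    query = "match score"
    print("shard ranking:", rank_shards(indexes, query))
    print("selective (k=1):", selective_search(indexes, query, k=1))
    print("exhaustive:", search(indexes, query, list(indexes)))
```

On this toy data the selective run evaluates fewer postings than the exhaustive run, at the price of missing a matching document that lives in an unselected shard. The sketch says nothing about latency, parallel query processing, or shard-to-machine assignment, which are the questions the simulation in the paper addresses.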
This paper evaluates the retrieval effectiveness of distributed information retrieval systems in rea...
This paper describes HyPS, a hybrid parallel window / distributed tree search algorithm. Using this ...
When a digital library is decomposed into many geographically distributed repositories, search effic...
Our work shows that the query latency for selective search over a topically partitioned collection c...
Selective search is a distributed retrieval technique that reduces the computational cost of large-s...
The proliferation of online information resources increases the importance of effective and...
A search engine infrastructure must be able to provide the same quality of service to all queries re...
Large-scale web and text retrieval systems deal with amounts of data that greatly exceed the capacit...
To address the rapid growth of the Internet, modern Web search engines have to adopt distri...
To address the rapid growth of the Internet, modern Web search engines have to adopt distributed orga...
Information retrieval systems often have to deal with very large amounts of data. They must be able ...
As information explodes across the Internet and intranets, information retrieval (IR) systems must c...
The creation of very large-scale multimedia search engines, with more than one billion images and v...
In our previous works, we advocated the use of distributed search architectures where mult...