Search engines use replication and distribution of large indices across many query servers to achieve efficient retrieval. Under high query load, queries can be scheduled to replicas that are expected to be idle soonest, facilitated by the use of predicted query response times. However, the overhead of making response time predictions can hinder the usefulness of query scheduling under low query load. In this paper, we propose a hybrid scheduling approach that combines the scheduling methods appropriate for both low and high load conditions, and can adapt in response to changing conditions. We deploy a simulation framework, which is prepared with actual and predicted response times for real Web search queries for one full day. Our experimen...
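As an illustration of the idea only, the following minimal Python sketch picks a replica for an incoming query. It is built on assumptions that the abstract does not state: the low-load policy is taken to be "shortest queue", the load signal is the mean queue length across replicas, and the switching point `load_threshold` is a hypothetical parameter. The names `choose_replica`, `queue_lengths`, and `backlog_ms` are likewise illustrative and are not taken from the paper.

```python
from typing import Sequence, Tuple


def choose_replica(
    queue_lengths: Sequence[int],
    backlog_ms: Sequence[float],
    load_threshold: float = 2.0,  # hypothetical switching point, not from the paper
) -> Tuple[int, bool]:
    """Return (replica index, whether predicted response times were used).

    queue_lengths[i] is the number of queries waiting on replica i.
    backlog_ms[i] is the sum of predicted response times of those queries,
    i.e. an estimate of how long replica i will remain busy.
    """
    if not queue_lengths or len(queue_lengths) != len(backlog_ms):
        raise ValueError("need one queue length and one backlog per replica")

    n = len(queue_lengths)
    mean_queue = sum(queue_lengths) / n

    if mean_queue < load_threshold:
        # Assumed low-load policy: shortest queue, which needs no predictions
        # and so avoids the prediction overhead.
        return min(range(n), key=lambda i: queue_lengths[i]), False

    # High load: send the query to the replica expected to be idle soonest,
    # i.e. the one with the smallest predicted backlog.
    return min(range(n), key=lambda i: backlog_ms[i]), True


if __name__ == "__main__":
    # Low load: the mean queue length is 1.0, so queue length decides.
    print(choose_replica([1, 0, 2], [50.0, 400.0, 10.0]))
    # High load: the mean queue length is 3.0, so predicted backlog decides.
    print(choose_replica([3, 2, 4], [500.0, 900.0, 300.0]))
```

In this sketch the predictor itself is out of scope: `backlog_ms` is assumed to be maintained elsewhere, and a real hybrid scheduler would also need to decide when predictions are computed so that their cost is actually avoided under low load.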
Web search engines are optimized to reduce the high-percentile response time to consistently provide...
For many search settings, distributed/replicated search engines deploy a large number of machines t...
Web search engines are known to cache the results of previously issued queries. The stored results t...
Search engines use replication and distribution of large indices across many query servers to achiev...
Search engines use replication and distribution of large indices across many query servers to achiev...
For increased efficiency, an information retrieval system can split its index into multiple shards, ...
Dynamic pruning strategies permit efficient retrieval by not fully scoring all postings of the docum...
To enhance effectiveness, a user’s query can be rewritten internally by the search engine in many wa...
A commercial web search engine shards its index among many servers, and therefore the response time ...
Search engines use inverted files as index data structures to speed up the solution of user queries...
While Web search engines are built to cope with a large number of queries, query traffic can exceed ...
Large web search engines process billions of queries each day over tens of billions of documents wit...
A search engine infrastructure must be able to provide the same quality of service to all queries re...
While Web search engines are built to cope with a large number of queries, query traffic c...
We investigate the impact of query result prefetching on the efficiency and effectiveness of web sea...