To enhance effectiveness, a user’s query can be rewritten internally by the search engine in many ways, for example by applying prox- imity, or by expanding the query with related terms. However, ap- proaches that benefit effectiveness often have a negative impact on efficiency, which has impacts upon the user satisfaction, if the query is excessively slow. In this paper, we propose a novel framework for using the predicted execution time of various query rewritings to select between alternatives on a per-query basis, in a manner that ensures both effectiveness and efficiency. In particular, we propose the prediction of the execution time of ephemeral (e.g., proximity) posting lists generated from uni-gram inverted index posting lists, whic...
Large scale retrieval systems often employ cascaded ranking architectures, in which an initial set o...
Search engines use replication and distribution of large indices across many query servers to achiev...
Web search engines are optimized to reduce the high-percentile response time to consistently provide...
To enhance effectiveness, a user’s query can be rewritten internally by the search engine in many wa...
Dynamic pruning strategies permit efficient retrieval by not fully scoring all postings of the docum...
Dynamic pruning strategies are effective yet permit efficient retrieval by pruning - i.e. not fully ...
Search engines are exceptionally important tools for accessing information in today’s world. In sati...
Modern search engines face enormous performance challenges. The most popular ones process tens of th...
A commercial web search engine shards its index among many servers, and therefore the response time ...
Predicting the query latency by a search engine has important benefits, for instance, in allowing th...
Predicting the query latency by a search engine has important benefits, for instance, in allowing th...
Retrieval can be made more efficient by deploying dynamic pruning strategies such as WAND, which do ...
For increased efficiency, an information retrieval system can split its index into multiple shards, ...
Web search engines have to deal with a rapidly increasing amount of information, high query loads an...
Search engines use replication and distribution of large indices across many query servers to achiev...
Large scale retrieval systems often employ cascaded ranking architectures, in which an initial set o...
Search engines use replication and distribution of large indices across many query servers to achiev...
Web search engines are optimized to reduce the high-percentile response time to consistently provide...
To enhance effectiveness, a user’s query can be rewritten internally by the search engine in many wa...
Dynamic pruning strategies permit efficient retrieval by not fully scoring all postings of the docum...
Dynamic pruning strategies are effective yet permit efficient retrieval by pruning - i.e. not fully ...
Search engines are exceptionally important tools for accessing information in today’s world. In sati...
Modern search engines face enormous performance challenges. The most popular ones process tens of th...
A commercial web search engine shards its index among many servers, and therefore the response time ...
Predicting the query latency by a search engine has important benefits, for instance, in allowing th...
Predicting the query latency by a search engine has important benefits, for instance, in allowing th...
Retrieval can be made more efficient by deploying dynamic pruning strategies such as WAND, which do ...
For increased efficiency, an information retrieval system can split its index into multiple shards, ...
Web search engines have to deal with a rapidly increasing amount of information, high query loads an...
Search engines use replication and distribution of large indices across many query servers to achiev...
Large scale retrieval systems often employ cascaded ranking architectures, in which an initial set o...
Search engines use replication and distribution of large indices across many query servers to achiev...
Web search engines are optimized to reduce the high-percentile response time to consistently provide...