Search engines employ caching techniques in main memory to improve system efficiency and scalability. In this thesis, we focus on improving the cache performance for web search engines where our contributions can be separated into two main parts. Firstly, we investigate the impact of the sample size for frequency statistics for most popular cache eviction strategies in the literature, and show that cache performance improves with larger samples, i.e., by storing the frequencies of all (or, most of) the queries seen by the search engine. We mitigate the cost of storing a large history of frequencies by using a Counting Bloom Filter based data structure that is able to store frequency statistics in a compact manner, while still providing comp...
Query result caching is an important mechanism for search engine efficiency. In this study, we first...
This paper discusses the design and implementation of SDC, a new caching strategy aimed to efficient...
Web search engines are known to cache the results of previously issued queries. The stored results t...
In this paper we explore the problem of Caching of Search Engine Query Results, in order to reduce t...
This article discusses efficiency and effectiveness issues in caching the results of queries submitt...
This article discusses efficiency and effectiveness issues in caching the results of queries submitt...
Web search engines process several millions of queries per second over several billions of documents...
Abstract. This paper studies the impact of the tail of the query distribution on caches of Web searc...
We propose to use a score cache, which stores the score of the k.th result of a query, to accelerate...
Cataloged from PDF version of article.Search engines and large-scale IR systems need to cache query ...
Large web search engines need to be able to process thousands of queries per second on collections ...
Search engines and large-scale IR systems need to cache query results for efficiency and scalability...
Search engines and large-scale IR systems need to cache query results for efficiency and scalability...
Web search engines serve millions of query requests per day. Caching query results is one of the mos...
A commonly used technique for improving search engine performance is result caching. In result cachi...
Query result caching is an important mechanism for search engine efficiency. In this study, we first...
This paper discusses the design and implementation of SDC, a new caching strategy aimed to efficient...
Web search engines are known to cache the results of previously issued queries. The stored results t...
In this paper we explore the problem of Caching of Search Engine Query Results, in order to reduce t...
This article discusses efficiency and effectiveness issues in caching the results of queries submitt...
This article discusses efficiency and effectiveness issues in caching the results of queries submitt...
Web search engines process several millions of queries per second over several billions of documents...
Abstract. This paper studies the impact of the tail of the query distribution on caches of Web searc...
We propose to use a score cache, which stores the score of the k.th result of a query, to accelerate...
Cataloged from PDF version of article.Search engines and large-scale IR systems need to cache query ...
Large web search engines need to be able to process thousands of queries per second on collections ...
Search engines and large-scale IR systems need to cache query results for efficiency and scalability...
Search engines and large-scale IR systems need to cache query results for efficiency and scalability...
Web search engines serve millions of query requests per day. Caching query results is one of the mos...
A commonly used technique for improving search engine performance is result caching. In result cachi...
Query result caching is an important mechanism for search engine efficiency. In this study, we first...
This paper discusses the design and implementation of SDC, a new caching strategy aimed to efficient...
Web search engines are known to cache the results of previously issued queries. The stored results t...