Abstract: The explosion of content in distributed information retrieval (IR) systems requires new mechanisms to attain timely and accurate retrieval of unstructured text. In this paper, we compare two mechanisms to improve IR system performance: partial collection replication and caching. When queries have locality, both mechanisms return results more quickly than sending queries to the original collection(s). Caches return results when a query exactly matches a previous one. Partial replicas are a form of caching that return results when the IR technology determines the query is a good match. Caches are simpler and faster, but replicas can increase locality by detecting similarity between queries that are not exactly the same. We use real...
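The contrast between exact-match caching and similarity-based matching can be illustrated with a minimal Python sketch. It is not the paper's method: the class names `ExactCache` and `SimilarityLookup`, the Jaccard term-overlap score, and the 0.5 threshold are all assumptions chosen for illustration, standing in for whatever test the IR technology applies to decide that a query is a good match.

```python
from typing import Dict, FrozenSet, List, Optional, Tuple


def terms(query: str) -> FrozenSet[str]:
    """Lowercase, whitespace-split term set (a deliberately crude tokenizer)."""
    return frozenset(query.lower().split())


class ExactCache:
    """Returns stored results only when the query string matches exactly."""

    def __init__(self) -> None:
        self._store: Dict[str, List[str]] = {}

    def put(self, query: str, results: List[str]) -> None:
        self._store[query] = results

    def get(self, query: str) -> Optional[List[str]]:
        return self._store.get(query)


class SimilarityLookup:
    """Hypothetical stand-in for a replica's 'good match' test.

    Assumption: Jaccard overlap of query terms with a fixed threshold.
    """

    def __init__(self, threshold: float = 0.5) -> None:
        self.threshold = threshold
        self._entries: List[Tuple[FrozenSet[str], List[str]]] = []

    def put(self, query: str, results: List[str]) -> None:
        self._entries.append((terms(query), results))

    def get(self, query: str) -> Optional[List[str]]:
        q = terms(query)
        best: Optional[List[str]] = None
        best_score = 0.0
        for stored_terms, results in self._entries:
            union = q | stored_terms
            score = len(q & stored_terms) / len(union) if union else 0.0
            if score > best_score:
                best, best_score = results, score
        # Serve locally only if the best candidate clears the threshold.
        return best if best_score >= self.threshold else None


cache = ExactCache()
cache.put("distributed information retrieval", ["doc1", "doc2"])

lookup = SimilarityLookup(threshold=0.5)
lookup.put("distributed information retrieval", ["doc1", "doc2"])

reordered = "information retrieval distributed"
exact_hit = cache.get(reordered)       # miss: the strings differ
similar_hit = lookup.get(reordered)    # hit: the term sets are identical
```

The exact cache is a single dictionary lookup, while the similarity lookup scans its entries and scores each one. That mirrors the trade-off stated in the abstract: the cache is simpler and faster, while similarity matching finds locality among queries that are not identical.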
Similarity search is important for many data-intensive applications to identify a set of similar obj...
We believe the techniques for evaluating clone detectors can be improved, and that the improvements ...
Similarity search is a key operation in multimedia retrieval systems and recom...
The explosion of content in distributed information retrieval (IR) systems requires new mechanisms t...
The explosion of content in distributed information retrieval (IR) systems requires new mechanisms i...
As information explodes across the Internet and intranets, information retrieval (IR) systems must c...
This paper focuses on similarity caching systems, in which a user request for ...
In this paper we explore the problem of Caching of Search Engine Query Results, in order to reduce t...
Feature-rich data, such as audio-video recordings, digital images, and results of scientific experim...
Search engines and large-scale IR systems need to cache query results for efficiency and scalability...
For peer-to-peer web search engines it is important to quickly process queries and return search res...
Search engines and large scale IR systems need to cache query results for efficiency and scalability...
Similarity caching allows requests for an item i to be served by a similar ite...
Caching is one of the techniques that Information Retrieval Systems (IRS) and Web Search Engines (WS...
The amount of information available over the Internet is increasing daily as well as the importance ...