This paper discusses a novel approach developed for static index pruning that takes into account the locality of occurrences of words in the text. We use this new approach to propose and experiment simple and effective pruning methods that allow a fast construction of the pruned index. The methods proposed here are specially useful for pruning in environments where the document database changes continuously, such as large scale web search engines. Extensive experiments are presented showing that the proposed methods can achieve high compression rates while maintaining the quality of results for the most common query types present in modern search engines, i.e. conjunctive and phrase queries. In the experiments, our locality based pruning ap...
Abstract. Document-centric static index pruning methods provide smaller indexes and faster query tim...
The presence of spam in a document ranking is a major issue for Web search engines. Common approache...
Retrieval can be made more efficient by deploying dynamic pruning strategies such as WAND, which do ...
Static index pruning techniques permanently remove a presumably redundant part of an inverted file, ...
Static index pruning techniques permanently remove a presumably redundant part of an inverted file, ...
Web search engines typically index and retrieve at the page level. In this study, we investigate a d...
The Web search engines maintain large-scale inverted indexes which are queried thousands of times pe...
We propose incorporating query views in a number of static pruning strategies, namely term-centric, ...
Carterette, BenStatic index pruning methods have been proposed to reduce the index size of informati...
Search engines are exceptionally important tools for accessing information in today’s world. In sati...
Web search engines have to deal with a rapidly increasing amount of information, high query loads an...
Results caching is an efficient technique for reducing the query processing load, hence it is common...
Large web search engines process billions of queries each day over tens of billions of documents wit...
We present a static index pruning method, to be used in ad-hoc document retrieval tasks, that follow...
Static index pruning techniques aim at removing from the posting lists of an inverted file the refer...
Abstract. Document-centric static index pruning methods provide smaller indexes and faster query tim...
The presence of spam in a document ranking is a major issue for Web search engines. Common approache...
Retrieval can be made more efficient by deploying dynamic pruning strategies such as WAND, which do ...
Static index pruning techniques permanently remove a presumably redundant part of an inverted file, ...
Static index pruning techniques permanently remove a presumably redundant part of an inverted file, ...
Web search engines typically index and retrieve at the page level. In this study, we investigate a d...
The Web search engines maintain large-scale inverted indexes which are queried thousands of times pe...
We propose incorporating query views in a number of static pruning strategies, namely term-centric, ...
Carterette, BenStatic index pruning methods have been proposed to reduce the index size of informati...
Search engines are exceptionally important tools for accessing information in today’s world. In sati...
Web search engines have to deal with a rapidly increasing amount of information, high query loads an...
Results caching is an efficient technique for reducing the query processing load, hence it is common...
Large web search engines process billions of queries each day over tens of billions of documents wit...
We present a static index pruning method, to be used in ad-hoc document retrieval tasks, that follow...
Static index pruning techniques aim at removing from the posting lists of an inverted file the refer...
Abstract. Document-centric static index pruning methods provide smaller indexes and faster query tim...
The presence of spam in a document ranking is a major issue for Web search engines. Common approache...
Retrieval can be made more efficient by deploying dynamic pruning strategies such as WAND, which do ...