Query processing is one of the main bottlenecks in large-scale search engines. Retrieving the top k most relevant documents for a given query can be extremely expensive, as it involves scoring large amounts of documents. Several dynamic pruning techniques have been introduced in the literature to tackle this problem, such as BlockMaxWAND, which splits the inverted index into constant-sized blocks and stores the maximum document-term scores per block; this information can be used during query execution to safely skip low-score documents, producing many-fold speedups over exhaustive methods. We introduce a refinement for BlockMaxWAND that uses variable-sized blocks, rather than constant-sized. We set up the problem of deciding the block partitioni...
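The constant-sized block idea described above can be illustrated with a minimal sketch. It is not the authors' implementation: the posting-list layout (doc-id-sorted pairs of document id and term score), the function names `build_block_max`, `block_upper_bound`, and `can_skip`, and the linear block lookup are all assumptions made for illustration. The sketch only shows how per-block maximum scores give a safe upper bound that lets a candidate document be skipped without scoring it.

```python
from typing import List, Sequence, Tuple

Posting = Tuple[int, float]      # (doc_id, term score); hypothetical layout
BlockMax = Tuple[int, float]     # (last doc_id in block, max score in block)


def build_block_max(postings: Sequence[Posting], block_size: int) -> List[BlockMax]:
    """Split a doc-id-sorted posting list into constant-sized blocks and
    record, for each block, its last doc id and its maximum term score."""
    if block_size <= 0:
        raise ValueError("block_size must be positive")
    blocks: List[BlockMax] = []
    for start in range(0, len(postings), block_size):
        block = postings[start:start + block_size]
        blocks.append((block[-1][0], max(score for _, score in block)))
    return blocks


def block_upper_bound(blocks: Sequence[BlockMax], doc_id: int) -> float:
    """Upper bound on the term score of doc_id: the maximum score of the
    first block whose last doc id is >= doc_id (0.0 past the end of the list).
    A linear scan is used for clarity; a real index would skip or search."""
    for last_doc, max_score in blocks:
        if doc_id <= last_doc:
            return max_score
    return 0.0


def can_skip(block_lists: Sequence[Sequence[BlockMax]], doc_id: int,
             threshold: float) -> bool:
    """A document is safe to skip when the sum of its per-term block upper
    bounds cannot exceed the current top-k score threshold."""
    bound = sum(block_upper_bound(blocks, doc_id) for blocks in block_lists)
    return bound <= threshold
```

For example, with two query terms, a candidate document whose enclosing blocks have maxima 0.3 and 0.4 cannot score above 0.7, so it can be skipped once the k-th best score found so far is at least 0.7. Variable-sized blocks would change only how `build_block_max` chooses the block boundaries.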
Web search engines have to deal with a rapidly increasing amount of information, high query loads an...
Processing top-k bag-of-words queries is critical to many information retrieval applications, includ...
Abstract: We give new space/time tradeoffs for compressed indexes that answer document retrieval queri...
Query processing is one of the main bottlenecks in large-scale search engines. Retrieving the top k m...
BlockMaxWand is a recent advance on the Wand dynamic pruning technique, which allows efficient retr...
BlockMaxWand is a recent advance on the Wand dynamic pruning technique, which allows efficient retri...
This thesis aims to improve state-of-the-art algorithms for the top-k document retrieval problem. This p...
Abstract: Large search engines are complex systems composed of several services. Each service is compo...
Large web search engines are facing formidable performance challenges because they have to process t...
© 2017 ACM. Many real applications in real-time news stream advertising call for efficient processin...
Previous research into the efficiency of text retrieval systems has dealt primarily with methods tha...
We propose to use a score cache, which stores the score of the k-th result of a query, to accelerate...
Abstract. Given a collection of strings (called documents), the top-k document retrieval problem is ...
Abstract: An optimal index solving top-k document retrieval [Navarro and Nekrich, SODA’12] takes O(m...