This Thesis aims to improve state-of-the-art algorithms for Top-K document retrieval problem. This problem asks for finding the most relevant k documents for a given query in collection of Web Pages. After implementing and testing several state of the art algorithms for Top-K retrieval, we focus on Block-Max-Wand that is the best known solution for this problem. Block-Max-Wand is based on the idea of splitting posting lists into fixed-size blocks and approximating the scores with the maximum value of the block which they belong to. This improves query time but increases space usage. In this Thesis we propose to use variable-size blocks in the above algorithm. Indeed, we observe that the algorithm is faster if the scores of the postings are ...
Previous research into the efficiency of text retrieval systems has dealt primarily with methods tha...
Previous research into the efficiency of text retrieval systems has dealt primarily with methods tha...
Traditional top-k algorithms, e.g., TA and NRA, have been successfully applied in many areas such as...
Large web search engines are facing formidable performance challenges because they have to process t...
Query processing is one of the main bo.lenecks in large-scale search engines. Retrieving the top k m...
AbstractLarge search engines are complex systems composed by several services. Each service is compo...
Abstract: An optimal index solving top-k document retrieval [Navarro and Nekrich, SODA’12] takes O(m...
Let D = {d1, d2, d3,..., dD} be a given set of D (string) docu-ments of total length n. The top-k do...
This thesis focuses on top-k document retrieval. The study of such query evaluation method is motiva...
Abstract. Given a collection of strings (called documents), the top-k document retrieval problem is ...
[[abstract]]In the document retrieval problem [9], we are given a collection of documents (strings) ...
Let Dbea set of n elements each associated with a real-valued weight, and Q be the set of all possib...
International audienceThe general problem of answering top-k queries can be modeled using lists of d...
International audienceThe general problem of answering top-k queries can be modeled using lists of d...
We propose and study a new ranking problem in versioned data-bases. Consider a database of versioned...
Previous research into the efficiency of text retrieval systems has dealt primarily with methods tha...
Previous research into the efficiency of text retrieval systems has dealt primarily with methods tha...
Traditional top-k algorithms, e.g., TA and NRA, have been successfully applied in many areas such as...
Large web search engines are facing formidable performance challenges because they have to process t...
Query processing is one of the main bo.lenecks in large-scale search engines. Retrieving the top k m...
AbstractLarge search engines are complex systems composed by several services. Each service is compo...
Abstract: An optimal index solving top-k document retrieval [Navarro and Nekrich, SODA’12] takes O(m...
Let D = {d1, d2, d3,..., dD} be a given set of D (string) docu-ments of total length n. The top-k do...
This thesis focuses on top-k document retrieval. The study of such query evaluation method is motiva...
Abstract. Given a collection of strings (called documents), the top-k document retrieval problem is ...
[[abstract]]In the document retrieval problem [9], we are given a collection of documents (strings) ...
Let Dbea set of n elements each associated with a real-valued weight, and Q be the set of all possib...
International audienceThe general problem of answering top-k queries can be modeled using lists of d...
International audienceThe general problem of answering top-k queries can be modeled using lists of d...
We propose and study a new ranking problem in versioned data-bases. Consider a database of versioned...
Previous research into the efficiency of text retrieval systems has dealt primarily with methods tha...
Previous research into the efficiency of text retrieval systems has dealt primarily with methods tha...
Traditional top-k algorithms, e.g., TA and NRA, have been successfully applied in many areas such as...