Top-$k$ query processing is a fundamental building block for efficient ranking in a large number of applications. Efficiency is a central issue, especially for distributed settings, when the data is spread across different nodes in a network. This paper introduces novel optimization methods for top-$k$ aggregation queries in such distributed environments. The optimizations can be applied to all algorithms that fall into the frameworks of the prior TPUT and KLEE methods. The optimizations address three degrees of freedom: 1) hierarchically grouping input lists into top-$k$ operator trees and optimizing the tree structure, 2) computing data-adaptive scan depths for different input sources, and 3) data-adaptive sampling of a small sub...
Ranking queries are essential tools to process large amounts of probabilistic data that encode expon...
A top-k query combines different rankings of the same set of objects and returns the k objects with ...
A top-k query combines different rankings of the same set of objects and returns the k objects with ...
Top-$k$ query processing is a fundamental building block for efficient ranking in a large number of ...
Top-k query processing is a fundamental building block for efficient ranking in a large number of ap...
Top-k query processing is a fundamental building block for efficient ranking in a large number of ap...
Distributed top-$k$ query processing is increasingly becoming an essential functionality in a large ...
Distributed top-k query processing is increasingly becoming an essential functionality in a large nu...
We consider distributed top-k queries in wide-area networks where the index lists for the attribute...
This paper presents a new algorithm to answer top-k queries (e.g. “find the k objects with the highe...
This paper addresses the efficient processing of top-k queries in wide-area distributed data reposi...
Query processing in traditional information management systems has moved from an exact match model t...
Abstract — Searching and mining large graphs today is critical to a variety of application domains, ...
We consider the problem of query optimization in distributed data stream systems where multiple cont...
Supporting queries over dispersed data stored in large-scale distributed systems, such as peer-to-pe...
Ranking queries are essential tools to process large amounts of probabilistic data that encode expon...
A top-k query combines different rankings of the same set of objects and returns the k objects with ...
A top-k query combines different rankings of the same set of objects and returns the k objects with ...
Top-$k$ query processing is a fundamental building block for efficient ranking in a large number of ...
Top-k query processing is a fundamental building block for efficient ranking in a large number of ap...
Top-k query processing is a fundamental building block for efficient ranking in a large number of ap...
Distributed top-$k$ query processing is increasingly becoming an essential functionality in a large ...
Distributed top-k query processing is increasingly becoming an essential functionality in a large nu...
We consider distributed top-k queries in wide-area networks where the index lists for the attribute...
This paper presents a new algorithm to answer top-k queries (e.g. “find the k objects with the highe...
This paper addresses the efficient processing of top-k queries in wide-area distributed data reposi...
Query processing in traditional information management systems has moved from an exact match model t...
Abstract — Searching and mining large graphs today is critical to a variety of application domains, ...
We consider the problem of query optimization in distributed data stream systems where multiple cont...
Supporting queries over dispersed data stored in large-scale distributed systems, such as peer-to-pe...
Ranking queries are essential tools to process large amounts of probabilistic data that encode expon...
A top-k query combines different rankings of the same set of objects and returns the k objects with ...
A top-k query combines different rankings of the same set of objects and returns the k objects with ...