Optimal Splitters for Database Partitioning with Size Bounds

Ross, Kenneth A.
Cieslewicz, John

Open PDF

Open link

Publication date

January 2008

DOI

10.7916/D87W6M2S

Publisher

Columbia University Libraries/Information Services

Abstract

Partitioning is an important step in several database algorithms, including sorting, aggregation, and joins. Partitioning is also fundamental for dividing work into equal-sized (or balanced) parallel subtasks. In this paper, we aim to find, materialize and maintain a set of partitioning elements (splitters) for a data set. Unlike traditional partitioning elements, our splitters define both inequality and equality partitions, which allows us to bound the size of the inequality partitions. We provide an algorithm for determining an optimal set of splitters from a sorted data set and show that it has time complexity O(k lg_2 N), where k is the number of splitters requested and N is the size of the data set. We show how the algorithm can be ext...

Extracted data

We use cookies to provide a better user experience.

Data Protection

Optimal Splitters for Database Partitioning with Size Bounds

Abstract

Extracted data

Optimal Splitters for Database Partitioning with Size Bounds

Abstract

Extracted data

Related items

Related items