Data partitioning is a critical operation for manipulating large datasets because it subdivides tasks into pieces that are more amenable to efficient processing. It is often the limiting factor in database performance and represents a significant fraction of the overall runtime of large data queries. This article measures the performance and energy of state-of-the-art software partitioners, and describes and evaluates a hardware range partitioner that further improves efficiency. The software implementation is broken into two phases, allowing separate analysis of the partition function computation and data shuffling costs. Although range partitioning is commonly thought to be more expensive than simpler strategies such as hash partitioning,...
Partitioning a system's functionality among interacting hardware and software components is an impor...
[[abstract]]The concept of green storage in cluster computing has generated a great deal of interest...
Heterogeneous System Development needs Hardware/Software Partitioning performed early on in the deve...
With global pool of data growing at over 2.5 quinitillion bytes per day and over 90% of all data in ...
The multicore era has initiated a move to ubiquitous parallelization of software. In the process, co...
With the breakdown of Dennard Scaling and Moore's law, power consumption appears to be a primary cha...
In the past decade, the exponential growth in commodity CPUs speed has far outpaced advances in memo...
The performance of supercomputers is not growing anymore at the rate it once used to. Several years ...
This paper presents a framework for characterizing the distribution of fine-grained parallelism, dat...
The failure of Dennard scaling [Bohr, 2007] and the rapid growth of data produced and consumed daily...
In the last decades, high-performance large-scale systems have been a fundamental tool for scientifi...
In multiprocessor systems, data parallelism is the execution of the same task on data distributed ac...
Parallelism is ubiquitous in modern computer architectures. Heterogeneity of CPU cores and deep memo...
The last decade has been characterized by radical changes in the computing landscape. We have witnes...
During the last two decades, computer hardware has experienced remarkable developments. Especially C...
Partitioning a system's functionality among interacting hardware and software components is an impor...
[[abstract]]The concept of green storage in cluster computing has generated a great deal of interest...
Heterogeneous System Development needs Hardware/Software Partitioning performed early on in the deve...
With global pool of data growing at over 2.5 quinitillion bytes per day and over 90% of all data in ...
The multicore era has initiated a move to ubiquitous parallelization of software. In the process, co...
With the breakdown of Dennard Scaling and Moore's law, power consumption appears to be a primary cha...
In the past decade, the exponential growth in commodity CPUs speed has far outpaced advances in memo...
The performance of supercomputers is not growing anymore at the rate it once used to. Several years ...
This paper presents a framework for characterizing the distribution of fine-grained parallelism, dat...
The failure of Dennard scaling [Bohr, 2007] and the rapid growth of data produced and consumed daily...
In the last decades, high-performance large-scale systems have been a fundamental tool for scientifi...
In multiprocessor systems, data parallelism is the execution of the same task on data distributed ac...
Parallelism is ubiquitous in modern computer architectures. Heterogeneity of CPU cores and deep memo...
The last decade has been characterized by radical changes in the computing landscape. We have witnes...
During the last two decades, computer hardware has experienced remarkable developments. Especially C...
Partitioning a system's functionality among interacting hardware and software components is an impor...
[[abstract]]The concept of green storage in cluster computing has generated a great deal of interest...
Heterogeneous System Development needs Hardware/Software Partitioning performed early on in the deve...