We introduce a new deterministic parallel sorting algorithm based on the regular sampling approach. The algorithm uses only two rounds of regular all-to-all personalized communication in a scheme that yields very good load balancing with virtually no overhead. Moreover, unlike previous variations, our algorithm efficiently handles the presence of duplicate values without the overhead of tagging each element with a unique identifier. This algorithm was implemented in Split-C and run on the IBM SP-2-WN and the Cray Research T3D. We ran our code using widely different benchmarks to examine the dependence of our algorithm on the input distribution. Our experimental results illustrate the efficiency and scalability of our algorit...
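The regular sampling idea named in the abstract above can be illustrated with a minimal sequential sketch. It is not the authors' algorithm: it follows the textbook form of sorting by regular sampling, simulates the processors with plain Python lists, and does not reproduce the paper's two-round communication scheme or its treatment of duplicate values. The function name `regular_sample_sort` and the parameter `p` (number of simulated processors) are hypothetical names chosen for illustration.

```python
from bisect import bisect_right
from heapq import merge


def regular_sample_sort(data, p):
    """Simulate sorting `data` on `p` hypothetical processors.

    Returns a list of p sorted lists whose concatenation is sorted.
    """
    if p < 1:
        raise ValueError("p must be at least 1")
    n = len(data)
    if n == 0:
        return [[] for _ in range(p)]

    # Step 1: split the input into p nearly equal blocks; sort each locally.
    blocks = [sorted(data[i * n // p:(i + 1) * n // p]) for i in range(p)]

    # Step 2: every non-empty block contributes p evenly spaced samples.
    samples = []
    for block in blocks:
        if block:
            samples.extend(block[j * len(block) // p] for j in range(p))
    samples.sort()

    # Step 3: pick p - 1 splitters at regular intervals from the samples.
    splitters = [samples[k * len(samples) // p] for k in range(1, p)]

    # Step 4: cut each sorted block at the splitters and route the pieces
    # to their destination (this stands in for the all-to-all exchange).
    received = [[] for _ in range(p)]
    for block in blocks:
        cuts = [0] + [bisect_right(block, s) for s in splitters] + [len(block)]
        for dest in range(p):
            received[dest].append(block[cuts[dest]:cuts[dest + 1]])

    # Step 5: each destination merges the sorted runs it received.
    return [list(merge(*runs)) for runs in received]


if __name__ == "__main__":
    parts = regular_sample_sort([5, 3, 9, 1, 7, 3, 8, 2, 6, 4, 0, 3], 3)
    print(parts)
```

Because this sketch sends all copies of a splitter value to the same destination, heavily duplicated inputs can unbalance the parts; the abstract states that the paper's algorithm handles duplicates efficiently, which this simplified version does not attempt to model.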
A fundamental challenge for parallel computing is to obtain high-level, architecture independent, al...
The Parallel Disks Model (PDM) has been proposed to alleviate the I/O bottleneck that arises in the ...
We tackle the feasibility and efficiency of two new parallel algorithms that s...
We introduce a new deterministic parallel sorting algorithm for distributed memory machines based on...
Previous schemes for sorting on general-purpose parallel machines have had to choose between poor lo...
Previous schemes for sorting on general-purpose parallel machines have had to choose between poor l...
Previous schemes for sorting on general-purpose parallel machines have had to choose between poor lo...
A fundamental challenge for parallel computing is to obtain high-level, architecture independent, al...
We consider the often-studied problem of sorting, for a parallel computer. Given an input array dis...
Parallel sorting algorithms have been proposed for a variety of multiple instruction streams, multip...
A fundamental challenge for parallel computing is to obtain high-level, architecture independent, a...
Sorting is one of the most fundamental algorithmic kernels, used by a large fraction of computer app...
In this paper, a refined deterministic sampling strategy is presented. It allows to improve ...
Many sorting algorithms that perform well on uniformly distributed data suffer significant performan...
Clusters of symmetric multiprocessors (SMPs) have emerged as the primary candidates for large scale...