Previous schemes for sorting on general-purpose parallel machines have had to choose between poor load balancing and irregular communication or multiple rounds of all-to-all personalized communication. In this paper, we introduce a novel variation on sample sort which uses only two rounds of regular all-to-all personalized communication in a scheme that yields very good load balancing with virtually no overhead. Moreover, unlike previous variations, our algorithm efficiently handles the presence of duplicate values without the overhead of tagging each element with a unique identifier. This algorithm was implemented in SPLIT-C and run on a variety of platforms, including the Thinking Machines CM-5, the IBM SP-2, and the Cray Research T3D. We...
A common statistical problem is that of finding the median element in a set of data. This paper pres...
Sorting is one of the most fundamental algorithmic kernels, used by a large fraction of computer app...
Clusters of symmetric multiprocessors (SMPs) have emerged as the primary candidates for large scale...
We introduce a new deterministic parallel sorting algorithm for distributed memory machines based on...
Previous schemes for sorting on general-purpose parallel machines have had to choose between poor lo...
Previous schemes for sorting on general-purpose parallel machines have had to choose between poor l...
We introduce a new deterministic parallel sorting algorithm based on the regular sampling approach...
A fundamental challenge for parallel computing is to obtain high-level, architecture independent, al...
We consider the often-studied problem of sorting, for a parallel computer. Given an input array dis...
Parallel sorting algorithms have been proposed for a variety of multiple instruction streams, multip...
Many sorting algorithms that perform well on uniformly distributed data suffer significant performan...
The Parallel Disks Model (PDM) has been proposed to alleviate the I/O bottleneck that arises in the ...
A fundamental challenge for parallel computing is to obtain high-level, architecture independent, a...
The Parallel Disks Model (PDM) has been proposed to alleviate the I/O bottle-neck that arises in the...
Abstract. The Parallel Disks Model (PDM) has been proposed to al-leviate the I/O bottleneck that ari...
A common statistical problem is that of finding the median element in a set of data. This paper pres...
Sorting is one of the most fundamental algorithmic kernels, used by a large fraction of computer app...
Clusters of symmetric multiprocessors (SMPs) have emerged as the primary candidates for large scale...
We introduce a new deterministic parallel sorting algorithm for distributed memory machines based on...
Previous schemes for sorting on general-purpose parallel machines have had to choose between poor lo...
Previous schemes for sorting on general-purpose parallel machines have had to choose between poor l...
We introduce a new deterministic parallel sorting algorithm based on the regular sampling approach...
A fundamental challenge for parallel computing is to obtain high-level, architecture independent, al...
We consider the often-studied problem of sorting, for a parallel computer. Given an input array dis...
Parallel sorting algorithms have been proposed for a variety of multiple instruction streams, multip...
Many sorting algorithms that perform well on uniformly distributed data suffer significant performan...
The Parallel Disks Model (PDM) has been proposed to alleviate the I/O bottleneck that arises in the ...
A fundamental challenge for parallel computing is to obtain high-level, architecture independent, a...
The Parallel Disks Model (PDM) has been proposed to alleviate the I/O bottle-neck that arises in the...
Abstract. The Parallel Disks Model (PDM) has been proposed to al-leviate the I/O bottleneck that ari...
A common statistical problem is that of finding the median element in a set of data. This paper pres...
Sorting is one of the most fundamental algorithmic kernels, used by a large fraction of computer app...
Clusters of symmetric multiprocessors (SMPs) have emerged as the primary candidates for large scale...