In this paper, we consider the problem of selection on coarse-grained distributed memory parallel computers. We discuss several deterministic and randomized algorithms for parallel selection. We also consider several algorithms for load balancing needed to keep a balanced distribution of data across processors during the execution of the selection algorithms. We have carried out detailed implementations of all the algorithms discussed on the CM-5 and report on the experimental results. We demonstrate that the randomized algorithms are superior to their deterministic counterparts. Index Terms: coarse-grained, hypercubes, load balancing, median finding, meshes, randomized algorithms, selection, parallel algorithms, parallel computers. 1 Intro...
Bulk-synchronous parallelism (BSP) is a simple and efficient paradigm for parallel algorithm design ...
Bulk-synchronous parallelism (BSP) is a simple and efficient paradigm for parallel algorithm design ...
We show that selection on an input of size N can be performed on a P-node hypercube (P = N/(log N)) ...
In this paper, we consider the problem of selection on coarse-grained distributed memory parallel co...
In this paper, we consider the problem of selection on coarse-grained distributed memory parallel co...
We present two deterministic parallel Selection algorithms for distributed memory machines, under th...
A common statistical problem is that of nding the median element in a set of data. This paper presen...
Technical ReportA common statistical problem is that of finding the median element in a set of data....
AbstractWe present a randomized selection algorithm whose performance is analyzed in an architecture...
The selection problem of size $n$ is, given a set of $n$ elements drawn from an ordered universe and...
A common statistical problem is that of finding the median element in a set of data. This paper pre...
A common statistical problem is that of finding the median element in a set of data. This paper pres...
With the widening gap between processor speeds and disk access speeds, the I/O bottleneck has become...
Multiselection is the problem of selecting multiple elements at specified ranks from a set of arbitr...
Abstract. In this paper we study the problems of sorting and selection on the Distributed Memory Bus...
Bulk-synchronous parallelism (BSP) is a simple and efficient paradigm for parallel algorithm design ...
Bulk-synchronous parallelism (BSP) is a simple and efficient paradigm for parallel algorithm design ...
We show that selection on an input of size N can be performed on a P-node hypercube (P = N/(log N)) ...
In this paper, we consider the problem of selection on coarse-grained distributed memory parallel co...
In this paper, we consider the problem of selection on coarse-grained distributed memory parallel co...
We present two deterministic parallel Selection algorithms for distributed memory machines, under th...
A common statistical problem is that of nding the median element in a set of data. This paper presen...
Technical ReportA common statistical problem is that of finding the median element in a set of data....
AbstractWe present a randomized selection algorithm whose performance is analyzed in an architecture...
The selection problem of size $n$ is, given a set of $n$ elements drawn from an ordered universe and...
A common statistical problem is that of finding the median element in a set of data. This paper pre...
A common statistical problem is that of finding the median element in a set of data. This paper pres...
With the widening gap between processor speeds and disk access speeds, the I/O bottleneck has become...
Multiselection is the problem of selecting multiple elements at specified ranks from a set of arbitr...
Abstract. In this paper we study the problems of sorting and selection on the Distributed Memory Bus...
Bulk-synchronous parallelism (BSP) is a simple and efficient paradigm for parallel algorithm design ...
Bulk-synchronous parallelism (BSP) is a simple and efficient paradigm for parallel algorithm design ...
We show that selection on an input of size N can be performed on a P-node hypercube (P = N/(log N)) ...