We give asymptotically equal lower and upper bounds for the number of parallel I/O operations required to perform BMMC permutations (defined by a characteristic matrix that is nonsingular over GF (2)) on parallel disk systems. Under the Vitter-Shriver parallel-disk model with N records, D disks, block size B, and M records of RAM, we show a universal lower bound of\Omega \Gamma N BD \Gamma 1 + rank(fl) lg(M=B) \Delta \Delta parallel I/Os for performing a BMMC permutation, where fl is the lower left lg(N=B) \Theta lg B submatrix of the characteristic matrix. We adapt this lower bound to show that the algorithm for BPC permutations in [Cor93] is asymptotically optimal. We also present an algorithm that uses at most 2N BD \Gamma 6 ...
The most efficient way to calculate strong bisimilarity is by finding the relational coarsest partit...
Abstract. BMF is a formalism that allows us to design parallel programs independently of the target ...
We develop an algorithm for parallel disk sorting, whose I/O cost approaches the lower bound and tha...
We give asymptotically equal lower and upper bounds for the number of parallel I/O operations requir...
The ability to perform permutations of large data sets in place reduces the amount of necessary avai...
The ability to perform permutations of large data sets in place reduces the amount of necessary avai...
The authors implemented and measured several methods to perform BMMC permutations on the MasPar MP-2...
We provide time lower bounds for sequential and parallel algorithms deciding bisimulation on labeled...
In this paper we present a simple parallel sorting algorithm and illustrate two applications. The al...
For the design and analysis of algorithms that process huge data sets, a machine model is needed tha...
In this paper we introduce parallel versions of two hierarchical memory models and give optimal algo...
In this paper we present a simple parallel sorting algorithm and illustrate its application in gener...
For the design and analysis of algorithms that process huge data sets, a machine model is needed tha...
Blockwise access to data is a central theme in the design of efficient external memory (EM) algorith...
With the widening gap between processor speeds and disk access speeds, the I/O bottleneck has become...
The most efficient way to calculate strong bisimilarity is by finding the relational coarsest partit...
Abstract. BMF is a formalism that allows us to design parallel programs independently of the target ...
We develop an algorithm for parallel disk sorting, whose I/O cost approaches the lower bound and tha...
We give asymptotically equal lower and upper bounds for the number of parallel I/O operations requir...
The ability to perform permutations of large data sets in place reduces the amount of necessary avai...
The ability to perform permutations of large data sets in place reduces the amount of necessary avai...
The authors implemented and measured several methods to perform BMMC permutations on the MasPar MP-2...
We provide time lower bounds for sequential and parallel algorithms deciding bisimulation on labeled...
In this paper we present a simple parallel sorting algorithm and illustrate two applications. The al...
For the design and analysis of algorithms that process huge data sets, a machine model is needed tha...
In this paper we introduce parallel versions of two hierarchical memory models and give optimal algo...
In this paper we present a simple parallel sorting algorithm and illustrate its application in gener...
For the design and analysis of algorithms that process huge data sets, a machine model is needed tha...
Blockwise access to data is a central theme in the design of efficient external memory (EM) algorith...
With the widening gap between processor speeds and disk access speeds, the I/O bottleneck has become...
The most efficient way to calculate strong bisimilarity is by finding the relational coarsest partit...
Abstract. BMF is a formalism that allows us to design parallel programs independently of the target ...
We develop an algorithm for parallel disk sorting, whose I/O cost approaches the lower bound and tha...