AbstractA processor is balanced in carrying out a computation if its computing time equals its I/O time. When the computation bandwidth of a processor is increased, like when multiple processors are incorporated to form an array, the critical question is to what degree the processor's memory must be enlarged in order to alleviate the I/O bottleneck to keep the computation balanced. In this paper, for the sorting problem, we present two balanced algorithms on linearly connected and mesh-connected processor arrays, respectively, and show that they reach the derived lower bounds of memory sizes. We also verify that the time complexities of the algorithms are optimal under their respective hardware constraints
AbstractWe define two conditions on a random access machine (RAM) with arithmetic and Boolean instru...
The recently discovered Shear-sort algorithm requires log2<SUB>n</SUB> iterations of row and column ...
AbstractWe study the effect of limited communication throughput on parallel computation in a setting...
AbstractA processor is balanced in carrying out a computation if its computing time equals its I/O t...
AbstractIn this paper, a processing element (PE) is characterized by its computation bandwidth, I/O ...
We study the fundamental problem of sorting n integers of w bits on a unit-cost RAM with word size w...
AbstractWe show that a unit-cost RAM with a word length ofwbits can sortnintegers in the range 0…2w−...
AbstractÐWe present a hardware-algorithm for sortingN elements using either a p-sorter or a sorting ...
Abstract—Energy consumption has become a critical factor constraining the design of massively parall...
Sorting is one of the fundamental problems in computer science. In this thesis we present three indi...
The growing importance and interest in parallel processing within Computer Sciences are undeniable, ...
Abstract. We study the problem of sorting on a parallel computer with limited communication bandwidt...
The sorting problem is to arrange N values in a distributed system of N processors into sorted order...
Sorting is a fundamental problem with applications in all areas of computer science and engineering....
We show that sorting an input of size N = n superscript 2 can be performed by an n X n mesh-connect...
AbstractWe define two conditions on a random access machine (RAM) with arithmetic and Boolean instru...
The recently discovered Shear-sort algorithm requires log2<SUB>n</SUB> iterations of row and column ...
AbstractWe study the effect of limited communication throughput on parallel computation in a setting...
AbstractA processor is balanced in carrying out a computation if its computing time equals its I/O t...
AbstractIn this paper, a processing element (PE) is characterized by its computation bandwidth, I/O ...
We study the fundamental problem of sorting n integers of w bits on a unit-cost RAM with word size w...
AbstractWe show that a unit-cost RAM with a word length ofwbits can sortnintegers in the range 0…2w−...
AbstractÐWe present a hardware-algorithm for sortingN elements using either a p-sorter or a sorting ...
Abstract—Energy consumption has become a critical factor constraining the design of massively parall...
Sorting is one of the fundamental problems in computer science. In this thesis we present three indi...
The growing importance and interest in parallel processing within Computer Sciences are undeniable, ...
Abstract. We study the problem of sorting on a parallel computer with limited communication bandwidt...
The sorting problem is to arrange N values in a distributed system of N processors into sorted order...
Sorting is a fundamental problem with applications in all areas of computer science and engineering....
We show that sorting an input of size N = n superscript 2 can be performed by an n X n mesh-connect...
AbstractWe define two conditions on a random access machine (RAM) with arithmetic and Boolean instru...
The recently discovered Shear-sort algorithm requires log2<SUB>n</SUB> iterations of row and column ...
AbstractWe study the effect of limited communication throughput on parallel computation in a setting...