Much work has been done on the problem of synthesizing a processor array from a system of recurrence equations. Some researchers limit communication to nearest neighbors in the array; others use broadcast. In many cases, neither of the above approaches result in an optimal execution time. In this paper a technique called bounded broadcast is explored whereby an element of a processor array can broadcast to a bounded number of other processors. This technique is applied to the problems of transitive closure and all-pairs shortest distance, resulting in time complexities that are smaller than those reported previously. In general, the technique can be used to design bounded broadcast systolic arrays for algorithms whose implementation can ben...
The length of the longest common subsequence (LCS) between two strings of M and N characters can be ...
Broadcasting is an information dissemination problem in which information originating at one node of...
An improved method for solving the well-known conflict-free scheduling problem for the locally seque...
Abstract: Many compute-bound software kernels have seen order-of-magnitude speedups on special-purpo...
We present new optimal systolic algorithms for the transitive closure problem on ring and linear arr...
We present a simple method for developing parallel and systolic programs from data dependence. We de...
Forming the transitive closure of a binary relation (or directed graph) is an important part of many...
Systematic methods have been proposed for the design of (semi-) systolic arrays. One approach consis...
The production of regular computations using algorithmic engineering techniques is beginning to play...
An improved method for solving the well-known conflict-free scheduling problem for the locally seque...
Three related problems, among others, are faced when trying to execute an algorithm on a parallel ma...
We deal with the problem of partitioning and mapping uniform loop nests onto physical processor arra...
This paper adresses the problem of efficient mappings of nested loops, and more generally of system...
We consider distributed memory algorithms for the all-pairs shortest paths (APSP) problem. Scaling t...
In this paper, we show that every systolic array executes a Regular Iterative Algorithm with a stron...
The length of the longest common subsequence (LCS) between two strings of M and N characters can be ...
Broadcasting is an information dissemination problem in which information originating at one node of...
An improved method for solving the well-known conflict-free scheduling problem for the locally seque...
Abstract: Many compute-bound software kernels have seen order-of-magnitude speedups on special-purpo...
We present new optimal systolic algorithms for the transitive closure problem on ring and linear arr...
We present a simple method for developing parallel and systolic programs from data dependence. We de...
Forming the transitive closure of a binary relation (or directed graph) is an important part of many...
Systematic methods have been proposed for the design of (semi-) systolic arrays. One approach consis...
The production of regular computations using algorithmic engineering techniques is beginning to play...
An improved method for solving the well-known conflict-free scheduling problem for the locally seque...
Three related problems, among others, are faced when trying to execute an algorithm on a parallel ma...
We deal with the problem of partitioning and mapping uniform loop nests onto physical processor arra...
This paper adresses the problem of efficient mappings of nested loops, and more generally of system...
We consider distributed memory algorithms for the all-pairs shortest paths (APSP) problem. Scaling t...
In this paper, we show that every systolic array executes a Regular Iterative Algorithm with a stron...
The length of the longest common subsequence (LCS) between two strings of M and N characters can be ...
Broadcasting is an information dissemination problem in which information originating at one node of...
An improved method for solving the well-known conflict-free scheduling problem for the locally seque...