[[abstract]]©1988 Springer Verlag-Designing efficient parallel algorithms in a message-based parallel computer should consider both time-space tradeoffs and computation-communication tradeoffs. In order to balance these tradeoffs and achieve the optimal performance of an algorith, one has to consider various design parameters such as the number of processors required and the size of partitions. In this paper, we demonstrate that, for certain data parallel algorithms, it is possible to determine these design parameters analytically. To serve as a basis for the discussions that follow, a simple model for the NCUBE hypercube computer is introduced. Using this model, we use two examples, array summation and matrix multiplication, to illustrate ...
In this paper we present an efficient dense matrix multi-plication algorithm for distributed memory ...
Many parallel algorithms use hypercubes as the communication topology among their processes. When su...
A number of parallel formulations of dense matrix multiplication algorithm have been developed. For ...
The task-to-processor mapping problem is addressed in the context of a local-memory multiprocessor w...
Processor allocation and the task scheduling technique in parallel processing systems play a signifi...
Mapping of parallel programs onto parallel computers for efficient execution is a fundamental proble...
We consider the problem of subsystem allocation in the mesh, torus, and hypercube multicomputers. Al...
Mapping of parallel programs onto parallel computers for efficient execution is a fundamental proble...
This paper surveys some theoretical results of the hypercube for design of VLSI architecture. The pa...
[[abstract]]Intensive scientific algorithms can usually be formulated as nested loops which are the ...
Many parallel algorithms use hypercubes as the communication topology among their processes. When su...
This paper parallelizes the embedding strategy for mapping any two-dimensional grid into its optimal...
Many parallel algorithms exhibit a hypercube communication topology. Such algorithms can easily be e...
In this thesis the design of parallel numerical algorithms is investigated under the framework of th...
Due to the character of the original source materials and the nature of batch digitization, quality ...
In this paper we present an efficient dense matrix multi-plication algorithm for distributed memory ...
Many parallel algorithms use hypercubes as the communication topology among their processes. When su...
A number of parallel formulations of dense matrix multiplication algorithm have been developed. For ...
The task-to-processor mapping problem is addressed in the context of a local-memory multiprocessor w...
Processor allocation and the task scheduling technique in parallel processing systems play a signifi...
Mapping of parallel programs onto parallel computers for efficient execution is a fundamental proble...
We consider the problem of subsystem allocation in the mesh, torus, and hypercube multicomputers. Al...
Mapping of parallel programs onto parallel computers for efficient execution is a fundamental proble...
This paper surveys some theoretical results of the hypercube for design of VLSI architecture. The pa...
[[abstract]]Intensive scientific algorithms can usually be formulated as nested loops which are the ...
Many parallel algorithms use hypercubes as the communication topology among their processes. When su...
This paper parallelizes the embedding strategy for mapping any two-dimensional grid into its optimal...
Many parallel algorithms exhibit a hypercube communication topology. Such algorithms can easily be e...
In this thesis the design of parallel numerical algorithms is investigated under the framework of th...
Due to the character of the original source materials and the nature of batch digitization, quality ...
In this paper we present an efficient dense matrix multi-plication algorithm for distributed memory ...
Many parallel algorithms use hypercubes as the communication topology among their processes. When su...
A number of parallel formulations of dense matrix multiplication algorithm have been developed. For ...