Many parallel applications require periodic redistribution of workloads and associated data. In a distributed memory computer, this redistribution can be difficult if limited memory is available for receiving messages. The authors propose a model for optimizing the exchange of messages under such circumstances which they call the minimum phase remapping problem. They first show that the problem is NP-Complete, and then analyze several methodologies for addressing it. First, they show how the problem can be phrased as an instance of multi-commodity flow. Next, they study a continuous approximation to the problem. They show that this continuous approximation has a solution which requires at most two more phases than the optimal discrete solut...
The performance of a High Performance Parallel or Distributed Computation depends heavily on minimiz...
(eng) Minimizing communications when mapping affine loop nests onto distributed memory parallel comp...
Reconfiguration is largely an unexplored property in the context of parallel models of computation. ...
Many parallel applications require periodic redistribution of workloads and associated data. In a di...
Abstract—Many parallel applications require periodic redistribution of workloads and associated data...
This thesis is concerned with the problem of minimizing the interprocessor data communication in par...
227 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 1988.Most future supercomputers wi...
This paper describes a number of optimizations that can be used to support the efficient execution o...
Interprocessor communication is an important aspect of parallel processing. Studies have shown that ...
For bulk synchronous computations that have nondeterministic behaviors, dynamic remapping is an effe...
In this paper, we study the various communication algorithms on the pipeline multicomputer. We show ...
For bulk synchronous computations that have non-deterministic behaviors, dynamic remapping is an eff...
The Message Minimizing Load Redistribution Problem is described which arises from the need to redist...
We study the effect of limited communication throughput on parallel computation in a setting where t...
Multicomputer (distributed memory MIMD machines) have emerged as inexpensive, yet powerful parallel...
The performance of a High Performance Parallel or Distributed Computation depends heavily on minimiz...
(eng) Minimizing communications when mapping affine loop nests onto distributed memory parallel comp...
Reconfiguration is largely an unexplored property in the context of parallel models of computation. ...
Many parallel applications require periodic redistribution of workloads and associated data. In a di...
Abstract—Many parallel applications require periodic redistribution of workloads and associated data...
This thesis is concerned with the problem of minimizing the interprocessor data communication in par...
227 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 1988.Most future supercomputers wi...
This paper describes a number of optimizations that can be used to support the efficient execution o...
Interprocessor communication is an important aspect of parallel processing. Studies have shown that ...
For bulk synchronous computations that have nondeterministic behaviors, dynamic remapping is an effe...
In this paper, we study the various communication algorithms on the pipeline multicomputer. We show ...
For bulk synchronous computations that have non-deterministic behaviors, dynamic remapping is an eff...
The Message Minimizing Load Redistribution Problem is described which arises from the need to redist...
We study the effect of limited communication throughput on parallel computation in a setting where t...
Multicomputer (distributed memory MIMD machines) have emerged as inexpensive, yet powerful parallel...
The performance of a High Performance Parallel or Distributed Computation depends heavily on minimiz...
(eng) Minimizing communications when mapping affine loop nests onto distributed memory parallel comp...
Reconfiguration is largely an unexplored property in the context of parallel models of computation. ...