Abstract. In this paper, matching upper and lower bounds are proven for broadcast on general-purpose parallel computation models that exploit network locality. These models try to capture the general-purpose properties of models like the PRAM or BSP on the one hand, and to exploit the network locality of special-purpose models such as meshes and hypercubes on the other. They do so by charging a cost $l(|i-j|)$ for a communication between processors $i$ and $j$, where $l$ is a suitably chosen latency function. An upper bound $T(p) = \sum_{i=0}^{\log\log p} 2^i \cdot l(p^{1/2^i})$ on the runtime of a broadcast on a $p$-processor H-PRAM is given, for an arbitrary latency function $l(k)$. The main contribution of the paper is a matching lower bound, holding for all latency function...
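To make the upper bound in the abstract concrete, the following minimal sketch evaluates the sum numerically for a caller-supplied latency function. The function name `broadcast_upper_bound`, the use of base-2 logarithms, and rounding the upper summation limit down to an integer are assumptions made for illustration; the abstract does not specify them.

```python
import math


def broadcast_upper_bound(p, latency):
    """Evaluate T(p) = sum_{i=0}^{log log p} 2^i * l(p^(1/2^i)).

    Assumptions (not stated in the abstract): logarithms are base 2 and
    the upper summation limit is floor(log2(log2(p))).
    """
    if p < 2:
        raise ValueError("p must be at least 2 so that log log p is defined")
    levels = int(math.floor(math.log2(math.log2(p))))
    total = 0.0
    for i in range(levels + 1):
        # Term i: 2^i messages' worth of cost at distance p^(1/2^i).
        distance = p ** (1.0 / 2 ** i)
        total += (2 ** i) * latency(distance)
    return total


if __name__ == "__main__":
    # Hypothetical latency functions, chosen only to exercise the formula.
    for name, l in [("constant", lambda k: 1.0),
                    ("logarithmic", lambda k: math.log2(k)),
                    ("linear", lambda k: k)]:
        print(name, broadcast_upper_bound(16, l))
```

With a logarithmic latency every term of the sum equals log p, so under these assumptions the bound grows as log p times the number of terms; with a linear latency the first term, l(p), dominates for large p.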
Some common guidelines that can be used to design parallel algorithms under the single-c...
In this paper, we apply the locality concept to the communication patterns of parallel programs operat...
Abstract. NOWs (Networks of workstations) have been extensively used to execute parallel application...
Recently there has been an increasing interest in models of parallel computation that account for th...
Abstract. We propose a model, LPRAM, for parallel random access machines with local memory that captur...
Abstract. The goal of this paper is to present practical experiments on broadcasting algorithms on a c...
Abstract. We consider the broadcasting operation in point-to-point packet-switched parallel and distri...
Broadcast Communication is among the most primitive collective capabilities of any message passing n...
Abstract. We consider an extension of the well-known PRAM model for parallel distributed-memory comput...
There are a number of models that were proposed in recent years for message passing parallel systems...
Abstract. In this paper we show how parallel algorithms can be turned into efficient streaming algorit...
Abstract. We prove the correctness of optimized parallel implementations of a generalized broadcast,...
Abstract. We study the effect of limited communication throughput on parallel computation in a setting...
We introduce a model of parallel computation that retains the ideal properties of the PRAM by using ...