This paper considers the implementation and evaluation of broadcast, scatter and gather communication operations in heterogeneous machines organized as a Hierarchy of Processor-And-Memory (HPAM). The top levels of the hierarchy consist of a small number of fast processors, whereas the bottom levels consist of a large number of slow processors. Each HPAM level consists of a homogeneous processor mesh. Routing within each level and across levels uses an extended form of X-Y dimension ordered routing. The execution times of three collective communication operations are analytically evaluated in the context of two-level HPAM machines. For each operation, two different alternatives are considered depending on the absence or presence of hardware ...
AbstractIn the theory of dissemination of information in interconnection networks (gossiping and bro...
A method to reduce broadcast time in wormholerouted hypercube systems is described. The method takes...
Accelerators have revolutionised the high performance computing (HPC) community. Despite their advan...
This thesis outlines a cost-effective multiprocessor architecture that takes into consideration the ...
A large potential exists for increasing the communication performance of hypercube multiprocessors. ...
In this paper, we consider the communications involved in the execution of a complex application, de...
[[abstract]]This paper presents efficient algorithms for broadcasting on heterogeneous switch-based ...
Hypercube algorithms are developed for a variety of commun-ication-intensive tasks such as transposi...
Networks of Workstations (NOW) have become an attractive alternative platform for high performance c...
In this paper, we develop portable and scalable algorithms for performing irregular all-to-all commu...
International audienceIn this paper, we consider the communications involved by the execution of a c...
Introduction Applications are an important driving force behind the emergence of new machine archit...
(eng) In this paper, we consider the communications involved by the execution of a complex applicati...
Collective communication allows efficient communication and synchronization among a collection of pr...
Abstract-There are a number of models that were proposed in recent years for message passing paralle...
AbstractIn the theory of dissemination of information in interconnection networks (gossiping and bro...
A method to reduce broadcast time in wormholerouted hypercube systems is described. The method takes...
Accelerators have revolutionised the high performance computing (HPC) community. Despite their advan...
This thesis outlines a cost-effective multiprocessor architecture that takes into consideration the ...
A large potential exists for increasing the communication performance of hypercube multiprocessors. ...
In this paper, we consider the communications involved in the execution of a complex application, de...
[[abstract]]This paper presents efficient algorithms for broadcasting on heterogeneous switch-based ...
Hypercube algorithms are developed for a variety of commun-ication-intensive tasks such as transposi...
Networks of Workstations (NOW) have become an attractive alternative platform for high performance c...
In this paper, we develop portable and scalable algorithms for performing irregular all-to-all commu...
International audienceIn this paper, we consider the communications involved by the execution of a c...
Introduction Applications are an important driving force behind the emergence of new machine archit...
(eng) In this paper, we consider the communications involved by the execution of a complex applicati...
Collective communication allows efficient communication and synchronization among a collection of pr...
Abstract-There are a number of models that were proposed in recent years for message passing paralle...
AbstractIn the theory of dissemination of information in interconnection networks (gossiping and bro...
A method to reduce broadcast time in wormholerouted hypercube systems is described. The method takes...
Accelerators have revolutionised the high performance computing (HPC) community. Despite their advan...