Detailed algorithms for all-to-all broadcast and reduction are given for arrays mapped by binary or binary-reflected Gray code encoding to the processing nodes of binary cube networks. Algorithms are also given for the local computation of the array indices for the communicated data, thereby reducing the demand for communication bandwidth. On the Connection Machine system CM-200, Hamiltonian cycle-based all-to-all communication algorithms perform a factor of 2 to 10 better than algorithms based on trees, butterfly networks, or the Connection Machine router. The peak data rate achieved for all-to-all broadcast on a 2,048-node Connection Machine system CM-200 is 5.4 Gbyte/s. The index order...
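The link between binary-reflected Gray codes, Hamiltonian cycles, and locally computed indices can be illustrated with a small simulation. The following is a minimal single-cycle sketch in Python, not the algorithm from the paper: it assumes one data item per node, one hop per step, and a single Hamiltonian cycle obtained by visiting the cube nodes in Gray code order. All function names (`gray`, `hamiltonian_cycle`, `is_hamiltonian_cycle`, `all_to_all_broadcast`) are hypothetical and chosen for illustration only.

```python
def gray(i: int) -> int:
    """Binary-reflected Gray code of i."""
    return i ^ (i >> 1)


def hamiltonian_cycle(dim: int) -> list[int]:
    """Node addresses of a dim-dimensional binary cube in Gray code order."""
    return [gray(i) for i in range(1 << dim)]


def is_hamiltonian_cycle(cycle: list[int], dim: int) -> bool:
    """Check that the cycle visits every node once and uses only cube edges."""
    n = 1 << dim
    if sorted(cycle) != list(range(n)):
        return False
    for k in range(n):
        diff = cycle[k] ^ cycle[(k + 1) % n]
        # A cube edge joins addresses that differ in exactly one bit.
        if diff == 0 or diff & (diff - 1):
            return False
    return True


def all_to_all_broadcast(dim: int) -> dict[int, list[int]]:
    """Simulate all-to-all broadcast along one Hamiltonian cycle (assumed model).

    Each node starts with one item, labeled by its own address. In every step
    each node forwards the item it received last to its successor on the
    cycle. After n - 1 steps every node holds all n items.
    Returns, per node, the source addresses in arrival order.
    """
    cycle = hamiltonian_cycle(dim)
    n = len(cycle)
    received = {node: [node] for node in cycle}
    in_flight = {node: node for node in cycle}  # item each node forwards next
    for _ in range(n - 1):
        arriving = {}
        for k, node in enumerate(cycle):
            successor = cycle[(k + 1) % n]
            arriving[successor] = in_flight[node]
        for node, item in arriving.items():
            received[node].append(item)
        in_flight = arriving
    return received
```

In this model, the item arriving at cycle position `k` in step `s` originated at cycle position `(k - s) mod n`, so a receiver can compute the source index itself and no index needs to travel with the data. This is only the simplest instance of the idea of computing indices locally; the index computations in the paper are for arrays under binary and Gray code mappings and are not reproduced here.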
Hypercube algorithms are developed for a variety of communication-intensive tasks such as transposi...
We discuss some techniques for preserving locality of reference in index spaces when mapped to memor...
We present optimal schedules for permutations in which each node sends one or several unique...
We provide performance models for several primitive operations on data structures distributed over m...
All-to-all personalized communication is a class of permutations in which each processor sends a uni...
Graduation date: 1989. We will describe two known strategies for static processor allocation in an ...
This paper presents two simple all-to-all broadcasting algorithms on honeycomb...
In irregular all-to-all communication, messages are exchanged between every pair of processors. The ...
Both Gray code and binary code are frequently used in mapping arrays into hypercube architectures. W...
Hypercube algorithms are developed for a variety of communication-intensive tasks such as transposin...
We develop an all-to-all broadcast scheme that achieves maximum bandwidth efficiency for clusters wi...
Several recent papers have proposed or analyzed optimal algorithms to route all-to-all personalized ...
A method to reduce broadcast time in wormhole-routed hypercube systems is described. The method takes...
Some common guidelines that can be used to design parallel algorithms under the single-c...