The problem of performing a global combine (summation) operation on distributed memory computers using a two-dimensional mesh interconnect with wormhole routing is considered. We present algorithms that are asymptotically optimal for short vectors (O(log(p)) for p processing nodes) and for long vectors (O(n) for n data elements per node), as well as hybrid algorithms that are superior for intermediate n. The algorithms are analyzed using detailed performance models that include the effects of link conflicts and other characteristics of the underlying communication system. The models are validated using experimental data from the Intel Touchstone DELTA computer. We show that no one algorithm is optimal for all vector lengths; rather, each of...
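To make the short-vector versus long-vector trade-off concrete, the following minimal sketch simulates two textbook global-sum strategies in plain Python: recursive doubling (log2(p) exchange steps, each carrying the whole vector) and a ring reduce-scatter followed by an all-gather (2(p-1) steps, each carrying n/p elements). These are illustrative assumptions, not necessarily the algorithms of the paper; the function names are hypothetical, and the simulation ignores the mesh topology, wormhole routing, and link conflicts that the paper's models account for.

```python
def recursive_doubling_sum(vectors):
    """Simulate a global sum by recursive doubling.

    Returns (per-node results, number of exchange steps).
    Assumes the node count p is a power of two.
    """
    p = len(vectors)
    assert p > 0 and p & (p - 1) == 0, "sketch assumes p is a power of two"
    data = [list(v) for v in vectors]
    steps = 0
    d = 1
    while d < p:
        # Each node i exchanges its whole partial sum with partner i XOR d.
        data = [[a + b for a, b in zip(data[i], data[i ^ d])] for i in range(p)]
        d *= 2
        steps += 1
    return data, steps


def ring_sum(vectors):
    """Simulate a global sum by ring reduce-scatter plus all-gather.

    Returns (per-node results, elements sent per node).
    Assumes the vector length n is divisible by the node count p.
    """
    p = len(vectors)
    n = len(vectors[0])
    assert n % p == 0, "sketch assumes n is divisible by p"
    b = n // p  # block size
    data = [list(v) for v in vectors]
    sent = 0

    # Reduce-scatter: at step s, node i sends block (i - s) mod p to node i + 1.
    for s in range(p - 1):
        msgs = [((i - s) % p, data[i][((i - s) % p) * b:((i - s) % p + 1) * b])
                for i in range(p)]
        for i in range(p):
            k, block = msgs[(i - 1) % p]  # message from the left neighbour
            for j, x in enumerate(block):
                data[i][k * b + j] += x
        sent += b

    # Node i now holds the fully summed block (i + 1) mod p.
    # All-gather: at step s, node i forwards block (i + 1 - s) mod p.
    for s in range(p - 1):
        msgs = [((i + 1 - s) % p,
                 data[i][((i + 1 - s) % p) * b:((i + 1 - s) % p + 1) * b])
                for i in range(p)]
        for i in range(p):
            k, block = msgs[(i - 1) % p]
            data[i][k * b:(k + 1) * b] = block
        sent += b

    return data, sent


if __name__ == "__main__":
    vecs = [[i + j for j in range(8)] for i in range(4)]  # p = 4, n = 8
    print(recursive_doubling_sum(vecs))
    print(ring_sum(vecs))
```

In this sketch recursive doubling finishes in log2(p) steps but each node sends n·log2(p) elements, whereas the ring variant needs 2(p-1) steps but each node sends only 2(p-1)n/p elements, which is below 2n. That is the kind of crossover in n that motivates the hybrid algorithms mentioned in the abstract.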
Multicasting is an information dissemination problem which consists, for a processor of a distribute...
We report on a project to develop a unified approach for building a library of collective communicat...
The Reflecting and Growing mappings have been proposed to map parallel divide-and-conquer algorithms...
The complete exchange (or all-to-all personalized) communication pattern occurs frequently in many i...
A new approach to the design of collective communication operations in wormhole-routed m...
Most MPC networks use wormhole routing to reduce the effect of path length on communicat...
In this paper, we outline a unified approach for building a library of collective communication oper...
Several algorithms for performing gossiping on one- and higher dimensional meshes are presented. As ...
The growing demand for high processing power in various scientific and engineering applications has ...
Consider a message-passing system of n processors, in which each processor holds one piece ...
Efficient routing of messages is the key to the performance of multicomputers. Multicast communicati...
2D-mesh and torus networks have often been proposed as the interconnection pattern for parallel comp...
The aim of this thesis is the study of the most useful communication schemes, especially the broadcast...