This paper presents solutions for the problem of many-to-many personalized communication, with bounded incoming and outgoing traffic, on a distributed memory parallel machine. We present a two-stage algorithm that decomposes the many-to-many communication with possibly high variance in message size into two communications with low message size variance. The algorithm is deterministic and takes time 2tµ (+ lower order terms) when t \u3e= O(p2; + p tau/µ). Here t is the maximum outgoing or incoming traffic at any processor, tau is the startup overhead and µ is the inverse of the data transfer rate. Optimality is achieved when the traffic is large, a condition that is usually satisfied in practice on coarse-grained architectures. The algorithm...
In this paper, we study the various communication algorithms on the pipeline multicomputer. We show ...
Hypercube algorithms are developed for a variety of communication-intensive tasks such as transposin...
A sequential computer executes one CPU instruction at a time. Over the years sequential computers ha...
This paper presents algorithms for implementing the transportation primitive on a distributed memory...
In this paper we present several algorithms for performing all-to-many personalized communication on...
In this paper we present several algorithms for all-too-many personalized communications which avoid...
Parallelization of many irregular applications results in unstructured collective communication. In ...
This paper presents algorithms for implementing the transportation primitive on a distributed memory...
Hypercube algorithms may be developed for a variety of communication-intensive tasks such as sending...
With the advent of new routing methods, the distance to which a message is sent is becoming relative...
The current trends in high performance computing show that large machines with tens of thousands of ...
In this paper we propose a new approach to the study of the communication requirements of distribute...
AbstractWe study the effect of limited communication throughput on parallel computation in a setting...
Cover title.Includes bibliographical references (p. 12).Supported by the National Science Foundation...
) David R. Helman David A. Bader Joseph J'aJ'a y Institute for Advanced Computer Stud...
In this paper, we study the various communication algorithms on the pipeline multicomputer. We show ...
Hypercube algorithms are developed for a variety of communication-intensive tasks such as transposin...
A sequential computer executes one CPU instruction at a time. Over the years sequential computers ha...
This paper presents algorithms for implementing the transportation primitive on a distributed memory...
In this paper we present several algorithms for performing all-to-many personalized communication on...
In this paper we present several algorithms for all-too-many personalized communications which avoid...
Parallelization of many irregular applications results in unstructured collective communication. In ...
This paper presents algorithms for implementing the transportation primitive on a distributed memory...
Hypercube algorithms may be developed for a variety of communication-intensive tasks such as sending...
With the advent of new routing methods, the distance to which a message is sent is becoming relative...
The current trends in high performance computing show that large machines with tens of thousands of ...
In this paper we propose a new approach to the study of the communication requirements of distribute...
AbstractWe study the effect of limited communication throughput on parallel computation in a setting...
Cover title.Includes bibliographical references (p. 12).Supported by the National Science Foundation...
) David R. Helman David A. Bader Joseph J'aJ'a y Institute for Advanced Computer Stud...
In this paper, we study the various communication algorithms on the pipeline multicomputer. We show ...
Hypercube algorithms are developed for a variety of communication-intensive tasks such as transposin...
A sequential computer executes one CPU instruction at a time. Over the years sequential computers ha...