This paper presents algorithms for implementing the transportation primitive on a distributed memory parallel architecture. The transportation primitive performs many-to-many personalized communication with bounded incoming and outgoing traffic. We present a two-stage deterministic algorithm that decomposes a communication with possibly high variance in message size into two communication stages with low message size variance. If the maximum outgoing or incoming traffic at any processor is t, transportation can be done in 2tμ time (+ lower order terms) when t ≥ O(p² + pτ/μ) (μ is the inverse of the data transfer rate, τ is the startup overhead). If the maximum outgoing and incoming traffic are r and c respectively, transportation can be done in (r+c...
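The idea of trading one high-variance exchange for two low-variance ones can be illustrated with a small sketch. The code below is not the paper's algorithm; it is one simple way to get the stated effect, under the assumption that each source deals its outgoing data units round-robin across all p processors (stage 1) and each intermediate then forwards what it holds to the final destinations (stage 2). The function name `two_stage_split` and the traffic-matrix representation are hypothetical and chosen only for illustration.

```python
def two_stage_split(traffic):
    """Split a many-to-many exchange into two balanced stages.

    traffic[i][j] is the number of data units processor i sends to j.
    Returns (stage1, stage2), where stage1[i][k] is what source i sends
    to intermediate k, and stage2[k][j] is what intermediate k forwards
    to destination j. Illustrative assumption: round-robin dealing.
    """
    p = len(traffic)
    stage1 = [[0] * p for _ in range(p)]
    stage2 = [[0] * p for _ in range(p)]
    for i, row in enumerate(traffic):
        offset = 0  # position in source i's concatenated outgoing stream
        for j, units in enumerate(row):
            for u in range(units):
                k = (offset + u) % p  # intermediate chosen round-robin
                stage1[i][k] += 1
                stage2[k][j] += 1
            offset += units
    return stage1, stage2


# One large message (9 units) and one small message (3 units) on p = 3.
traffic = [
    [0, 9, 0],
    [0, 0, 0],
    [3, 0, 0],
]
stage1, stage2 = two_stage_split(traffic)
print(stage1)  # [[3, 3, 3], [0, 0, 0], [1, 1, 1]]
print(stage2)  # [[1, 3, 0], [1, 3, 0], [1, 3, 0]]
```

In this sketch the messages a source sends in stage 1 differ in size by at most one unit, and every unit still reaches its destination after stage 2. The total volume moved is twice the original, which is in line with the factor of 2 in the 2tμ bound quoted above, though the paper's actual scheduling and its lower order terms are not modelled here.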
Abstract—This paper addresses the problem of improving the performance of the Rapidly-exploring Rand...
Abstract. We propose a model, LPRAM, for parallel random access machines with local memory that captur...
In distributed shared memory multiprocessors, remote memory accesses generate processor-to-memory tr...
This paper presents algorithms for implementing the transportation primitive on a distributed memory...
This paper presents solutions for the problem of many-to-many personalized communication, with bound...
This dissertation focuses on scalable parallel algorithms for irregular communication, random data a...
We present algorithms for the transportation of data in parallel and distributed systems that would ...
In this paper, we study the various communication algorithms on the pipeline multicomputer. We show ...
Efficient communication in networks is a prerequisite to exploit the performance of large parallel...
In this paper we present several algorithms for performing all-to-many personalized communication on...
Abstract. Parallelizing a sequential algorithm—i.e., manually or automatically converting it into an...
In this paper we propose a new approach to the study of the communication requirements of distribute...
Abstract. We study the effect of limited communication throughput on parallel computation in a setting...
This paper considers the problem of creating message-passing protocols for parallel computers. It is...
David R. Helman, David A. Bader, Joseph JáJá, Institute for Advanced Computer Stud...