In this paper we present several algorithms for decomposing all-to-many personalized communication into a set of disjoint partial permutations. These partial permutations avoid both node contention and link contention. We discuss the theoretical complexity of these algorithms and study their effectiveness from the viewpoint of both static scheduling and runtime scheduling. Experimental results for our algorithms on the iPSC/860 are presented.

1 Introduction

Experience with parallel computing has shown that a "good" mapping is a critical part of executing a program on massively parallel processing machines. The mapping can typically be performed statically or dynamically. For most regular and synchronous problems, this mapping c...
A fundamental challenge for parallel computing is to obtain high-level, architecture independent, al...
Communication overhead is one of the main factors that can limit the speedup of parallel programs on...
Hypercube algorithms are developed for a variety of communication-intensive tasks such as transposi...
In this paper we present several algorithms for decomposing all-to-many personalized communication i...
With the advent of new routing methods, the distance to which a message is sent is becoming relative...
In this paper we present several algorithms for performing all-to-many personalized communication on...
With the advent of new routing methods, the distance to which a message is sent is becoming relative...
In this paper we present several algorithms for all-to-many personalized communications which avoid...
Parallelization of many irregular applications results in unstructured collective communication. In ...
The objective of this research is to propose a low-complexity static scheduling and allocation algor...
Automatic partitioning, scheduling and code generation are of major importance in the development of...
A fundamental challenge for parallel computing is to obtain high-level, architecture independent, al...
The objective of this research is to propose a low-complexity static scheduling and allocation algori...
Scheduling is an important issue in parallel processing. Most scheduling algorithms assign ta...
In applications requiring very high throughput or which have real-time deadlines, the use of paralle...
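As an illustration of the idea in the opening abstract, the following is a minimal sketch of splitting a set of messages into partial permutations, that is, rounds in which every node sends at most one message and receives at most one message. It is a simple greedy heuristic written for this summary, not one of the paper's algorithms. The function name and the (source, destination) message format are assumptions, and the sketch models node contention only; link contention depends on the network topology and routing, which are not represented here.

```python
def decompose_into_partial_permutations(messages):
    """Greedily split (source, destination) pairs into contention-free rounds.

    Hypothetical helper: each returned round is a partial permutation, meaning
    no node appears twice as a sender or twice as a receiver within a round.
    """
    remaining = list(messages)
    rounds = []
    while remaining:
        busy_senders, busy_receivers = set(), set()
        current, deferred = [], []
        for src, dst in remaining:
            if src in busy_senders or dst in busy_receivers:
                # Scheduling this message now would cause node contention.
                deferred.append((src, dst))
            else:
                busy_senders.add(src)
                busy_receivers.add(dst)
                current.append((src, dst))
        rounds.append(current)
        remaining = deferred
    return rounds


# Example: node 2 must receive three messages, so at least three rounds are needed.
example = [(0, 1), (0, 2), (1, 2), (2, 0), (3, 2)]
for step, schedule in enumerate(decompose_into_partial_permutations(example)):
    print(step, schedule)
```

The first message left over in each pass is always scheduled, so the loop terminates. A greedy pass like this does not guarantee the minimum number of rounds in general.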