Parallelization of many irregular applications results in unstructured collective communication. In this paper we present a distributed algorithm for scheduling such communication on parallel machines. We describe the performance of this algorithm on the CM-5 and show that the scheduling algorithm has very small overhead and gives a significant improvement over naive methods
Collective communication allows efficient communication and synchronization among a collection of pr...
This paper tackles the problem of parallelizing heterogeneous computational tasks across a number of...
Institute for Computing Systems ArchitectureProgrammers face daunting problems when attempting to de...
In this paper we present several algorithms for performing all-to-many personalized communication on...
In this paper, we study the communication characteristics of the CM-5 and the performance effects of...
In this paper we present several algorithms for all-too-many personalized communications which avoid...
This paper presents solutions for the problem of many-to-many personalized communication, with bound...
In this paper we present several algorithms for decomposing all-to-many personalized communication i...
With the advent of new routing methods, the distance to which a message is sent is becoming relative...
This paper presents algorithms for implementing the transportation primitive on a distributed memory...
With the advent of new routing methods, the distance to which a message is sent is becoming relative...
This thesis focus on the problem of scheduling the tasks of a parallel application taking into accou...
This work presents and evaluates algorithms for MPI collective communication operations on high perf...
Scheduling is an important issue in parallel processing. Most scheduling algorithms makes assigns ta...
Abstract. In the context of generating efficient, contention free schedules for inter-node communica...
Collective communication allows efficient communication and synchronization among a collection of pr...
This paper tackles the problem of parallelizing heterogeneous computational tasks across a number of...
Institute for Computing Systems ArchitectureProgrammers face daunting problems when attempting to de...
In this paper we present several algorithms for performing all-to-many personalized communication on...
In this paper, we study the communication characteristics of the CM-5 and the performance effects of...
In this paper we present several algorithms for all-too-many personalized communications which avoid...
This paper presents solutions for the problem of many-to-many personalized communication, with bound...
In this paper we present several algorithms for decomposing all-to-many personalized communication i...
With the advent of new routing methods, the distance to which a message is sent is becoming relative...
This paper presents algorithms for implementing the transportation primitive on a distributed memory...
With the advent of new routing methods, the distance to which a message is sent is becoming relative...
This thesis focus on the problem of scheduling the tasks of a parallel application taking into accou...
This work presents and evaluates algorithms for MPI collective communication operations on high perf...
Scheduling is an important issue in parallel processing. Most scheduling algorithms makes assigns ta...
Abstract. In the context of generating efficient, contention free schedules for inter-node communica...
Collective communication allows efficient communication and synchronization among a collection of pr...
This paper tackles the problem of parallelizing heterogeneous computational tasks across a number of...
Institute for Computing Systems ArchitectureProgrammers face daunting problems when attempting to de...