Estimating the communication cost of executing a program on distributed memory machines is important for evaluating the overheads due to repartitioning. We present a scheme that works with reasonable efficiency for arrays of at most three dimensions. The Hyperplane Partitioning technique of [10] is extended to complete programs by using this scheme to estimate the communication cost.

1 Introduction

Any strategy for automatic data partitioning needs a way of estimating the communication cost incurred when the strategy is implemented on a target architecture. Many researchers have developed tools [3, 4, 5] that estimate, at compile time, the performance of a parallel program with explicit communication. In [2], they empl...
this paper we will present a solution to the problem of determining loop and data partitions automat...
Abstract. Array redistribution is required often in programs on distributed memory parallel computer...
Distributed shared-memory systems provide scalable performance and a convenient model for parallel p...
This paper addresses the problem of partitioning data for distributed memory machines (multicomputer...
Most of the current compiler projects for distributed memory architectures leave the critical and ti...
Automatic Global Data Partitioning for Distributed Memory Machines (DMMs) is a difficult problem. Di...
An important problem facing numerous research projects on parallelizing compilers for distributed me...
This paper addresses the problem of partitioning data for distributed memory machines or multicomput...
This paper describes a number of optimizations that can be used to support the efficient execution o...
We present an intermediate representation of a program called the Alignment-Distribution Graph that ...
An approach to distributed memory parallel programming that has recently become popular is one wher...
160 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 1992.Distributed-memory parallel c...