Data distribution is one of the key aspects that a parallelizing compiler for a distributed memory architecture should consider, in order to get efficiency from the system. The cost of accessing local and remote data can be one or several orders of magnitude different, and this can dramatically affect performance. In this report, we propose a novel approach to automatically perform static data distribution. All the constraints related to parallelism and data movement are contained in a single data structure, the Communication-Parallelism Graph (CPG). The problem is solved using a linear 0-1 integer programming model and solver. This allows us to find the best solution for the problem for one-dimensional array distributions. An iterative app...
An important problem facing parallelizing compilers for distributed memory mimd machines is that of ...
[[abstract]]In distributed memory multicomputers, local memory accesses are much faster than those i...
Abstract. Array redistribution is required often in programs on distributed memory parallel computer...
This paper describes an automatic data distribution method which deal with both the alignment and th...
Massively Parallel Processor systems provide the required computational power to solve most large sc...
Massively Parallel Processor systems provide the required computational power to solve most large sc...
Data parallel programming languages, such as HPF, are the easiest way to program Distributed Memory ...
Distributed-memory multicomputers, such as the Intel iPSC/860, the Intel Paragon, the IBM SP-1 /SP-2...
We consider distribution at compile time of the array data in a distributed-memory implementation of...
We present algorithms for the transportation of data in parallel and distributed systems that would ...
An algorithm for mapping an arbitrary, multidimensional array onto an arbitrarily shaped multidimens...
We present an intermediate representation of a program called the Alignment-Distribution Graph that ...
This paper addresses the problem of partitioning data for distributed memory machines (multicomputer...
In this paper we present a method to obtain a set of candidate distributions for a program fragment....
160 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 1992.Distributed-memory parallel c...
An important problem facing parallelizing compilers for distributed memory mimd machines is that of ...
[[abstract]]In distributed memory multicomputers, local memory accesses are much faster than those i...
Abstract. Array redistribution is required often in programs on distributed memory parallel computer...
This paper describes an automatic data distribution method which deal with both the alignment and th...
Massively Parallel Processor systems provide the required computational power to solve most large sc...
Massively Parallel Processor systems provide the required computational power to solve most large sc...
Data parallel programming languages, such as HPF, are the easiest way to program Distributed Memory ...
Distributed-memory multicomputers, such as the Intel iPSC/860, the Intel Paragon, the IBM SP-1 /SP-2...
We consider distribution at compile time of the array data in a distributed-memory implementation of...
We present algorithms for the transportation of data in parallel and distributed systems that would ...
An algorithm for mapping an arbitrary, multidimensional array onto an arbitrarily shaped multidimens...
We present an intermediate representation of a program called the Alignment-Distribution Graph that ...
This paper addresses the problem of partitioning data for distributed memory machines (multicomputer...
In this paper we present a method to obtain a set of candidate distributions for a program fragment....
160 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 1992.Distributed-memory parallel c...
An important problem facing parallelizing compilers for distributed memory mimd machines is that of ...
[[abstract]]In distributed memory multicomputers, local memory accesses are much faster than those i...
Abstract. Array redistribution is required often in programs on distributed memory parallel computer...