This paper presents a framework to find good distributions for the global arrays at different program points in the presence of procedure calls. The distributions are chosen for their ability to offset the redistribution overheads by contributing significantly towards increasing the performance gains. The algorithm uses interprocedural analysis and dynamic programming techniques. The working of the algorithm has been demonstrated for a CFD kernel
This paper proposes a compiler strategy for mapping FORTRAN programs onto distributed memory compute...
Shared-memory multiprocessor systems can achieve high performance levels when appropriate work paral...
On shared memory parallel computers (SMPCs) it is natural to focus on decomposing the computation (...
Abstract. Array redistribution is required often in programs on distributed memory parallel computer...
Languages such as High Performance Fortran implement parallel algorithms by distributing large data ...
An important problem facing parallelizing compilers for distributed memory mimd machines is that of ...
Distributed-memory multicomputers, such as the Intel iPSC/860, the Intel Paragon, the IBM SP-1 /SP-2...
Data parallel languages like High Performance Fortran (HPF) are emerging as the architecture indepen...
Determining an appropriate data distribution among different memories is critical to the performance...
We consider distribution at compile time of the array data in a distributed-memory implementation of...
Data parallel languages like High Performance Fortran (HPF) are emerging as the architecture indepen...
This paper describes a number of optimizations that can be used to support the efficient execution o...
Many complicated parallel computing applications are composed of several stages. As the program proc...
Dynamic redistribution of arrays is required very often in programs on distributed memory parallel c...
This paper addresses the problem of partitioning data for distributed memory machines (multicomputer...
This paper proposes a compiler strategy for mapping FORTRAN programs onto distributed memory compute...
Shared-memory multiprocessor systems can achieve high performance levels when appropriate work paral...
On shared memory parallel computers (SMPCs) it is natural to focus on decomposing the computation (...
Abstract. Array redistribution is required often in programs on distributed memory parallel computer...
Languages such as High Performance Fortran implement parallel algorithms by distributing large data ...
An important problem facing parallelizing compilers for distributed memory mimd machines is that of ...
Distributed-memory multicomputers, such as the Intel iPSC/860, the Intel Paragon, the IBM SP-1 /SP-2...
Data parallel languages like High Performance Fortran (HPF) are emerging as the architecture indepen...
Determining an appropriate data distribution among different memories is critical to the performance...
We consider distribution at compile time of the array data in a distributed-memory implementation of...
Data parallel languages like High Performance Fortran (HPF) are emerging as the architecture indepen...
This paper describes a number of optimizations that can be used to support the efficient execution o...
Many complicated parallel computing applications are composed of several stages. As the program proc...
Dynamic redistribution of arrays is required very often in programs on distributed memory parallel c...
This paper addresses the problem of partitioning data for distributed memory machines (multicomputer...
This paper proposes a compiler strategy for mapping FORTRAN programs onto distributed memory compute...
Shared-memory multiprocessor systems can achieve high performance levels when appropriate work paral...
On shared memory parallel computers (SMPCs) it is natural to focus on decomposing the computation (...