Determining an appropriate data distribution among different memories is critical to the performance of data-parallel programs on distributedmemory machines. By analyzing the computational load of data arrays and the communication complexity of various data movement operations in a program, this paper suggests a first-order cost model for determining a small set of appropriate data distribution patterns among many possible choices. A new data distribution specification, namely CYBLOCK, is proposed to enhance the expressiveness of data distribution specifications being proposed in High Performance Fortran. Cost analysis of two case studies: a linear system solver and a Purdue-set benchmark loop, are used to illustrate the proposed evaluation...
Distributed-memory multicomputers, such as the Intel iPSC/860, the Intel Paragon, the IBM SP-1 /SP-2...
This work was also published as a Rice University thesis/dissertation: http://hdl.handle.net/1911/1...
Distributed-memory multiprocessing systems (DMS), such as Intel’s hypercubes, the Paragon, Thinking ...
Massively Parallel Processor systems provide the required computational power to solve most large sc...
Abstract: High performance computing (HPC) architectures are specialized machines which can reach th...
Massively Parallel Processor systems provide the required computational power to solve most large sc...
Estimating communication cost involved in executing a program on distributed memory machines is impo...
this report we have described how two methods for automatically determining convenient data distribu...
An important problem facing parallelizing compilers for distributed memory mimd machines is that of ...
The goal of languages like Fortran D or High Performance Fortran (HPF) is to provide a simple yet ef...
High performance computing (HPC) architectures are specialized machines which can reach their peak p...
This paper presents a framework to find good distributions for the global arrays at different progra...
This paper proposes a compiler strategy for mapping FORTRAN programs onto distributed memory compute...
This paper presents a framework for characterizing the distribution of fine-grained parallelism, dat...
The choice of a good data distribution scheme is critical to performance of data-parallel applicatio...
Distributed-memory multicomputers, such as the Intel iPSC/860, the Intel Paragon, the IBM SP-1 /SP-2...
This work was also published as a Rice University thesis/dissertation: http://hdl.handle.net/1911/1...
Distributed-memory multiprocessing systems (DMS), such as Intel’s hypercubes, the Paragon, Thinking ...
Massively Parallel Processor systems provide the required computational power to solve most large sc...
Abstract: High performance computing (HPC) architectures are specialized machines which can reach th...
Massively Parallel Processor systems provide the required computational power to solve most large sc...
Estimating communication cost involved in executing a program on distributed memory machines is impo...
this report we have described how two methods for automatically determining convenient data distribu...
An important problem facing parallelizing compilers for distributed memory mimd machines is that of ...
The goal of languages like Fortran D or High Performance Fortran (HPF) is to provide a simple yet ef...
High performance computing (HPC) architectures are specialized machines which can reach their peak p...
This paper presents a framework to find good distributions for the global arrays at different progra...
This paper proposes a compiler strategy for mapping FORTRAN programs onto distributed memory compute...
This paper presents a framework for characterizing the distribution of fine-grained parallelism, dat...
The choice of a good data distribution scheme is critical to performance of data-parallel applicatio...
Distributed-memory multicomputers, such as the Intel iPSC/860, the Intel Paragon, the IBM SP-1 /SP-2...
This work was also published as a Rice University thesis/dissertation: http://hdl.handle.net/1911/1...
Distributed-memory multiprocessing systems (DMS), such as Intel’s hypercubes, the Paragon, Thinking ...