An important research topic in parallelizing compilers is the generation of local memory access sequences and communication sets when compiling a data-parallel language into an SPMD (Single Program Multiple Data) program. In this paper, we present a scheme to efficiently enumerate local memory access sequences and to evaluate communication sets. We use a class table to store information extracted from array sections and data distribution patterns. Given array references and data distributions, we can use the class table to generate communication sets in closed form. Furthermore, we derive algorithms for sending and receiving the necessary data between processors. An algorithm for generating the class table is presented, and the time...
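To make the two terms concrete, the following is a minimal brute-force sketch, not the class-table scheme of the paper. It assumes a one-dimensional array with a block-cyclic distribution of block size `k` over `num_procs` processors, zero-based indices, and Fortran-style sections `(lower, upper, stride)` with an inclusive upper bound. All function and parameter names are hypothetical and chosen only for illustration.

```python
def section_indices(lower, upper, stride):
    """Global indices of the section lower:upper:stride (upper inclusive)."""
    if stride <= 0:
        raise ValueError("this sketch assumes a positive stride")
    return range(lower, upper + 1, stride)


def owner(g, k, num_procs):
    """Processor that owns global index g under a cyclic(k) distribution."""
    return (g // k) % num_procs


def local_address(g, k, num_procs):
    """Offset of global index g inside its owner's local memory."""
    return (g // (k * num_procs)) * k + g % k


def local_access_sequence(p, k, num_procs, section):
    """(global index, local address) pairs of the section owned by processor p."""
    return [
        (g, local_address(g, k, num_procs))
        for g in section_indices(*section)
        if owner(g, k, num_procs) == p
    ]


def communication_sets(k_lhs, k_rhs, num_procs, lhs_section, rhs_section):
    """For A(lhs_section) = B(rhs_section), map (sender, receiver) to the
    global indices of B that the sender must ship to the receiver."""
    sends = {}
    pairs = zip(section_indices(*lhs_section), section_indices(*rhs_section))
    for a, b in pairs:
        src = owner(b, k_rhs, num_procs)   # holds the value being read
        dst = owner(a, k_lhs, num_procs)   # performs the assignment
        if src != dst:
            sends.setdefault((src, dst), []).append(b)
    return sends


if __name__ == "__main__":
    # Section 0:15:3 of an array distributed cyclic(2) over 2 processors.
    print(local_access_sequence(0, 2, 2, (0, 15, 3)))
    # A(0:7:1) = B(0:14:2), both arrays distributed cyclic(2) over 2 processors.
    print(communication_sets(2, 2, 2, (0, 7, 1), (0, 14, 2)))
```

This sketch visits every element of the section, so its cost grows with the section length. A closed-form approach such as the one described in the abstract aims to avoid that per-element scan.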
High Performance Fortran and other similar languages have been designed as a m...
We present new techniques for compilation of arbitrarily nested loops with affine dependences for di...
Array redistribution is often required in programs on distributed memory parallel computer...
Address generation for compiling programs, written in HPF, to executable SPMD code is an...
This paper presents an efficient compilation technique to generate the local memory acce...
Data-parallel languages, such as High Performance Fortran, are designed to make programming of distr...
Arrays are mapped to processors through a two-step process---alignment followed by distribution---in...
Data-parallel languages allow programmers to use the familiar machine-independent programming style ...
An increasing number of programming languages, such as Fortran 90, HPF, and APL, provide...
This work was also published as a Rice University thesis/dissertation: http://hdl.handle.net/1911/19...
Distributed memory multiprocessors are increasingly being used to provide high performance for advan...
Communication set generation significantly influences the performance of parallel programs. However...
An important problem facing parallelizing compilers for distributed memory MIMD machines is that of ...
This paper presents compilation techniques used to compress holes, which are caused by the nonunit a...
This paper addresses the problem of partitioning data for distributed memory machines (multicomputer...