Languages such as High Performance Fortran implement parallel algorithms by distributing large data structures across a multicomputer system. To enhance parallelism and reduce communication, it is sometimes beneficial for a programmer to change the distribution between phases of the algorithm. We introduce a new mapping strategy, called the spiral mapping, that reduces the communication overhead of array redistribution. Redistribution using the spiral mapping exploits communication locality and reduces global communication conflicts. We implemented redistribution using the standard linear mapping and the spiral mapping for two dimensional arrays; for 1024 \Theta 1024 arrays, redistribution using the spiral mapping is 36% faster than using ...
[[abstract]]In many scientific applications, array redistribution is usually required to enhance dat...
This paper describes a general compiler optimization technique that reduces communication overhead f...
[[abstract]]Array redistribution is usually required, to enhance algorithm performance in many paral...
Dynamic redistribution of arrays is required very often in programs on distributed memory parallel c...
This work was also published as a Rice University thesis/dissertation: http://hdl.handle.net/1911/19...
High performance Fortran (HPF), as well as its predecessor FortranD, has attracted considerable atte...
Abstract. Array redistribution is required often in programs on distributed memory parallel computer...
[[abstract]]©1996 IEEE-The synthesis of consecutive array operations or array expressions into a com...
Applications with varying array access patterns require to dynamically change array mappings on dist...
High Performance Fortran (HPF), as well as its predecessor FortranD,has attracted considerable atten...
High Performance Fortran (HPF), as well as its predecessor FortranD, has attracted considerable atte...
Many complicated parallel computing applications are composed of several stages. As the program proc...
This paper describes efficient algorithms for runtime array redistribution in HPF programs. We consi...
Processors as rectilinear mesh Physical Processors ALIGN REALIGN DISTRIBUTE REDISTRIBUTE Figure 1: H...
[[abstract]]An increasing number of programming languages, such as Fortran 90, HPF, and APL, provide...
[[abstract]]In many scientific applications, array redistribution is usually required to enhance dat...
This paper describes a general compiler optimization technique that reduces communication overhead f...
[[abstract]]Array redistribution is usually required, to enhance algorithm performance in many paral...
Dynamic redistribution of arrays is required very often in programs on distributed memory parallel c...
This work was also published as a Rice University thesis/dissertation: http://hdl.handle.net/1911/19...
High performance Fortran (HPF), as well as its predecessor FortranD, has attracted considerable atte...
Abstract. Array redistribution is required often in programs on distributed memory parallel computer...
[[abstract]]©1996 IEEE-The synthesis of consecutive array operations or array expressions into a com...
Applications with varying array access patterns require to dynamically change array mappings on dist...
High Performance Fortran (HPF), as well as its predecessor FortranD,has attracted considerable atten...
High Performance Fortran (HPF), as well as its predecessor FortranD, has attracted considerable atte...
Many complicated parallel computing applications are composed of several stages. As the program proc...
This paper describes efficient algorithms for runtime array redistribution in HPF programs. We consi...
Processors as rectilinear mesh Physical Processors ALIGN REALIGN DISTRIBUTE REDISTRIBUTE Figure 1: H...
[[abstract]]An increasing number of programming languages, such as Fortran 90, HPF, and APL, provide...
[[abstract]]In many scientific applications, array redistribution is usually required to enhance dat...
This paper describes a general compiler optimization technique that reduces communication overhead f...
[[abstract]]Array redistribution is usually required, to enhance algorithm performance in many paral...