Appropriate data distribution has been found to be critical for obtaining good performance on Distributed Memory Multicomputers like the CM-5, Intel Paragon and IBM SP-1. It has also been found that some programs need to change their distributions during execution for better performance (redistribution). This work focuses on automatically generating efficient routines for redistribution. We present a new mathematical representation for regular distributions called PITFALLS and then discuss algorithms for redistribution based on this representation. One of the significant contributions of this work is being able to handle arbitrary source and target processor sets while performing redistribution. Another important contribution is the ability...
Abstract. In many scientic applications, array redistribution is usually required to enhance data lo...
[[abstract]]Array redistribution is usually required to enhance algorithm performance in many parall...
Many complicated parallel computing applications are composed of several stages. As the program proc...
Dynamic redistribution of arrays is required very often in programs on distributed memory parallel c...
Abstract. Array redistribution is required often in programs on distributed memory parallel computer...
[[abstract]]In many scientific applications, array redistribution is usually required to enhance dat...
For distributed memory multicomputers such as the Intel Paragon, the IBM SP-2, the NCUBE/2, and the ...
An important problem facing parallelizing compilers for distributed memory mimd machines is that of ...
We consider distribution at compile time of the array data in a distributed-memory implementation of...
[[abstract]]Array redistribution is usually required to enhance algorithm performance in many parall...
[[abstract]]Array redistribution is usually required, to enhance algorithm performance in many paral...
Distributed-memory multicomputers, such as the Intel iPSC/860, the Intel Paragon, the IBM SP-1 /SP-2...
[[abstract]]Array redistribution is usually required, to enhance algorithm performance in many paral...
This paper describes efficient algorithms for runtime array redistribution in HPF programs. We consi...
Languages such as High Performance Fortran implement parallel algorithms by distributing large data ...
Abstract. In many scientic applications, array redistribution is usually required to enhance data lo...
[[abstract]]Array redistribution is usually required to enhance algorithm performance in many parall...
Many complicated parallel computing applications are composed of several stages. As the program proc...
Dynamic redistribution of arrays is required very often in programs on distributed memory parallel c...
Abstract. Array redistribution is required often in programs on distributed memory parallel computer...
[[abstract]]In many scientific applications, array redistribution is usually required to enhance dat...
For distributed memory multicomputers such as the Intel Paragon, the IBM SP-2, the NCUBE/2, and the ...
An important problem facing parallelizing compilers for distributed memory mimd machines is that of ...
We consider distribution at compile time of the array data in a distributed-memory implementation of...
[[abstract]]Array redistribution is usually required to enhance algorithm performance in many parall...
[[abstract]]Array redistribution is usually required, to enhance algorithm performance in many paral...
Distributed-memory multicomputers, such as the Intel iPSC/860, the Intel Paragon, the IBM SP-1 /SP-2...
[[abstract]]Array redistribution is usually required, to enhance algorithm performance in many paral...
This paper describes efficient algorithms for runtime array redistribution in HPF programs. We consi...
Languages such as High Performance Fortran implement parallel algorithms by distributing large data ...
Abstract. In many scientic applications, array redistribution is usually required to enhance data lo...
[[abstract]]Array redistribution is usually required to enhance algorithm performance in many parall...
Many complicated parallel computing applications are composed of several stages. As the program proc...