Abstract. Array redistribution is required often in programs on distributed memory parallel computers. It is essential to use efficient algorithms for redistribution; otherwise the performance of the programs will degrade considerably. The redistribution overheads consist of two parts: index computation and inter-processor communication. In this paper, by using a notation for the local data description called an LDD, we propose a framework to optimize the array redistribution algorithm both in index computation and inter-processor communication. That is, our work makes an effort to optimize not only the computation cost but also communication cost for array redistribution algorithms. We present an efficient index computation method and gene...
[[abstract]]Array redistribution is usually required to enhance algorithm performance in many parall...
Estimating communication cost involved in executing a program on distributed memory machines is impo...
[[abstract]]In this paper, we present message encoding techniques to improve the performance of BLOC...
[[abstract]]In many scientific applications, array redistribution is usually required to enhance dat...
Run-time array redistribution is necessary to enhance the performance of parallel programs on distri...
Abstract. In many scientic applications, array redistribution is usually required to enhance data lo...
[[abstract]]Array redistribution is usually required, to enhance algorithm performance in many paral...
[[abstract]]Array redistribution is usually required, to enhance algorithm performance in many paral...
Languages such as High Performance Fortran implement parallel algorithms by distributing large data ...
Dynamic redistribution of arrays is required very often in programs on distributed memory parallel c...
This paper presents a framework to find good distributions for the global arrays at different progra...
[[abstract]]In this paper, we present efficient methods for multidimensional array redistribution. B...
This paper addresses the problem of partitioning data for distributed memory machines (multicomputer...
[[abstract]]Array redistribution is usually required to enhance algorithm performance in many parall...
[[abstract]]In distributed memory multicomputers, local memory accesses are much faster than those i...
[[abstract]]Array redistribution is usually required to enhance algorithm performance in many parall...
Estimating communication cost involved in executing a program on distributed memory machines is impo...
[[abstract]]In this paper, we present message encoding techniques to improve the performance of BLOC...
[[abstract]]In many scientific applications, array redistribution is usually required to enhance dat...
Run-time array redistribution is necessary to enhance the performance of parallel programs on distri...
Abstract. In many scientic applications, array redistribution is usually required to enhance data lo...
[[abstract]]Array redistribution is usually required, to enhance algorithm performance in many paral...
[[abstract]]Array redistribution is usually required, to enhance algorithm performance in many paral...
Languages such as High Performance Fortran implement parallel algorithms by distributing large data ...
Dynamic redistribution of arrays is required very often in programs on distributed memory parallel c...
This paper presents a framework to find good distributions for the global arrays at different progra...
[[abstract]]In this paper, we present efficient methods for multidimensional array redistribution. B...
This paper addresses the problem of partitioning data for distributed memory machines (multicomputer...
[[abstract]]Array redistribution is usually required to enhance algorithm performance in many parall...
[[abstract]]In distributed memory multicomputers, local memory accesses are much faster than those i...
[[abstract]]Array redistribution is usually required to enhance algorithm performance in many parall...
Estimating communication cost involved in executing a program on distributed memory machines is impo...
[[abstract]]In this paper, we present message encoding techniques to improve the performance of BLOC...