Run-time array redistribution is necessary to enhance the performance of parallel programs on distributed memory supercomputers. In this paper, we present an efficient algorithm for array redistribution from cyclic(x) on P processors to cyclic(Kx) on Q processors. The algorithm reduces the overall time for communication by considering the data transfer, communication schedule, and index computation costs. The proposed algorithm is based on a generalized circulant matrix formalism. Our algorithm generates a schedule that minimizes the number of communication steps and eliminates node contention in each communication step. The network bandwidth is fully utilized by ensuring that equal-sized messages are transferred in each communication step....
[[abstract]]Array redistribution is usually required to enhance algorithm performance in many parall...
Abstract. Array redistribution is required often in programs on distributed memory parallel computer...
[[abstract]]In many scientific applications, dynamic array redistribution is usually required to enh...
International audienceThis article is devoted to the run-time redistribution of one-dimensional arra...
International audienceThis article is devoted to the run-time redistribution of one-dimensional arra...
International audienceThis article is devoted to the run-time redistribution of one-dimensional arra...
[[abstract]]Array redistribution is usually required, to enhance algorithm performance in many paral...
International audienceThis article is devoted to the run-time redistribution of one-dimensional arra...
This article is devoted to the run-time redistribution of one-dimensional arrays that are distribute...
This article is devoted to the run-time redistribution of one-dimensional arrays that are distribute...
International audienceThis article is devoted to the run-time redistribution of one-dimensional arra...
[[abstract]]Array redistribution is usually required, to enhance algorithm performance in many paral...
International audienceThis article deals with the run-time comparison between several algorithms (in...
International audienceThis article deals with the run-time comparison between several algorithms (in...
[[abstract]]Array redistribution is usually required to enhance algorithm performance in many parall...
[[abstract]]Array redistribution is usually required to enhance algorithm performance in many parall...
Abstract. Array redistribution is required often in programs on distributed memory parallel computer...
[[abstract]]In many scientific applications, dynamic array redistribution is usually required to enh...
International audienceThis article is devoted to the run-time redistribution of one-dimensional arra...
International audienceThis article is devoted to the run-time redistribution of one-dimensional arra...
International audienceThis article is devoted to the run-time redistribution of one-dimensional arra...
[[abstract]]Array redistribution is usually required, to enhance algorithm performance in many paral...
International audienceThis article is devoted to the run-time redistribution of one-dimensional arra...
This article is devoted to the run-time redistribution of one-dimensional arrays that are distribute...
This article is devoted to the run-time redistribution of one-dimensional arrays that are distribute...
International audienceThis article is devoted to the run-time redistribution of one-dimensional arra...
[[abstract]]Array redistribution is usually required, to enhance algorithm performance in many paral...
International audienceThis article deals with the run-time comparison between several algorithms (in...
International audienceThis article deals with the run-time comparison between several algorithms (in...
[[abstract]]Array redistribution is usually required to enhance algorithm performance in many parall...
[[abstract]]Array redistribution is usually required to enhance algorithm performance in many parall...
Abstract. Array redistribution is required often in programs on distributed memory parallel computer...
[[abstract]]In many scientific applications, dynamic array redistribution is usually required to enh...