Abstract: Several scheduling schemes exist for parallelizing loops without dependences on shared and distributed memory systems. Efficiently parallelizing loops with dependences, however, is a more complicated task, and it becomes even more difficult when the loops are executed on a distributed memory cluster, where communication and synchronization can become a bottleneck. The problem lies in the processor idle time that occurs during the beginning and final stages of the execution. In this paper we propose a new scheduling scheme that minimizes processor idle time and thereby improves load balancing and performance. The new scheme is applied to two-dimensional iteration spaces with dependences. The proposed scheduling scheme follows ...
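The idle time at the beginning and final stages of execution can be illustrated with a minimal sketch. It does not implement the scheduling scheme proposed in the paper; it only models a conventional pipelined (wavefront) execution of a two-dimensional iteration space under assumptions that are not taken from the source: the space is split into equal tiles, each processor owns one column of tiles, every tile depends on the previous tile in its own column and on the same-row tile of the left neighbour, each tile takes one time unit, and communication cost is ignored. The function name and parameters are hypothetical.

```python
def simulate_wavefront(num_procs, tiles_per_proc, tile_time=1):
    """Simulate a pipelined wavefront over a 2D tiled iteration space.

    Assumptions (illustrative only, not from the paper):
      - processor p owns tile column p and executes its tiles in order;
      - tile k on processor p needs tile k-1 on p and tile k on p-1;
      - every tile costs `tile_time`; communication is free.
    Returns (makespan, idle_start, idle_end), where the two lists hold,
    per processor, the idle time before its first tile and after its last.
    """
    finish = [[0] * tiles_per_proc for _ in range(num_procs)]
    for p in range(num_procs):
        for k in range(tiles_per_proc):
            # Time at which the processor itself becomes free.
            own_ready = finish[p][k - 1] if k > 0 else 0
            # Time at which the data from the left neighbour is available.
            neighbour_ready = finish[p - 1][k] if p > 0 else 0
            finish[p][k] = max(own_ready, neighbour_ready) + tile_time

    makespan = finish[-1][-1]
    idle_start = [finish[p][0] - tile_time for p in range(num_procs)]
    idle_end = [makespan - finish[p][-1] for p in range(num_procs)]
    return makespan, idle_start, idle_end


if __name__ == "__main__":
    makespan, idle_start, idle_end = simulate_wavefront(num_procs=4, tiles_per_proc=6)
    print("makespan:", makespan)
    print("idle at start per processor:", idle_start)
    print("idle at end per processor:", idle_end)
```

Under these assumptions, processors further down the pipeline wait longer before their first tile, and processors near the front finish early and wait for the rest, which is the start-up and wind-down idle time the abstract refers to.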
In the framework of perfect loop nests with uniform dependences, tiling has been extensively studied...
In the framework of fully permutable loops, tiling has been extensively studied as a source-to-sour...
Many computationally-intensive programs, such as those for differential equations, spatial interpola...
This paper addresses the problem of compiling nested loops for distributed memory machines. The rela...
This work addresses the issue of exploiting intra-tile parallelism by overlapping communication with...
We deal with compiler support for parallelizing perfectly nested loops for coarse-grain distributed ...
In this report we address the issue of loop tiling to minimize the completion time of the loop when ...
This paper investigates the idle time associated with a parallel computation, that is, the time that...
Iteration space tiling is a common strategy used by parallelizing compilers and in performance tunin...
In the framework of fully permutable loops, tiling is a compiler technique (al...
Subdividing the iteration space of a loop into blocks or tiles with a fixed maximum size has several...
This paper addresses the problem of compiling perfectly nested loops for multicomputers (distributed...