AMS subject classification: 68Q22, 90C90We discuss in this paper the problem of generating highly efficient code when a n + 1-dimensional nested loop program is executed on a n-dimensional torus/grid of distributed-memory general-purpose machines. We focus on a class of uniform recurrences with non-negative components of the dependency matrix. Using tiling the iteration space strategy we show that minimizing the total running time reduces to solving a non-trivial non-linear integer optimization problem. For the later we present a mathematical framework that enables us to derive an O(n log n) algorithm for finding a good approximate solution. The theoretical evaluations and the experimental results show that the obtained solution approximate...
In the framework of perfect loop nests with uniform dependences, tiling has been extensively studied...
International audienceWe consider a system of uniform recurrence equations (URE) of dimension one. W...
In this paper we address the problem of partitioning nested loops with non-uniform (irregular) depen...
this paper how to execute a class of n + 1-dimensional uniform recurrences in SPMD (Single Program M...
Iteration space tiling is a common strategy used by parallelizing compilers and in performance tunin...
Iteration space tiling is a common strategy used by parallelizing compilers to reduce communication ...
Many computationally-intensive programs, such as those for differential equations, spatial interpola...
Many computationally-intensive programs, such as those for differential equations, spatial interpola...
Many computationally-intensive programs, such as those for differential equations, spatial interpola...
The paper is devoted to the methods of automatic parallelization and software optimization. The auth...
This paper presents a novel approach for the problem of generating tiled code for nested for-loops, ...
This paper addresses the problem of compiling nested loops for distributed memory machines. The rela...
We deal with compiler support for parallelizing perfectly nested loops for coarse-grain distributed ...
In this report we address the issue of loop tiling to minimize the completion time of the loop when ...
This paper addresses the problem of compiling perfectly nested loops for multicomputers (distributed...
In the framework of perfect loop nests with uniform dependences, tiling has been extensively studied...
International audienceWe consider a system of uniform recurrence equations (URE) of dimension one. W...
In this paper we address the problem of partitioning nested loops with non-uniform (irregular) depen...
this paper how to execute a class of n + 1-dimensional uniform recurrences in SPMD (Single Program M...
Iteration space tiling is a common strategy used by parallelizing compilers and in performance tunin...
Iteration space tiling is a common strategy used by parallelizing compilers to reduce communication ...
Many computationally-intensive programs, such as those for differential equations, spatial interpola...
Many computationally-intensive programs, such as those for differential equations, spatial interpola...
Many computationally-intensive programs, such as those for differential equations, spatial interpola...
The paper is devoted to the methods of automatic parallelization and software optimization. The auth...
This paper presents a novel approach for the problem of generating tiled code for nested for-loops, ...
This paper addresses the problem of compiling nested loops for distributed memory machines. The rela...
We deal with compiler support for parallelizing perfectly nested loops for coarse-grain distributed ...
In this report we address the issue of loop tiling to minimize the completion time of the loop when ...
This paper addresses the problem of compiling perfectly nested loops for multicomputers (distributed...
In the framework of perfect loop nests with uniform dependences, tiling has been extensively studied...
International audienceWe consider a system of uniform recurrence equations (URE) of dimension one. W...
In this paper we address the problem of partitioning nested loops with non-uniform (irregular) depen...