In the framework of fully permutable loops, tiling has been extensively studied as a source-to-source program transformation. However, little work has been devoted to the mapping and scheduling of the tiles on physical processors. Moreover, targeting heterogeneous computing platforms has, to the best of our knowledge, never been considered. In this paper we extend tiling techniques to the context of limited computational resources with different-speed processors. In particular, we present efficient scheduling and mapping strategies that are asymptotically optimal. The practical usefulness of these strategies is fully demonstrated by MPI experiments on a heterogeneous network of workstations.Dans le cadre des boucles totalement permutables, ...
AbstractIn this tutorial, some basic ideas will be outlined and exemplified how graph transformation...
Here, we deal with a new fine grain parallel-pipelined architecture made up of heterogeneous N arith...
High-level synthesis (HLS) tools offer increased productivity regarding FPGA programming.However, du...
In this report we address the issue of loop tiling to minimize the completion time of the loop when ...
In the data parallel programming style the user usually specifies the data parallelism explicitly so...
In this paper, an efficient algorithm to implement loop partitioning is introduced and evaluated. We...
We study the computational power of rational Piecewise Constant Derivative (PCD) systems. PCD system...
In this paper, an efficient algorithm to simultaneously implement array alignment and data/computati...
In this paper, we survey loop parallelization algorithms, analyzing the dependence representations t...
It is easy to find errors and inefficient parts of a sequential program, by using a standard debugge...
Given a set $L$ of $n$ points in the $d$-dimensional Cartesian space $E^d$, and a query specifying a...
In this thesis, we present developments to the approach used by the LRI Parsys team to automatically...
Introduction Several scheduling algorithms have been developed for constraint satisfaction in real-...
Clusters of multicore/GPU nodes connected with a fast network offer very high therotical peak perfor...
The research presented in this thesis was conducted in the context of the Mosaic C, an experimental,...
AbstractIn this tutorial, some basic ideas will be outlined and exemplified how graph transformation...
Here, we deal with a new fine grain parallel-pipelined architecture made up of heterogeneous N arith...
High-level synthesis (HLS) tools offer increased productivity regarding FPGA programming.However, du...
In this report we address the issue of loop tiling to minimize the completion time of the loop when ...
In the data parallel programming style the user usually specifies the data parallelism explicitly so...
In this paper, an efficient algorithm to implement loop partitioning is introduced and evaluated. We...
We study the computational power of rational Piecewise Constant Derivative (PCD) systems. PCD system...
In this paper, an efficient algorithm to simultaneously implement array alignment and data/computati...
In this paper, we survey loop parallelization algorithms, analyzing the dependence representations t...
It is easy to find errors and inefficient parts of a sequential program, by using a standard debugge...
Given a set $L$ of $n$ points in the $d$-dimensional Cartesian space $E^d$, and a query specifying a...
In this thesis, we present developments to the approach used by the LRI Parsys team to automatically...
Introduction Several scheduling algorithms have been developed for constraint satisfaction in real-...
Clusters of multicore/GPU nodes connected with a fast network offer very high therotical peak perfor...
The research presented in this thesis was conducted in the context of the Mosaic C, an experimental,...
AbstractIn this tutorial, some basic ideas will be outlined and exemplified how graph transformation...
Here, we deal with a new fine grain parallel-pipelined architecture made up of heterogeneous N arith...
High-level synthesis (HLS) tools offer increased productivity regarding FPGA programming.However, du...