Time-tiling is necessary for efficient execution of iterative stencil computations. But the usual hyper-rectangular tiles cannot be used because of positive/negative dependence distances along the stencil's spatial dimensions. Several prior efforts have addressed this issue. However, known techniques trade enhanced data reuse for other causes of inefficiency, such as unbalanced parallelism, redundant computations, or increased control flow overhead incompatible with efficient GPU execution. We explore a new path to maximize the effectivness of time-tiling on iterative stencil computations. Our approach is particularly well suited for GPUs. It does not require any redundant computations, it favors coalesced global-memory access and data reus...
Highly-realistic materials in computer graphics are computationally and memory demanding. Currently,...
In the context of this thesis, our focus is on numerical linear algebra, more precisely on solution ...
Task-based systems have gained popularity because of their promise of exploiting the computational p...
Loop tiling is a loop transformation widely used to improve spatial and temporal data locality, to i...
While studied over several decades, the computation of boolean operations on polyhedra is almost alw...
In the framework of fully permutable loops, tiling has been extensively studied as a source-to-sourc...
Whereas most today parallel High Performance Computing (HPC) software is written as highly tuned cod...
In this report we address the issue of loop tiling to minimize the completion time of the loop when ...
Despite decades of research on high-level loop optimizations and theirsuccessful integration in prod...
In this thesis, we present our contributions to APOLLO (Automatic speculative POLyhedral Loop Optimi...
The goal of this thesis is to devise methods and algorithms for the automatic generation of isotropi...
The computation capabilities of mobile GPUs have greatly evolved in the last generations, allowing r...
In this thesis, we present our contributions to APOLLO: an automatic parallelization compiler that c...
Scheduling problems are already difficult on traditional parallel machines. They becomeextremely cha...
National audienceThe race for ever more computing power raises the issue of supercomputers' power co...
Highly-realistic materials in computer graphics are computationally and memory demanding. Currently,...
In the context of this thesis, our focus is on numerical linear algebra, more precisely on solution ...
Task-based systems have gained popularity because of their promise of exploiting the computational p...
Loop tiling is a loop transformation widely used to improve spatial and temporal data locality, to i...
While studied over several decades, the computation of boolean operations on polyhedra is almost alw...
In the framework of fully permutable loops, tiling has been extensively studied as a source-to-sourc...
Whereas most today parallel High Performance Computing (HPC) software is written as highly tuned cod...
In this report we address the issue of loop tiling to minimize the completion time of the loop when ...
Despite decades of research on high-level loop optimizations and theirsuccessful integration in prod...
In this thesis, we present our contributions to APOLLO (Automatic speculative POLyhedral Loop Optimi...
The goal of this thesis is to devise methods and algorithms for the automatic generation of isotropi...
The computation capabilities of mobile GPUs have greatly evolved in the last generations, allowing r...
In this thesis, we present our contributions to APOLLO: an automatic parallelization compiler that c...
Scheduling problems are already difficult on traditional parallel machines. They becomeextremely cha...
National audienceThe race for ever more computing power raises the issue of supercomputers' power co...
Highly-realistic materials in computer graphics are computationally and memory demanding. Currently,...
In the context of this thesis, our focus is on numerical linear algebra, more precisely on solution ...
Task-based systems have gained popularity because of their promise of exploiting the computational p...