Although Single Instruction Multiple Data (SIMD) units are available in general purpose processors already since the 1990s, state-of-the-art compilers are often still not capable to fully exploit them, i.e., they may miss to achieve the best possible performance. We present a new hardware-aware and adaptive loop tiling approach that is based on polyhedral transformations and explicitly dedicated to improve on auto-vectorization. It is an extension to the tiling algorithm implemented within the PluTo framework. In its default setting, PluTo uses static tile sizes and is already capable to enable the use of SIMD units but not primarily targeted to optimize it. We experimented with different tile sizes and found a strong relationship between t...
Many advances in automatic parallelization and optimization have been achieved through the polyhedra...
2013 Spring.Includes bibliographical references.With the introduction of multi-core processors, moti...
On modern architectures, a missed optimization can translate into performance degradations reaching ...
Although Single Instruction Multiple Data (SIMD) units are available in general purpose processors a...
Although Single Instruction Multiple Data (SIMD) units are available in general purpose processors a...
Modern compilers offer more and more capabilities to automatically parallelize code-regions if these...
International audienceOptimizing compilers apply numerous inter- dependent optimizations, leading to...
The Polyhedral model has proven to be a valuable tool for improving memory locality and exploiting p...
Multi-core processors are now in widespread use in almost all areas of computing: desktops, laptops ...
The polyhedral model is known to be a powerful framework to reason about high level loop transformat...
International audienceLoop tiling is a loop transformation widely used to improve spatial and tempor...
The Single Instruction Multiple Data (SIMD) paradigm promises speedup at relatively low silicon area...
Loop tiling is a loop transformation widely used to improve spatial and temporal data locality, to i...
International audienceThe polyhedral model is a powerful framework for automatic optimization and pa...
In this paper, we discuss techniques to transform sequential programs to texture/surface memory opt...
Many advances in automatic parallelization and optimization have been achieved through the polyhedra...
2013 Spring.Includes bibliographical references.With the introduction of multi-core processors, moti...
On modern architectures, a missed optimization can translate into performance degradations reaching ...
Although Single Instruction Multiple Data (SIMD) units are available in general purpose processors a...
Although Single Instruction Multiple Data (SIMD) units are available in general purpose processors a...
Modern compilers offer more and more capabilities to automatically parallelize code-regions if these...
International audienceOptimizing compilers apply numerous inter- dependent optimizations, leading to...
The Polyhedral model has proven to be a valuable tool for improving memory locality and exploiting p...
Multi-core processors are now in widespread use in almost all areas of computing: desktops, laptops ...
The polyhedral model is known to be a powerful framework to reason about high level loop transformat...
International audienceLoop tiling is a loop transformation widely used to improve spatial and tempor...
The Single Instruction Multiple Data (SIMD) paradigm promises speedup at relatively low silicon area...
Loop tiling is a loop transformation widely used to improve spatial and temporal data locality, to i...
International audienceThe polyhedral model is a powerful framework for automatic optimization and pa...
In this paper, we discuss techniques to transform sequential programs to texture/surface memory opt...
Many advances in automatic parallelization and optimization have been achieved through the polyhedra...
2013 Spring.Includes bibliographical references.With the introduction of multi-core processors, moti...
On modern architectures, a missed optimization can translate into performance degradations reaching ...