International audienceThe polyhedral model is powerful for analyzing and transforming static control programs, hence its intensive use for the optimization of data locality and automatic paralleliza-tion. Affine transformations excel at modeling control flow, to promote data reuse and to expose parallelism. The approach has also successfully been applied to the optimization of memory accesses (array expansion and contraction), although the available tools in the area are not as mature. Yet data locality also depends on other parameters such as data layout and data placement relatively to the memory hierarchy; these include spatial locality in cache lines and scalability on NUMA systems. This paper presents Ivie, a parallel intermediate lang...
This paper presents a data layout optimization technique based on the theory of hyperplanes from lin...
Due to their excellent price-performance ratio, clusters built from commodity nodes have become broa...
A common feature of many scalable parallel machines is non-uniform memory access - a processor can...
International audienceThe polyhedral model is powerful for analyzing and transforming static control...
Despite decades of work in this area, the construction of effective loop nest optimizers and paralle...
A common feature of many scalable parallel machines is non-uniform memory access (NUMA) --- data acc...
Abstract—With increasing numbers of cores, future CMPs (Chip Multi-Processors) are likely to have a ...
We present a unified approach to locality optimization that employs both data and control transforma...
The polyhedron model has been used successfully for auto-matic parallelization of code regions with ...
International audienceWith increasing numbers of cores, future CMPs (Chip Multi-Processors) are like...
this paper, we describe a framework for loop transformations and code generation for NUMA (non-unifo...
Current high-performance multicore processors provide users with a non-uniform memory access model (...
Many applications are memory intensive and thus are bounded by memory latency and bandwidth. While i...
The polyhedral model for loop parallelization has proved to be an effective tool for ad-vanced optim...
Abstract—The polyhedral model is an algebraic framework for affine program representations and trans...
This paper presents a data layout optimization technique based on the theory of hyperplanes from lin...
Due to their excellent price-performance ratio, clusters built from commodity nodes have become broa...
A common feature of many scalable parallel machines is non-uniform memory access - a processor can...
International audienceThe polyhedral model is powerful for analyzing and transforming static control...
Despite decades of work in this area, the construction of effective loop nest optimizers and paralle...
A common feature of many scalable parallel machines is non-uniform memory access (NUMA) --- data acc...
Abstract—With increasing numbers of cores, future CMPs (Chip Multi-Processors) are likely to have a ...
We present a unified approach to locality optimization that employs both data and control transforma...
The polyhedron model has been used successfully for auto-matic parallelization of code regions with ...
International audienceWith increasing numbers of cores, future CMPs (Chip Multi-Processors) are like...
this paper, we describe a framework for loop transformations and code generation for NUMA (non-unifo...
Current high-performance multicore processors provide users with a non-uniform memory access model (...
Many applications are memory intensive and thus are bounded by memory latency and bandwidth. While i...
The polyhedral model for loop parallelization has proved to be an effective tool for ad-vanced optim...
Abstract—The polyhedral model is an algebraic framework for affine program representations and trans...
This paper presents a data layout optimization technique based on the theory of hyperplanes from lin...
Due to their excellent price-performance ratio, clusters built from commodity nodes have become broa...
A common feature of many scalable parallel machines is non-uniform memory access - a processor can...