The challenge of extreme scale computing will test the limits of our ability to scale computa-tional infrastructure. While much attention has been given to the scalability of hardware designs and of the novel algorithms to be run thereon, and significant practical success has been achieved with data-flow-based automatic parallelization of dense array codes, current automatic parallelizers focus almost exclusively on transformations that are inherently not fully scalable. We discuss the limitations on asymptotic scalability of the transformations applied by successful automatic parallelizers like PLuTo, and review the literature of other approaches to this problem. As part of this survey, we discuss both the scalability and imple-mentation s...
International audienceAffine transformations have proven to be powerful for loop restructuring due t...
Affine transformations have proven to be powerful for loop restructuring due to their ability to mod...
Characteristics of full applications found in scientific computing industries today lead to challeng...
The Polyhedral model has proven to be a valuable tool for improving memory locality and exploiting p...
Modern compilers offer more and more capabilities to automatically parallelize code-regions if these...
Modern computers will increasingly rely on parallelism to achieve high computation rates. Techniques...
Although Single Instruction Multiple Data (SIMD) units are available in general purpose processors a...
Application codes reliably achieve performance far less than the advertised capabilities of existing...
Abstract—Increasingly, the main bottleneck limiting performance on emerging multi-core and many-core...
The key common bottleneck in most stencil codes is data movement, and prior research has shown that ...
Parallel processing has been used to increase performance of computing systems for the past several ...
Recent advances in polyhedral compilation technology have made it feasible to automatically transfor...
The model-based transformation of loop programs is a way of detecting fine-grained parallelism in se...
The limited ability of compilers to nd the parallelism in programs is a signi cant barrier to the us...
Automatic parallelization techniques for finding loop-based parallelism fail to find efficient paral...
International audienceAffine transformations have proven to be powerful for loop restructuring due t...
Affine transformations have proven to be powerful for loop restructuring due to their ability to mod...
Characteristics of full applications found in scientific computing industries today lead to challeng...
The Polyhedral model has proven to be a valuable tool for improving memory locality and exploiting p...
Modern compilers offer more and more capabilities to automatically parallelize code-regions if these...
Modern computers will increasingly rely on parallelism to achieve high computation rates. Techniques...
Although Single Instruction Multiple Data (SIMD) units are available in general purpose processors a...
Application codes reliably achieve performance far less than the advertised capabilities of existing...
Abstract—Increasingly, the main bottleneck limiting performance on emerging multi-core and many-core...
The key common bottleneck in most stencil codes is data movement, and prior research has shown that ...
Parallel processing has been used to increase performance of computing systems for the past several ...
Recent advances in polyhedral compilation technology have made it feasible to automatically transfor...
The model-based transformation of loop programs is a way of detecting fine-grained parallelism in se...
The limited ability of compilers to nd the parallelism in programs is a signi cant barrier to the us...
Automatic parallelization techniques for finding loop-based parallelism fail to find efficient paral...
International audienceAffine transformations have proven to be powerful for loop restructuring due t...
Affine transformations have proven to be powerful for loop restructuring due to their ability to mod...
Characteristics of full applications found in scientific computing industries today lead to challeng...