Code transformations, such as loop tiling and loop fusion, are of key importance for the efficient implementation of stencil compu-tations. However, their direct application to a large code base is costly and severely impacts program maintainability. While re-cently introduced domain-specific languages facilitate the appli-cation of such transformations, they typically still require manual tuning or auto-tuning techniques to select the transformations that yield optimal performance. In this paper, we introduce MODESTO, a model-driven stencil optimization framework, that for a stencil program suggests program transformations optimized for a given target architecture. Initially, we review and categorize data lo-cality transformations for sten...
Performance optimization of stencil computations has been widely studied in the literature, since th...
Understanding the most efficient design and utilization of emerging multicore systems is one of the ...
Spatial computing devices have been shown to significantly accelerate stencil computations, but have...
Our aim is to apply program transformations to stencil codes, in order to yield highest possible per...
Communicated by Guest Editors Our aim is to apply program transformations to stencil codes in order ...
Although stencil auto-tuning has shown tremendous potential in effectively utilizing architectural r...
We propose and evaluate a novel strategy for tuning the performance of a class of stencil computatio...
Abstract Performance optimization of stencil computations has beenwidely studied in the literature, ...
International audienceStencil computation represents an important numerical kernel in scientific com...
A widely used class of codes are stencil codes. Their general structure is very simple: data points ...
This work introduces a generalized framework for automatically tuning stencil computations to achiev...
Application codes reliably achieve performance far less than the advertised capabilities of existing...
Stencil computations are a class of algorithms operating on multi-dimensional arrays, which update a...
AbstractIt is crucial to optimize stencil computations since they are the core (and most computation...
Application codes reliably achieve performance far less than the advertised capabilities of existing...
Performance optimization of stencil computations has been widely studied in the literature, since th...
Understanding the most efficient design and utilization of emerging multicore systems is one of the ...
Spatial computing devices have been shown to significantly accelerate stencil computations, but have...
Our aim is to apply program transformations to stencil codes, in order to yield highest possible per...
Communicated by Guest Editors Our aim is to apply program transformations to stencil codes in order ...
Although stencil auto-tuning has shown tremendous potential in effectively utilizing architectural r...
We propose and evaluate a novel strategy for tuning the performance of a class of stencil computatio...
Abstract Performance optimization of stencil computations has beenwidely studied in the literature, ...
International audienceStencil computation represents an important numerical kernel in scientific com...
A widely used class of codes are stencil codes. Their general structure is very simple: data points ...
This work introduces a generalized framework for automatically tuning stencil computations to achiev...
Application codes reliably achieve performance far less than the advertised capabilities of existing...
Stencil computations are a class of algorithms operating on multi-dimensional arrays, which update a...
AbstractIt is crucial to optimize stencil computations since they are the core (and most computation...
Application codes reliably achieve performance far less than the advertised capabilities of existing...
Performance optimization of stencil computations has been widely studied in the literature, since th...
Understanding the most efficient design and utilization of emerging multicore systems is one of the ...
Spatial computing devices have been shown to significantly accelerate stencil computations, but have...