This paper describes a new technique for optimizing serial and parallel stencil- and stencil-like operations for cache-based architectures. This technique takes advantage of the semantic knowledge implicit in stencil-like computations. The technique is implemented as a source-to-source program transformation; because of its specificity it could not be expected of a conventional compiler. Empirical results demonstrate a uniform factor of two speedup. The experiments clearly show the benefits of this technique to be a consequence, as intended, of the reduction in cache misses. The test codes are based on a 5-point stencil obtained by the discretization of the Poisson equation and applied to a two-dimensional uniform grid using the Jacobi met...
Stencil computation represents an important numerical kernel in scientific com...
High-performance scientific computing relies increasingly on high-level large-scale object-oriented ...
We are witnessing a fundamental paradigm shift in computer design. Memory has been and is becoming m...
Application codes reliably achieve performance far less than the advertised capabilities of existing...
We present a new cache oblivious scheme for iterative stencil computations that performs beyond syst...
Performance optimization of stencil computations has been widely studied in the literature, ...
Our aim is to apply program transformations to stencil codes in order ...
New algorithms and optimization techniques are needed to balance the accelerating trend towards band...
Performance optimization of stencil computations has been widely studied in the literature, since th...
Stencil-based kernels constitute the core of many scientific applications on block-structured grids....
It is crucial to optimize stencil computations since they are the core (and most computation...
Our aim is to apply program transformations to stencil codes, in order to yield highest possible per...
This paper proposes tiling techniques based on data dependencies and not in code structur...