In this thesis, we introduce a new optimization theory for stencil-based applications which is centered both on a modification of the well-known owner-computes rule and on basic but powerful properties of toroidal spaces. The proposed optimization techniques provide notable results in different computational aspects: from the reduction of communication overhead to the reduction of computation time, through the minimization of memory requirements without performance loss. All classical optimization theory is based on defining transformations that can produce optimized programs which are computationally equivalent to the original ones. According to Kennedy, two programs are equivalent if, from the same input data, they produce identical output data...
Understanding the most efficient design and utilization of emerging multicore systems is one of the ...
Application codes reliably achieve performance far less than the advertised capabilities of existing...
This paper describes a new technique for optimizing serial and parallel stencil- and stencil-like op...
The optimization of data parallel programs is a challenging open problem. We analyzed in detail the ...
A widely used class of codes are stencil codes. Their general structure is very simple: data points ...
Stencil computations are operations on structured grids. They are frequently found in pa...
Stencil computations are widely used from physical simulations to machine-learning. They are embarra...
Our aim is to apply program transformations to stencil codes in order ...
Stencil computations are a widely used type of algorithm, found in applications from physical simula...
This research explores the idea that for certain optimization problems there is a way to parallelize...
This thesis studies the techniques of tiling optimizations for stencil programs. Traditionally, res...
Performance optimization of stencil computations has been widely studied in the literature, ...
Stencil computation represents an important numerical kernel in scientific com...
Code transformations, such as loop tiling and loop fusion, are of key importance for the efficient i...