Abstract In this paper, we present PATUS, a code gener-ation and auto-tuning framework for stencil computations targeted at multi- and manycore processors, such as mul-ticore CPUs and graphics processing units. PATUS, which stands for “Parallel Autotuned Stencils, ” generates a com-pute kernel from a specification of the stencil operation and a strategy which describes the parallelization and optimiza-tion to be applied, and leverages the autotuning methodol-ogy to optimize strategy-specific parameters for the given hardware architecture
Stencil computations arise in many scientific computing do-mains, and often represent time-critical ...
We propose and evaluate a novel strategy for tuning the performance of a class of stencil computatio...
Communicated by Guest Editors Our aim is to apply program transformations to stencil codes in order ...
In this paper, we present Patus, a code generation and auto-tuning framework for stencil computation...
This work introduces a generalized framework for automatically tuning stencil computations to achiev...
Although stencil auto-tuning has shown tremendous potential in effectively utilizing architectural r...
This study focuses on the key numerical technique of stencil computations, used in many different sc...
The recent transformation from an environment where gains in computational performance came from inc...
Understanding the most efficient design and utilization of emerging multicore systems is one of the ...
Abstract Performance optimization of stencil computations has beenwidely studied in the literature, ...
On multi-core clusters or supercomputers, how to get good performance when running high performance ...
International audienceStencil based computation on structured grids is a kernel at the heart of a la...
Stencil computations are a class of algorithms operating on multi-dimensional arrays, which update a...
The focus of this work is the automatic performance tuning of stencil computations on Graphics Proce...
Performance optimization of stencil computations has been widely studied in the literature, since th...
Stencil computations arise in many scientific computing do-mains, and often represent time-critical ...
We propose and evaluate a novel strategy for tuning the performance of a class of stencil computatio...
Communicated by Guest Editors Our aim is to apply program transformations to stencil codes in order ...
In this paper, we present Patus, a code generation and auto-tuning framework for stencil computation...
This work introduces a generalized framework for automatically tuning stencil computations to achiev...
Although stencil auto-tuning has shown tremendous potential in effectively utilizing architectural r...
This study focuses on the key numerical technique of stencil computations, used in many different sc...
The recent transformation from an environment where gains in computational performance came from inc...
Understanding the most efficient design and utilization of emerging multicore systems is one of the ...
Abstract Performance optimization of stencil computations has beenwidely studied in the literature, ...
On multi-core clusters or supercomputers, how to get good performance when running high performance ...
International audienceStencil based computation on structured grids is a kernel at the heart of a la...
Stencil computations are a class of algorithms operating on multi-dimensional arrays, which update a...
The focus of this work is the automatic performance tuning of stencil computations on Graphics Proce...
Performance optimization of stencil computations has been widely studied in the literature, since th...
Stencil computations arise in many scientific computing do-mains, and often represent time-critical ...
We propose and evaluate a novel strategy for tuning the performance of a class of stencil computatio...
Communicated by Guest Editors Our aim is to apply program transformations to stencil codes in order ...