We advocate the Loop-of-stencil-reduce pattern as a means of simplifying the implementation of data-parallel programs on heterogeneous multi-core platforms. Loop-of-stencil-reduce is general enough to subsume map, reduce, map-reduce, stencil, stencil-reduce, and, crucially, their usage in a loop in both data-parallel and streaming applications, or a combination of both. The pattern makes it possible to deploy a single stencil computation kernel on different GPUs. We discuss the implementation of Loop-of-stencil-reduce in FastFlow, a framework for the implementation of applications based on the parallel patterns. Experiments are presented to illustrate the use of Loop-of-stencil-reduce in developing data-parallel kernels running on heterogen...
A large number of algorithms for multidimensional signals processing and scientific computation come...
Application codes reliably achieve performance far less than the advertised capabilities of existing...
Current parallel programming frameworks aid developers to a great extent in implementing application...
In this paper we advocate the Loop-of-stencil-reduce pattern as a way to simplify the parallel progr...
Abstract—In this paper we advocate the Loop-of-stencil-reduce pattern as a way to simplify the paral...
The implementation of stencil computations on modern, massively parallel systems with GPUs and other...
Communicated by Guest Editors The implementation of stencil computations on modern, massively parall...
The implementation of stencil computations on modern, mas-sively parallel systems with GPUs and othe...
Stencil computations are iterative kernels often used to simulate the change in a discretized spatia...
The key common bottleneck in most stencil codes is data movement, and prior research has shown that ...
Abstract Performance optimization of stencil computations has beenwidely studied in the literature, ...
In recent years, Graphics Processing Units (GPUs) have piqued the interest of researchers in scienti...
A large number of algorithms for multidimensional signals processing and scientific computation come...
Performance optimization of stencil computations has been widely studied in the literature, since th...
In this paper, we present Patus, a code generation and auto-tuning framework for stencil computation...
A large number of algorithms for multidimensional signals processing and scientific computation come...
Application codes reliably achieve performance far less than the advertised capabilities of existing...
Current parallel programming frameworks aid developers to a great extent in implementing application...
In this paper we advocate the Loop-of-stencil-reduce pattern as a way to simplify the parallel progr...
Abstract—In this paper we advocate the Loop-of-stencil-reduce pattern as a way to simplify the paral...
The implementation of stencil computations on modern, massively parallel systems with GPUs and other...
Communicated by Guest Editors The implementation of stencil computations on modern, massively parall...
The implementation of stencil computations on modern, mas-sively parallel systems with GPUs and othe...
Stencil computations are iterative kernels often used to simulate the change in a discretized spatia...
The key common bottleneck in most stencil codes is data movement, and prior research has shown that ...
Abstract Performance optimization of stencil computations has beenwidely studied in the literature, ...
In recent years, Graphics Processing Units (GPUs) have piqued the interest of researchers in scienti...
A large number of algorithms for multidimensional signals processing and scientific computation come...
Performance optimization of stencil computations has been widely studied in the literature, since th...
In this paper, we present Patus, a code generation and auto-tuning framework for stencil computation...
A large number of algorithms for multidimensional signals processing and scientific computation come...
Application codes reliably achieve performance far less than the advertised capabilities of existing...
Current parallel programming frameworks aid developers to a great extent in implementing application...