Application development for modern high-performance systems with Graphics Processing Units (GPUs) relies on low-level programming approaches like CUDA and OpenCL, which leads to complex, lengthy and error-prone programs. In this paper, we present SkelCL – a high-level programming model for systems with multiple GPUs and its implementation as a library on top of OpenCL. SkelCL provides three main enhancements to the OpenCL standard: 1) computations are conveniently expressed using parallel patterns (skeletons); 2) memory management is simplified using parallel container data types; 3) an automatic data (re)distribution mechanism allows for scalability when using multi-GPU systems. We use a real-world example from the field of medical i...
Communicated by Guest Editors The implementation of stencil computations on modern, massively parall...
The implementation of stencil computations on modern, mas-sively parallel systems with GPUs and othe...
Algorithmic skeletons simplify software development: they abstract typical patterns of parallelism a...
AbstractApplication development for modern high-performance systems with Graphics Processing Units (...
Application development for modern high-performance systems with Graphics Processing Units (GPUs) re...
Application development for modern high-performance systems with Graphics Processing Units (GPUs) re...
Application programming for GPUs (Graphics Processing Units) is complex and error-prone...
While CUDA and OpenCL made general-purpose programming for Graphics Processing Units (GPU) pop...
c©2012 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for a...
Application development for modern high-performance systems with Graphics Processing Units (GPUs) cu...
Application development for modern high-performance systems with Graphics Processing Units (GPUs) cu...
The implementation of stencil computations on modern, massively parallel systems with GPUs and other...
Application programming for modern heterogeneous systems which comprise multi-core CPUs and multiple...
Application programming for GPUs (Graphics Processing Units) is complex and error-prone, becaus...
Application development for modern high-performance systems with many cores, i.e., comprising multip...
Communicated by Guest Editors The implementation of stencil computations on modern, massively parall...
The implementation of stencil computations on modern, mas-sively parallel systems with GPUs and othe...
Algorithmic skeletons simplify software development: they abstract typical patterns of parallelism a...
AbstractApplication development for modern high-performance systems with Graphics Processing Units (...
Application development for modern high-performance systems with Graphics Processing Units (GPUs) re...
Application development for modern high-performance systems with Graphics Processing Units (GPUs) re...
Application programming for GPUs (Graphics Processing Units) is complex and error-prone...
While CUDA and OpenCL made general-purpose programming for Graphics Processing Units (GPU) pop...
c©2012 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for a...
Application development for modern high-performance systems with Graphics Processing Units (GPUs) cu...
Application development for modern high-performance systems with Graphics Processing Units (GPUs) cu...
The implementation of stencil computations on modern, massively parallel systems with GPUs and other...
Application programming for modern heterogeneous systems which comprise multi-core CPUs and multiple...
Application programming for GPUs (Graphics Processing Units) is complex and error-prone, becaus...
Application development for modern high-performance systems with many cores, i.e., comprising multip...
Communicated by Guest Editors The implementation of stencil computations on modern, massively parall...
The implementation of stencil computations on modern, mas-sively parallel systems with GPUs and othe...
Algorithmic skeletons simplify software development: they abstract typical patterns of parallelism a...