Application development for modern high-performance systems with Graphics Processing Units (GPUs) relies on low-level programming approaches like CUDA and OpenCL, which leads to complex, lengthy and error-prone programs. In this paper, we present SkelCL – a high-level programming model for systems with multiple GPUs and its implementation as a library on top of OpenCL. SkelCL provides three main enhancements to the OpenCL standard: 1) computations are conveniently expressed using parallel patterns (skeletons); 2) memory management is simplified using parallel container data types; 3) an automatic data (re)distribution mechanism allows for scalability when using multi-GPU systems. We use a real-world example from the field of medical i...
Communicated by Guest Editors The implementation of stencil computations on modern, massively parall...
The implementation of stencil computations on modern, mas-sively parallel systems with GPUs and othe...
Algorithmic skeletons simplify software development: they abstract typical patterns of parallelism a...
Application development for modern high-performance systems with Graphics Processing Units (GPUs) re...
AbstractApplication development for modern high-performance systems with Graphics Processing Units (...
Application development for modern high-performance systems with Graphics Processing Units (GPUs) re...
Application programming for GPUs (Graphics Processing Units) is complex and error-prone...
While CUDA and OpenCL made general-purpose programming for Graphics Processing Units (GPU) pop...
c©2012 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for a...
Application development for modern high-performance systems with Graphics Processing Units (GPUs) cu...
Application development for modern high-performance systems with Graphics Processing Units (GPUs) cu...
Application programming for modern heterogeneous systems which comprise multi-core CPUs and multiple...
The implementation of stencil computations on modern, massively parallel systems with GPUs and other...
Application programming for GPUs (Graphics Processing Units) is complex and error-prone, becaus...
Application development for modern high-performance systems with many cores, i.e., comprising multip...
Communicated by Guest Editors The implementation of stencil computations on modern, massively parall...
The implementation of stencil computations on modern, mas-sively parallel systems with GPUs and othe...
Algorithmic skeletons simplify software development: they abstract typical patterns of parallelism a...
Application development for modern high-performance systems with Graphics Processing Units (GPUs) re...
AbstractApplication development for modern high-performance systems with Graphics Processing Units (...
Application development for modern high-performance systems with Graphics Processing Units (GPUs) re...
Application programming for GPUs (Graphics Processing Units) is complex and error-prone...
While CUDA and OpenCL made general-purpose programming for Graphics Processing Units (GPU) pop...
c©2012 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for a...
Application development for modern high-performance systems with Graphics Processing Units (GPUs) cu...
Application development for modern high-performance systems with Graphics Processing Units (GPUs) cu...
Application programming for modern heterogeneous systems which comprise multi-core CPUs and multiple...
The implementation of stencil computations on modern, massively parallel systems with GPUs and other...
Application programming for GPUs (Graphics Processing Units) is complex and error-prone, becaus...
Application development for modern high-performance systems with many cores, i.e., comprising multip...
Communicated by Guest Editors The implementation of stencil computations on modern, massively parall...
The implementation of stencil computations on modern, mas-sively parallel systems with GPUs and othe...
Algorithmic skeletons simplify software development: they abstract typical patterns of parallelism a...