This report presents SkePU, a C++ template library which provides a simple and unified interface for specifying data-parallel computations with the help of skeletons on GPUs using CUDA and OpenCL. The interface is also general enough to support other architectures, and SkePU implements both a sequential CPU and a parallel OpenMP back end. It also supports multi-GPU systems. Benchmarks show that copying data between the host and the GPU is often a bottleneck. Therefore a container which uses lazy memory copying has been implemented to avoid unnecessary memory transfers. SkePU was evaluated with small benchmarks and a larger application, a Runge-Kutta ODE solver. The results show that skeletal parallel programming is indeed a viable approach ...
Recent advances in multi-core and many-core processors requires programmers to exploit an increasing...
Application programming for modern heterogeneous systems which comprise multi-core CPUs and multiple...
Communicated by Guest Editors The implementation of stencil computations on modern, massively parall...
Abstract — SkePU is a C++ template library with a simple and unified interface for expressing data p...
SkePU (Skeleton Programming Framework for Multi-core CPU and Multi-GPU Systems) is a parallel comput...
c©2012 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for a...
Modern Graphics Processing Units (GPU) are increasingly used as general-purpose processors. While th...
Application development for modern high-performance systems with Graphics Processing Units (GPUs) cu...
Application development for modern high-performance systems with Graphics Processing Units (GPUs) re...
In this paper, we discuss the role, design and implementation of smart containers in the SkePU skele...
c©2012 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for a...
part of European Conference on Object-Oriented Programming - ECOOP 2009International audienceThis pa...
Application development for modern high-performance systems with many cores, i.e., comprising multip...
The implementation of stencil computations on modern, mas-sively parallel systems with GPUs and othe...
Graphics Processing Units (GPUs) have been successfully used to accelerate scientific applications d...
Recent advances in multi-core and many-core processors requires programmers to exploit an increasing...
Application programming for modern heterogeneous systems which comprise multi-core CPUs and multiple...
Communicated by Guest Editors The implementation of stencil computations on modern, massively parall...
Abstract — SkePU is a C++ template library with a simple and unified interface for expressing data p...
SkePU (Skeleton Programming Framework for Multi-core CPU and Multi-GPU Systems) is a parallel comput...
c©2012 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for a...
Modern Graphics Processing Units (GPU) are increasingly used as general-purpose processors. While th...
Application development for modern high-performance systems with Graphics Processing Units (GPUs) cu...
Application development for modern high-performance systems with Graphics Processing Units (GPUs) re...
In this paper, we discuss the role, design and implementation of smart containers in the SkePU skele...
c©2012 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for a...
part of European Conference on Object-Oriented Programming - ECOOP 2009International audienceThis pa...
Application development for modern high-performance systems with many cores, i.e., comprising multip...
The implementation of stencil computations on modern, mas-sively parallel systems with GPUs and othe...
Graphics Processing Units (GPUs) have been successfully used to accelerate scientific applications d...
Recent advances in multi-core and many-core processors requires programmers to exploit an increasing...
Application programming for modern heterogeneous systems which comprise multi-core CPUs and multiple...
Communicated by Guest Editors The implementation of stencil computations on modern, massively parall...