In this work, we examine the performance and energy efficiency when using Python for developing HPC codes running on the GPU. We investigate the portability of performance and energy efficiency between CUDA and OpenCL; between GPU generations; and between low-end, mid-range and high-end GPUs. Our findings show that for some combinations of GPU and GPU code, there is a significant speedup for CUDA over OpenCL, but that this does not hold in general. Our experiments show that performance in general varies more between different GPUs, than between using CUDA and OpenCL. Finally, we show that tuning for performance is a good way of tuning for energy efficiency
Abstract Graphics processing units (GPUs) have tremendous computing power, but are hard to program. ...
This paper explores the performance and energy efficiency of CUDA-enabled GPUs and multi-core SIMD C...
This paper explores the performance and energy efficiency of CUDA-enabled GPUs and multi-core SIMD C...
In this work, we examine the performance, energy efficiency, and usability when using Python for dev...
In this work, we examine the performance, energy efficiency, and usability when using Python for dev...
Abstract. Recently, OpenCL, a new open programming standard for GPGPU programming, has become availa...
Graphic Processing Units (GPUs) are becoming popular in computational physics. Seeing the increasing...
Compiled languages, such as C++, are conventionally used in the field of high performance computing ...
Compiled languages, such as C++, are conventionally used in the field of high performance computing ...
In this work, we evaluate OpenCL as a programming tool for developing performance-portable applicati...
GPU technologies are the paradigm shift in modern computing. This book will take you through archite...
In this work, we evaluate OpenCL as a programming tool for developing performance-portable applicati...
General purpose GPU based systems are highly attractive as they give potentially massive performance...
Parallel computing becomes a need to perform task as soon as possible. This can be done in two way i...
A lot of effort from academia and industry has been invested in exploring the suitability of low-pow...
Abstract Graphics processing units (GPUs) have tremendous computing power, but are hard to program. ...
This paper explores the performance and energy efficiency of CUDA-enabled GPUs and multi-core SIMD C...
This paper explores the performance and energy efficiency of CUDA-enabled GPUs and multi-core SIMD C...
In this work, we examine the performance, energy efficiency, and usability when using Python for dev...
In this work, we examine the performance, energy efficiency, and usability when using Python for dev...
Abstract. Recently, OpenCL, a new open programming standard for GPGPU programming, has become availa...
Graphic Processing Units (GPUs) are becoming popular in computational physics. Seeing the increasing...
Compiled languages, such as C++, are conventionally used in the field of high performance computing ...
Compiled languages, such as C++, are conventionally used in the field of high performance computing ...
In this work, we evaluate OpenCL as a programming tool for developing performance-portable applicati...
GPU technologies are the paradigm shift in modern computing. This book will take you through archite...
In this work, we evaluate OpenCL as a programming tool for developing performance-portable applicati...
General purpose GPU based systems are highly attractive as they give potentially massive performance...
Parallel computing becomes a need to perform task as soon as possible. This can be done in two way i...
A lot of effort from academia and industry has been invested in exploring the suitability of low-pow...
Abstract Graphics processing units (GPUs) have tremendous computing power, but are hard to program. ...
This paper explores the performance and energy efficiency of CUDA-enabled GPUs and multi-core SIMD C...
This paper explores the performance and energy efficiency of CUDA-enabled GPUs and multi-core SIMD C...