Abstract. Recently, OpenCL, a new open programming standard for GPGPU programming, has become available in addition to CUDA. OpenCL can support various compute devices thanks to its more abstract programming framework. Since there is a semantic gap between OpenCL and compute devices, the OpenCL C compiler plays an important role in exploiting the potential of compute devices, and therefore its capability should be clarified. In this paper, the performance of CUDA and OpenCL programs is quantitatively evaluated. First, several CUDA and OpenCL programs performing almost the same computations are developed, and their performance is compared. Then, the main factors causing the performance differences are investigated. The evaluation results suggest th...
Accelerator devices like the General Purpose Graphics Computing Units (GPGPUs) play an important rol...
In the field of high performance computing, GPUs play an important role. However, in order to be abl...
Parallel computing has become necessary to complete tasks as quickly as possible. This can be done in two ways i...
As an open, royalty-free framework for writing programs that execute across heterogeneous platforms,...
Recent developments in processor architecture have settled a shift from sequential processing to par...
The proliferation of heterogeneous computing systems has led to increased interest in parallel archi...
In this work, we evaluate OpenCL as a programming tool for developing performance-portable applicati...
The use of modern, high-performance graphical processing units (GPUs) for acceleration of scientific...
The OpenCL standard allows targeting a large variety of CPU, GPU and accelerator architectures using...
General purpose GPU based systems are highly attractive as they give potentially massive performance...
The proliferation of accelerators, in particular GPUs, over the past decade is impacting the way s...