Accelerator processors allow energy-efficient computation at high performance, especially for computationintensive applications. There exists a plethora of different accelerator architectures, such as GPUs and the Cell Broadband Engine. Each accelerator has its own programming language, but the recently introduced OpenCL language unifies accelerator programming languages. Hereby, OpenCL achieves functional protability, allowing to reduce the development time of kernels. Functional portability however has limited value without performance portability: the possibility to re-use optimized kernels with good performance. This paper investigates the specificity of code optimizations to accelerator architecture and the severity of lack of performa...
Abstract The architecture of high performance computing systems is becoming more and more heteroge...
Recent developments in processor architecture have settled a shift from sequential processing to par...
AbstractThe architecture of high performance computing systems is becoming more and more heterogeneo...
Accelerator processors allow energy-efficient computation at high performance, especially for comput...
This paper reports on the development of an MPI/OpenCL implementation of LU, an application-level be...
This work describes my solution to the performance portability problem: between CPUs and GPUs in par...
This paper investigates the development of a molecular dynamics code that is highly portable between...
The OpenCL standard allows targeting a large variety of CPU, GPU and accelerator architectures using...
The trend of using co-processors as accelerators to perform certain tasks is rising in the parallel...
During the past decade, accelerators, such as NVIDIA CUDA GPUs and Intel Xeon Phis, have seen an inc...
Abstract. Recently, OpenCL, a new open programming standard for GPGPU programming, has become availa...
As an open, royalty-free framework for writing programs that execute across heterogeneous platforms,...
Open Computing Language (OpenCL) is an open standard for writing portable software for heterogeneous...
The performance portability of OpenCL kernel implementa-tions for common memory bandwidth limited li...
In recent years the use of co-processors to accelerate specific tasks is becoming more common. To si...
Abstract The architecture of high performance computing systems is becoming more and more heteroge...
Recent developments in processor architecture have settled a shift from sequential processing to par...
AbstractThe architecture of high performance computing systems is becoming more and more heterogeneo...
Accelerator processors allow energy-efficient computation at high performance, especially for comput...
This paper reports on the development of an MPI/OpenCL implementation of LU, an application-level be...
This work describes my solution to the performance portability problem: between CPUs and GPUs in par...
This paper investigates the development of a molecular dynamics code that is highly portable between...
The OpenCL standard allows targeting a large variety of CPU, GPU and accelerator architectures using...
The trend of using co-processors as accelerators to perform certain tasks is rising in the parallel...
During the past decade, accelerators, such as NVIDIA CUDA GPUs and Intel Xeon Phis, have seen an inc...
Abstract. Recently, OpenCL, a new open programming standard for GPGPU programming, has become availa...
As an open, royalty-free framework for writing programs that execute across heterogeneous platforms,...
Open Computing Language (OpenCL) is an open standard for writing portable software for heterogeneous...
The performance portability of OpenCL kernel implementa-tions for common memory bandwidth limited li...
In recent years the use of co-processors to accelerate specific tasks is becoming more common. To si...
Abstract The architecture of high performance computing systems is becoming more and more heteroge...
Recent developments in processor architecture have settled a shift from sequential processing to par...
AbstractThe architecture of high performance computing systems is becoming more and more heterogeneo...