OpenACC is a directive based parallel programming library that allows for the easy acceleration of existing C, C++ and Fortran based applications with minimal code modifications. By annotating the bottleneck causing a section of the code with OpenACC directives, the acceleration of the code can be simplified, leading for high portability of performance across different target Graphic Processing Units (GPUs). In this work, the portability of an implemented parallelizable chi-square based pixel similarity measurement algorithm has been evaluated on two consumer and professional grade GPUs. To our best knowledge, this is the first performance evaluation report that utilizes the OpenACC optimization clauses (collapse and tile) on different GPUs...
Abstract. Recently, OpenCL, a new open programming standard for GPGPU programming, has become availa...
In the field of high performance computing, GPUs play an important role. However, in order to be abl...
We present a portable platform, called PIC_ENGINE, for accelerating Particle-In-Cell (PIC) codes on ...
Graphics processing units and similar accelerators have been intensively used in general purpose com...
Abstract—In this paper, we construe key factors in design and evaluation of image processing algorit...
In recent years, GPU computing has been very popular for scientific applications, especially after t...
Connected components labelling (CCL) is one of the basic steps in various image-processing applicati...
Recent developments in processor architecture have settled a shift from sequential processing to par...
GPU-accelerated computing drives current scientific research. Writing fast numeric algorithms for GP...
Nowadays vision systems are used with countless purposes. Moreover, the motion estimation is a disci...
International audienceThis article gives a methodological approach to accelerating an environment of...
In general, a computer program will execute instructions serially. These instructions will be run on...
International audienceOptical flow estimation stands as an essential component for motion detection ...
The process of the creation of texture images derived from a windowed GLCM coupled with the calculat...
Parallel programming is about performance, for otherwise we’d write a sequential program. A problem ...
Abstract. Recently, OpenCL, a new open programming standard for GPGPU programming, has become availa...
In the field of high performance computing, GPUs play an important role. However, in order to be abl...
We present a portable platform, called PIC_ENGINE, for accelerating Particle-In-Cell (PIC) codes on ...
Graphics processing units and similar accelerators have been intensively used in general purpose com...
Abstract—In this paper, we construe key factors in design and evaluation of image processing algorit...
In recent years, GPU computing has been very popular for scientific applications, especially after t...
Connected components labelling (CCL) is one of the basic steps in various image-processing applicati...
Recent developments in processor architecture have settled a shift from sequential processing to par...
GPU-accelerated computing drives current scientific research. Writing fast numeric algorithms for GP...
Nowadays vision systems are used with countless purposes. Moreover, the motion estimation is a disci...
International audienceThis article gives a methodological approach to accelerating an environment of...
In general, a computer program will execute instructions serially. These instructions will be run on...
International audienceOptical flow estimation stands as an essential component for motion detection ...
The process of the creation of texture images derived from a windowed GLCM coupled with the calculat...
Parallel programming is about performance, for otherwise we’d write a sequential program. A problem ...
Abstract. Recently, OpenCL, a new open programming standard for GPGPU programming, has become availa...
In the field of high performance computing, GPUs play an important role. However, in order to be abl...
We present a portable platform, called PIC_ENGINE, for accelerating Particle-In-Cell (PIC) codes on ...