Many developers have begun to realize that heterogeneous multi-core and many-core computer systems can provide sig-nificant performance opportunities to a range of applica-tions. Typical applications possess multiple components that can be parallelized; developers need to be equipped with proper performance tools to analyze program flow and identify ap-plication bottlenecks. In this paper, we analyze and profile the components of the Speeded Up Robust Features (SURF) Computer Vision algorithm written in OpenCL. Our profil-ing framework is developed using built-in OpenCL API func-tion calls, without the need for an external profiler. We show we can begin to identify performance bottlenecks and performance issues present in individual compone...
Utilizing heterogeneous platforms for computation has become a general trend, making the portability...
Abstract—In this paper, we parallelize and optimize the popular feature detection algorithms, i.e. S...
Parallel programming is vital to fully utilize the multicore architectures that dominate the process...
International audienceComputer vision applications constitute one of the key drivers for embedded ma...
AbstractComputer vision applications constitute one of the key drivers for embedded many-core archit...
Abstract. Heterogeneous computing has become prevalent due to the comput-ing power and low cost of G...
Abstract—Consumers of personal devices such as desktops, tablets, or smart phones run applications b...
Open Computing Language (OpenCL) is emerging as a standard for parallel programming of heterogeneous...
Recent developments in processor architecture have settled a shift from sequential processing to par...
In the last few years, the computing industry has changed its course from ever higher clock speeds t...
In this paper, we examined heterogeneous architectures, for their suitability to run the scale invar...
We optimize a visual object detection application (that uses Vision Video Library kernels) and show ...
Shared memory multicore processor technology is pervasive in mainstream computing. This new architec...
OpenCL (Open Computing Language) is a heterogeneous programming framework for developing application...
In this thesis, several implementations of an image back projection algorithm using Open Computing L...
Utilizing heterogeneous platforms for computation has become a general trend, making the portability...
Abstract—In this paper, we parallelize and optimize the popular feature detection algorithms, i.e. S...
Parallel programming is vital to fully utilize the multicore architectures that dominate the process...
International audienceComputer vision applications constitute one of the key drivers for embedded ma...
AbstractComputer vision applications constitute one of the key drivers for embedded many-core archit...
Abstract. Heterogeneous computing has become prevalent due to the comput-ing power and low cost of G...
Abstract—Consumers of personal devices such as desktops, tablets, or smart phones run applications b...
Open Computing Language (OpenCL) is emerging as a standard for parallel programming of heterogeneous...
Recent developments in processor architecture have settled a shift from sequential processing to par...
In the last few years, the computing industry has changed its course from ever higher clock speeds t...
In this paper, we examined heterogeneous architectures, for their suitability to run the scale invar...
We optimize a visual object detection application (that uses Vision Video Library kernels) and show ...
Shared memory multicore processor technology is pervasive in mainstream computing. This new architec...
OpenCL (Open Computing Language) is a heterogeneous programming framework for developing application...
In this thesis, several implementations of an image back projection algorithm using Open Computing L...
Utilizing heterogeneous platforms for computation has become a general trend, making the portability...
Abstract—In this paper, we parallelize and optimize the popular feature detection algorithms, i.e. S...
Parallel programming is vital to fully utilize the multicore architectures that dominate the process...