We optimize a visual object detection application (that uses Vision Video Library kernels) and show that OpenCL is a unified programming paradigm that can provide high performance when running on the Ivy Bridge heterogeneous on-chip architecture. We evaluate different mapping techniques and show that running each kernel where it fits best, combined with software pipelining, can provide 1.91 times higher performance and 42% better energy efficiency. We also show how to trade accuracy for energy at runtime. Overall, our application can perform accurate object detection at 40 frames per second (fps) in an energy-efficient manner.
Computer vision applications constitute one of the key drivers for embedded many-core archit...
Many developers have begun to realize that heterogeneous multi-core and many-core computer systems c...
A desired trend within high energy physics is to increase particle accelerator luminosities, leading...
Abstract—Consumers of personal devices such as desktops, tablets, or smart phones run applications b...
In this paper we present our experiences with the implementation of an object detector using OpenCL....
Heterogeneous multicore architectures with CPU and add-on GPUs or streaming processors are now widel...
Abstract—In this paper, we parallelize and optimize the popular feature detection algorithms, i.e. S...
In this paper, we examined heterogeneous architectures, for their suitability to run the scale invar...
Recent developments in embedded processors have enabled heterogeneous computing on mobile devices us...
Abstract. Heterogeneous computing has become prevalent due to the computing power and low cost of G...
The use of Computer Vision in programmable mobile devices could lead to novel and creative applicati...
Recent developments in processor architecture have settled a shift from sequential processing to par...
Heterogeneous systems have gained popularity due to the rapid growth in data and the need for proces...