Programs developed under the Compute Unified Device Architecture obtain the highest performance rate, when the exploitation of hardware resources on a Graphics Processing Unit (GPU) is maximized. In order to achieve this purpose, load balancing among threads and a high value of processor occupancy, i.e. the ratio of active threads, are indispensable. However, in certain applications, an optimally balanced implementation may limit the occupancy, due to a greater need for registers and shared memory. This is the case of the Fast Generalized Hough Transform (Fast GHT), an image-processing technique for localizing an object within an image. In this work, we present two parallelization alternatives for the Fast GHT, one that optimizes the load b...
General-purpose Graphics Processing Units (GPGPUs) are an important class of architectures that offe...
In the last three years, GPUs are more and more being used for general purpose applications instead ...
Cet article présente une évaluation des performances de 14 architectures actuelles : 8 processeurs ...
The Hough transform is a commonly used algorithm to detect lines and other features in images. It is...
In this paper we present GPU-Quicksort, an efficientQuicksort algorithm suitable for highly parallel...
This paper presents a novel optimizing compiler for general purpose computation on graphics processi...
Graphics Processing Units (GPUs) are growing increasingly popular as general purpose compute acceler...
Graphic processors are becoming faster and faster. Computational power within graphic processing uni...
This paper presents a novel optimizing compiler for general purpose computation on graphics processi...
Graphics Processing Units (GPUs) are a fast evolving architecture. Over the last decade their progra...
We present an efficient model to analyze and improve the performance of general-purpose computation ...
In recent years, GPGPUs have experienced tremendous growth as general-purpose and high-throughput co...
The research domain of Multimedia Content Analysis (MMCA) considers all aspects of the automated ext...
Designing parallel models that fully utilize the computation capabilities of Graphics Processing Uni...
General-purpose Graphics Processing Units (GPGPUs) are an important class of architectures that offe...
In the last three years, GPUs are more and more being used for general purpose applications instead ...
Cet article présente une évaluation des performances de 14 architectures actuelles : 8 processeurs ...
The Hough transform is a commonly used algorithm to detect lines and other features in images. It is...
In this paper we present GPU-Quicksort, an efficientQuicksort algorithm suitable for highly parallel...
This paper presents a novel optimizing compiler for general purpose computation on graphics processi...
Graphics Processing Units (GPUs) are growing increasingly popular as general purpose compute acceler...
Graphic processors are becoming faster and faster. Computational power within graphic processing uni...
This paper presents a novel optimizing compiler for general purpose computation on graphics processi...
Graphics Processing Units (GPUs) are a fast evolving architecture. Over the last decade their progra...
We present an efficient model to analyze and improve the performance of general-purpose computation ...
In recent years, GPGPUs have experienced tremendous growth as general-purpose and high-throughput co...
The research domain of Multimedia Content Analysis (MMCA) considers all aspects of the automated ext...
Designing parallel models that fully utilize the computation capabilities of Graphics Processing Uni...
General-purpose Graphics Processing Units (GPGPUs) are an important class of architectures that offe...
In the last three years, GPUs are more and more being used for general purpose applications instead ...
Cet article présente une évaluation des performances de 14 architectures actuelles : 8 processeurs ...