The high computational throughput of modern graphics processing units (GPUs) make them the de-facto architecture for high-performance computing applications. However, to achieve peak performance, GPUs require highly parallel workloads, as well as memory access patterns that exhibit good locality of reference. As a result, many state-of-the-art algorithms and data structures designed for GPUs sacrifice work-optimality to achieve the necessary parallelism. Furthermore, some abstract data types are avoided completely due to there being no corresponding data structure that performs well on the GPU. One such abstract data type is the priority queue. Many well-known algorithms rely on priority queue operations as a building block. While various p...
The computing power of current Graphical Processing Units (GPUs) has increased rapidly over the year...
In this paper, we present a novel approach for parallel sorting on stream processing architectures. ...
In this paper, we present a novel approach for par-allel sorting on stream processing architectures....
The concurrent priority queue is one of the shared memory data structures that can be dynamically ma...
Graphics processing units (GPUs) are increasingly applied to accelerate tasks such as graph problems...
The massive parallelism provided by general-purpose GPUs (GPGPUs) possessing numerous compute thread...
Modern Graphics Processing Units (GPUs) provide high computation power at low costs and have been de...
The Graphics Processing Unit (GPU) has become a mainstream computing platform for a wide range of ap...
Mechanisms for improving the execution efficiency of graph algorithms on Data-Parallel Architectures...
General purpose programming on the graphics processing units (GPGPU) has received a lot of attention...
Graphics Processing Units (GPUs) are a fast evolving architecture. Over the last decade their progra...
We present a parallel priority data structure that improves the running time of certain algorithms f...
Massively parallel, throughput-oriented systems such as graphics processing units (GPUs) offer high ...
Finding the shortest paths from a single source to all other vertices is a fundamental method used i...
We present four CUDA based parallel implementations of the Space-Saving algorithm for determining fr...
The computing power of current Graphical Processing Units (GPUs) has increased rapidly over the year...
In this paper, we present a novel approach for parallel sorting on stream processing architectures. ...
In this paper, we present a novel approach for par-allel sorting on stream processing architectures....
The concurrent priority queue is one of the shared memory data structures that can be dynamically ma...
Graphics processing units (GPUs) are increasingly applied to accelerate tasks such as graph problems...
The massive parallelism provided by general-purpose GPUs (GPGPUs) possessing numerous compute thread...
Modern Graphics Processing Units (GPUs) provide high computation power at low costs and have been de...
The Graphics Processing Unit (GPU) has become a mainstream computing platform for a wide range of ap...
Mechanisms for improving the execution efficiency of graph algorithms on Data-Parallel Architectures...
General purpose programming on the graphics processing units (GPGPU) has received a lot of attention...
Graphics Processing Units (GPUs) are a fast evolving architecture. Over the last decade their progra...
We present a parallel priority data structure that improves the running time of certain algorithms f...
Massively parallel, throughput-oriented systems such as graphics processing units (GPUs) offer high ...
Finding the shortest paths from a single source to all other vertices is a fundamental method used i...
We present four CUDA based parallel implementations of the Space-Saving algorithm for determining fr...
The computing power of current Graphical Processing Units (GPUs) has increased rapidly over the year...
In this paper, we present a novel approach for parallel sorting on stream processing architectures. ...
In this paper, we present a novel approach for par-allel sorting on stream processing architectures....