1 Optimization of Linked List Prefix Computations on Multithreaded GPUs Using CUDA

Zheng Wei
Joseph Jaja

Publication date

December 2014

Abstract

We present a number of optimization techniques to compute prefix sums on linked lists and implement them on multithreaded GPUs using CUDA. Prefix computations on linked structures involve in general highly irregular fine grain memory accesses that are typical of many computations on linked lists, trees, and graphs. While the current generation of GPUs provides substantial computational power and extremely high bandwidth memory accesses, they may appear at first to be primarily geared toward streamed, highly data parallel computations. In this paper, we introduce an optimized multithreaded GPU algorithm for prefix computations through a randomization process that reduces the problem to a large number of fine-grain computations. We map these ...

Extracted data

We use cookies to provide a better user experience.

Data Protection

1 Optimization of Linked List Prefix Computations on Multithreaded GPUs Using CUDA

Abstract

Extracted data

1 Optimization of Linked List Prefix Computations on Multithreaded GPUs Using CUDA

Abstract

Extracted data

Related items

Related items