Simulation and architecture improvements of atomic operations on GPU scratchpad memory

Gert-jan Van Den Braak
Henk Corporaal

Publication date

January 2013

Publisher

IEEE

Abstract

Abstract—GPUs are increasingly used as compute accelera-tors. With a large number of cores executing an even larger number of threads, significant speed-ups can be attained for parallel workloads. Applications that rely on atomic operations, such as histogram and Hough transform, suffer from serialization of threads in case they update the same memory location. Previous work shows that reducing this serialization with software techniques can increase performance by an order of magnitude. We observe, however, that some serialization remains and still slows down these applications. Therefore, this paper proposes to use a hash function in both the addressing of the banks and the locks of the scratchpad memory. To measure the effects of these c...

Extracted data

We use cookies to provide a better user experience.

Data Protection

Simulation and architecture improvements of atomic operations on GPU scratchpad memory

Abstract

Extracted data

Simulation and architecture improvements of atomic operations on GPU scratchpad memory

Abstract

Extracted data

Related items

Related items