Graphics Processing Units (GPUs) are growing increasingly popular as general-purpose compute accelerators. GPUs are best suited to applications with abundant data parallelism, in which the computation expressed as a single thread can be applied over a large set of data items. One key constraint on application performance is that the underlying hardware is single-instruction, multiple-data (SIMD) hardware, which requires the parallel threads to execute their instructions in lock-step. The benefits of lock-step execution can be seriously degraded if the threads diverge (because of memory or branches). Specifically, in the case of memory, the addresses from each thread in a SIMD wavefront/warp must be co...
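As a minimal sketch of the coalescing constraint described above (hypothetical kernels written for illustration, not taken from any of the papers quoted here), the two CUDA kernels below differ only in their indexing: in the first, consecutive threads of a warp touch consecutive addresses, so the hardware can merge the warp's loads into a few memory transactions; in the second, each thread's access is strided, scattering the warp's addresses and degrading throughput.

```cuda
#include <cuda_runtime.h>

// Coalesced: thread i reads element i, so a 32-thread warp touches
// 32 consecutive words that merge into one (or a few) transactions.
__global__ void copy_coalesced(const float *in, float *out, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) out[i] = in[i];
}

// Strided: consecutive threads read addresses 'stride' elements apart,
// so the warp's loads hit scattered locations and cannot be merged.
__global__ void copy_strided(const float *in, float *out, int n, int stride) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    int j = (i * stride) % n;  // scatter the warp's addresses
    if (i < n) out[i] = in[j];
}

int main() {
    const int n = 1 << 20;
    float *in, *out;
    cudaMalloc(&in, n * sizeof(float));
    cudaMalloc(&out, n * sizeof(float));

    dim3 block(256), grid((n + block.x - 1) / block.x);
    copy_coalesced<<<grid, block>>>(in, out, n);
    copy_strided<<<grid, block>>>(in, out, n, 32);
    cudaDeviceSynchronize();

    cudaFree(in);
    cudaFree(out);
    return 0;
}
```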
Thread-parallel hardware, such as Graphics Processing Units (GPUs), greatly outperforms CPUs in provid...
Programs developed under the Compute Unified Device Architecture (CUDA) obtain the highest performance rate...
High-throughput architectures rely on high thread-level parallelism (TLP) to hide execution latencie...
General-purpose Graphics Processing Units (GPGPUs) are an important class of architectures that offe...
In a GPU, all threads within a warp execute the same instruction in lockstep. For a memory ...
Many applications with regular parallelism have been shown to benefit from using Graphics Processing...
Single-Instruction Multiple-Thread (SIMT) micro-architectures implemented in G...
Manycore accelerators such as graphics processor units (GPUs) organize processing units into single-...
There has been tremendous growth in the use of Graphics Processing Units (GPUs) for the acceleratio...
Graphics processing units (GPUs) have become prevalent in modern computing systems. While their high...
Recent advances in graphics processing units (GPUs) have resulted in massively parallel hardware tha...
GPUs rely heavily on massive multi-threading to achieve high throughput. The massive multi-threadin...
Massively parallel processing devices, like Graphics Processing Units (GPUs), have the ability to ac...