Programming models such as CUDA and OpenCL allow the programmer to specify the independence of threads, effectively removing ordering constraints. Still, parallel architectures such as the graphics processing unit (GPU) do not exploit the potential for data locality enabled by this independence. Therefore, programmers are required to manually perform data-locality optimisations such as memory coalescing or loop tiling. This work makes a case for locality-aware thread scheduling: re-ordering threads automatically for better locality to improve the programmability of multi-threaded processors. In particular, we analyse the potential of locality-aware thread scheduling for GPUs, considering, among others, cache performance, memory coalescing and ...
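The coalescing optimisation referred to above is, at its core, a question of how thread indices are mapped to memory addresses. The minimal CUDA sketch below contrasts an uncoalesced thread-to-data mapping with a coalesced one, the kind of re-ordering a locality-aware thread scheduler would aim to apply automatically; kernel names, sizes and the stride are illustrative assumptions, not details taken from the paper.

```cuda
#include <cuda_runtime.h>

// Uncoalesced: the permuted mapping makes neighbouring threads in a warp
// touch elements that lie n/stride apart in memory, so each warp's loads
// are spread over many separate memory transactions.
__global__ void copy_strided(const float *in, float *out, int n, int stride) {
    int tid = blockIdx.x * blockDim.x + threadIdx.x;
    if (tid < n) {
        int i = (tid % stride) * (n / stride) + tid / stride;
        out[i] = in[i];
    }
}

// Coalesced: consecutive threads touch consecutive addresses, so the 32
// accesses of a warp collapse into a few wide transactions.
__global__ void copy_coalesced(const float *in, float *out, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) out[i] = in[i];
}

int main() {
    const int n = 1 << 20;              // illustrative problem size
    float *in, *out;
    cudaMalloc(&in, n * sizeof(float));
    cudaMalloc(&out, n * sizeof(float));

    dim3 block(256), grid((n + 255) / 256);
    copy_strided<<<grid, block>>>(in, out, n, 32);   // poor locality
    copy_coalesced<<<grid, block>>>(in, out, n);     // good locality
    cudaDeviceSynchronize();

    cudaFree(in);
    cudaFree(out);
    return 0;
}
```

On most GPUs the second kernel moves the same data noticeably faster; that gap is what a locality-aware scheduler tries to close without programmer intervention.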
In this paper, we propose a pioneering work on designing and programming B&B al...
As GPUs' compute capabilities grow, their memory hierarchy increasingly becomes a bottleneck. C...
Performance characteristics of irregular programs on parallel architectures were studied. Results in...
Programming models such as CUDA and OpenCL allow the programmer to specify the independence of threa...
Massively parallel processing devices, like Graphics Processing Units (GPUs), have the ability to ac...
The massive parallelism provided by general-purpose GPUs (GPGPUs) possessing numerous compute thread...
Graphics Processing Units (GPUs) run thousands of parallel threads and achieve high Memory Level Par...
Lightweight threads have become a common abstraction in the field of programming languages and opera...
Thesis (Ph.D.), University of Rochester, Department of Computer Science, 2017. On modern processors, ...
Modern computer architectures expose an increasing number of parallel features supported by complex ...
The Graphics Processing Unit (GPU) has become an increasingly important component in high-performance computi...
This paper describes a method to improve the cache locality of sequential programs by scheduling fin...
Enhancing the match between software executions and hardware features is key to computing efficiency...
GPUs are an increasingly popular implementation platform for a variety of general purpose applicatio...
Graphics Processing Units (GPUs) are designed primarily to execute multimedia, and game re...