Warp-Aware Trace Scheduling for GPUs

James A. Jablin
Thomas B. Jablin
Onur Mutlu
Maurice Herlihy

Publication date

November 2014

DOI

10.1145/2628071.2628101

Abstract

GPU performance depends not only on thread/warp level parallelism (TLP) but also on instruction-level parallelism (ILP). It is not enough to schedule instructions within basic blocks; it is also necessary to exploit opportunities for ILP optimization beyond branch boundaries. Unfortunately, modern GPUs cannot dynamically carry out such optimizations because they lack hardware branch prediction and cannot speculatively execute instructions beyond a branch. We propose to circumvent these limitations by adapting Trace Scheduling, a technique originally developed for microcode optimization. Trace Scheduling divides code into traces (or paths), and optimizes each trace in a context-independent way. Adapting Trace Scheduling to GPU code requi...
