Preemptive Thread Block Scheduling with Online Structural Runtime Prediction for Concurrent GPGPU Kernels

Sreepathi Pai
R. Govindarajan
Matthew J. Thazhuthaveetil

Open link

Publication date

January 2014

DOI

10.1145/2628071.2628117

Citation count (estimate)

Abstract

Recent NVIDIA Graphics Processing Units (GPUs) can execute multiple kernels concurrently. On these GPUs, the thread block scheduler (TBS) currently uses the FIFO policy to schedule thread blocks of concurrent kernels. We show that the FIFO policy leaves performance to chance, resulting in significant loss of performance and fairness. To improve performance and fairness, we propose use of the preemptive Shortest Remaining Time First (SRTF) policy instead. Al-though SRTF requires an estimate of runtime of GPU kernels, we show that such an estimate of the runtime can be easily obtained using online profiling and exploiting a simple ob-servation on GPU kernels ’ grid structure. Specifically, we propose a novel Structural Runtime Predictor. Usin...

Extracted data

We use cookies to provide a better user experience.

Data Protection

Preemptive Thread Block Scheduling with Online Structural Runtime Prediction for Concurrent GPGPU Kernels

Abstract

Extracted data

Preemptive Thread Block Scheduling with Online Structural Runtime Prediction for Concurrent GPGPU Kernels

Abstract

Extracted data

Related items

Related items