Free Launch: Optimizing GPU Dynamic Kernel Launches through Thread Reuse

Guoyang Chen
Xipeng Shen

Publication date

January 2016

Abstract

Supporting dynamic parallelism is important for GPU to benefit a broad range of applications. There are currently two fundamental ways for programs to exploit dynamic parallelism on GPU: a software-based approach with software-managed worklists, and a hardware-based approach through dynamic subkernel launches. Neither is satisfactory. The former is complicated to program and is often subject to some load imbalance; the latter suffers large runtime overhead. In this work, we propose free launch, a new software approach to overcoming the shortcomings of both methods. It allows programmers to use subkernel launches to express dynamic parallelism. It employs a novel compiler-based code transformation named subkernel launch removal to replace ...

