Identifying scalar behavior in CUDA kernels

Collange, Sylvain

Publication date

January 2011

Publisher

HAL CCSD

Abstract

We propose a compiler analysis pass for programs expressed in the Single Program, Multiple Data (SPMD) programming model. It statically identifies several kinds of regular patterns that can occur between adjacent threads, including common computations, memory accesses at consecutive locations or at the same location, and uniform control flow. This knowledge can be exploited by SPMD compilers targeting SIMD architectures. We present a compiler pass developed within the Ocelot framework that performs this analysis on NVIDIA CUDA programs at the PTX intermediate language level. Results are compared with optima obtained by simulation of several sets of CUDA benchmarks.
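To make the patterns named in the abstract concrete, the following is a minimal C++ sketch of how a static analysis might tag each value as identical across adjacent threads, varying linearly with the thread index, or unknown. It is an illustration only, not the Ocelot pass itself: the names `Tag`, `Kind`, `add`, `mulConst`, `mul` and `classifyAccess` are hypothetical, and the sketch assumes a value can be modelled as base + stride * tid and that byte addresses are classified against a known element size.

```cpp
#include <cstdint>

// Hypothetical abstract value: how a register varies across adjacent threads.
// Uniform: same value in every thread. Affine: base + stride * tid.
enum class Kind { Uniform, Affine, Unknown };

struct Tag {
    Kind kind;
    std::int64_t stride;  // meaningful only when kind == Kind::Affine
};

inline Tag uniform() { return Tag{Kind::Uniform, 0}; }
inline Tag unknown() { return Tag{Kind::Unknown, 0}; }
// A stride of zero means the value does not depend on tid, i.e. it is uniform.
inline Tag affine(std::int64_t s) { return s == 0 ? uniform() : Tag{Kind::Affine, s}; }

// Stride of a known tag; a uniform value has stride 0.
inline std::int64_t strideOf(Tag t) { return t.kind == Kind::Affine ? t.stride : 0; }

// Addition: strides add up; anything involving Unknown stays Unknown.
inline Tag add(Tag a, Tag b) {
    if (a.kind == Kind::Unknown || b.kind == Kind::Unknown) return unknown();
    return affine(strideOf(a) + strideOf(b));
}

// Multiplication by a compile-time constant scales the stride.
inline Tag mulConst(Tag a, std::int64_t c) {
    if (a.kind == Kind::Unknown) return unknown();
    return affine(strideOf(a) * c);
}

// General multiplication: only uniform * uniform is kept; the rest is
// conservatively Unknown because the resulting stride is not known statically.
inline Tag mul(Tag a, Tag b) {
    if (a.kind == Kind::Uniform && b.kind == Kind::Uniform) return uniform();
    return unknown();
}

// Memory access patterns named in the abstract, derived from the address tag.
enum class Access { SameLocation, Consecutive, Other };

inline Access classifyAccess(Tag address, std::int64_t elementSize) {
    if (address.kind == Kind::Uniform) return Access::SameLocation;
    if (address.kind == Kind::Affine && address.stride == elementSize)
        return Access::Consecutive;
    return Access::Other;
}

// A branch whose condition is uniform is taken the same way by all threads.
inline bool isUniformBranch(Tag condition) { return condition.kind == Kind::Uniform; }
```

Under these assumptions, an address computed as a uniform base plus `tid * 4` is tagged affine with stride 4, so a 4-byte load through it is classified as an access to consecutive locations, while a load through a uniform address is classified as an access to the same location.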


