The Model of Computation of CUDA and its Formal Semantics

Habermaier, Axel

Publication date

October 2011

Abstract

We formalize the model of computation of modern graphics cards based on the specification of Nvidia's Compute Unified Device Architecture (CUDA). CUDA programs are executed by thousands of threads concurrently and have access to several different types of memory with unique access patterns and latencies. The underlying hardware uses a single instruction, multiple threads execution model that groups threads into warps. All threads of the same warp execute the program in lockstep. If threads of the same warp execute a data-dependent control flow instruction, control flow might diverge and the different execution paths are executed sequentially. Once all paths complete execution, all threads are executed in parallel again. An operational seman...

Extracted data

We use cookies to provide a better user experience.

Data Protection

The Model of Computation of CUDA and its Formal Semantics

Abstract

Extracted data

The Model of Computation of CUDA and its Formal Semantics

Abstract

Extracted data

Related items

Related items