Kernels for Multi-Core CPUs

John A. Stratton
Sam S. Stone
Wen-mei W. Hwu

Publication date

July 2015

Abstract

Abstract. CUDA is a data parallel programming model that supports several key abstractions- thread blocks, hierarchical memory and bar-rier synchronization- for writing applications. This model has proven effective in programming GPUs. In this paper we describe a framework called MCUDA, which allows CUDA programs to be executed efficiently on shared memory, multi-core CPUs. Our framework consists of a set of source-level compiler transformations and a runtime system for par-allel execution. Preserving program semantics, the compiler transforms threaded SPMD functions into explicit loops, performs fission to elimi-nate barrier synchronizations, and converts scalar references to thread-local data to replicated vector references. We describe a...

Extracted data

We use cookies to provide a better user experience.

Data Protection

Kernels for Multi-Core CPUs

Abstract

Extracted data

Kernels for Multi-Core CPUs

Abstract

Extracted data

Related items

Related items