GPU Code Optimization using Abstract Kernel Emulation and Sensitivity Analysis

Hong, Changwan
Sukumaran-Rajam, Aravind
Kim, Jinsung
Rawat, Prashant,
Krishnamoorthy, Sriram
Pouchet, Louis-Noël
Rastello, Fabrice
Sadayappan, Ponnuswamy

Open PDF

Open link

Publication date

June 2018

DOI

10.1145/3192366.3192397

Publisher

Association for Computing Machinery (ACM)

Abstract

International audienceIn this paper, we develop an approach to GPU kernel optimization by focusing on identification of bottleneck resources and determining optimization parameters that can alleviate the bottleneck. Performance modeling for GPUs is done by abstract kernel emulation along with latency/gap modeling of resources. Sensitivity analysis with respect to resource latency/gap parameters is used to predict the bottleneck resource for a given kernel’s execution. The utility of the bottleneck analysis is demonstrated in two contexts: 1) Coupling the new bottleneck-driven optimization strategy with the OpenTuner auto-tuner: experimental results on all kernels from the Rodinia suite and GPU tensor contraction kernels from the NWChem comp...

Extracted data

We use cookies to provide a better user experience.

Data Protection

GPU Code Optimization using Abstract Kernel Emulation and Sensitivity Analysis

Abstract

Extracted data

GPU Code Optimization using Abstract Kernel Emulation and Sensitivity Analysis

Abstract

Extracted data

Related items

Related items