Optimizations for High Performance Computing on General Purpose GPUS

Singh, Ranvijay

Publication date

January 2018

Publisher

Purdue University (bepress)

Abstract

High-performance computing is increasingly done on parallel machines such as GPUs. In my work, I address two major kinds of optimization: block size tuning and mixed-precision tuning. Block size tuning involves selecting an optimal block size for CUDA kernels, in which threads of execution are grouped into blocks. Earlier techniques include autotuning, which requires multiple kernel executions, and NVIDIA's Occupancy Calculator, which gives multiple possible solutions, none of which may be the actual optimum. My technique uses a support vector regression (SVR) model based on static kernel features as well as dynamic features to predict an optimal block size. It is evaluated on 89 kernels from 10 different applications. The second optimization...
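To illustrate why an occupancy-based calculation can return several candidate block sizes rather than a single answer, the following is a minimal Python sketch of a simplified theoretical-occupancy model. It is not NVIDIA's Occupancy Calculator and not the SVR-based method described in the abstract. The per-multiprocessor limits (2048 threads, 32 resident blocks, 65536 registers, 48 KB shared memory) and the function names `theoretical_occupancy` and `best_block_sizes` are assumptions chosen for illustration, and the model ignores register and shared-memory allocation granularity.

```python
def theoretical_occupancy(block_size, regs_per_thread, smem_per_block,
                          max_threads_per_sm=2048, max_blocks_per_sm=32,
                          regs_per_sm=65536, smem_per_sm=49152, warp_size=32):
    """Fraction of the warp slots on one multiprocessor that can be resident.

    All hardware limits are hypothetical defaults, not values for a
    specific GPU. Allocation granularity is ignored.
    """
    warps_per_block = -(-block_size // warp_size)  # ceiling division
    max_warps_per_sm = max_threads_per_sm // warp_size

    # Resident blocks allowed by each resource, taken independently.
    limit_warps = max_warps_per_sm // warps_per_block
    if regs_per_thread > 0:
        regs_per_block = regs_per_thread * warps_per_block * warp_size
        limit_regs = regs_per_sm // regs_per_block
    else:
        limit_regs = max_blocks_per_sm
    if smem_per_block > 0:
        limit_smem = smem_per_sm // smem_per_block
    else:
        limit_smem = max_blocks_per_sm

    # The tightest resource decides how many blocks fit.
    blocks = min(max_blocks_per_sm, limit_warps, limit_regs, limit_smem)
    return blocks * warps_per_block / max_warps_per_sm


def best_block_sizes(regs_per_thread, smem_per_block,
                     candidates=range(32, 1025, 32)):
    """Return every candidate block size that reaches the highest occupancy."""
    occupancy = {b: theoretical_occupancy(b, regs_per_thread, smem_per_block)
                 for b in candidates}
    top = max(occupancy.values())
    return [b for b, o in occupancy.items() if o == top], top


if __name__ == "__main__":
    # Hypothetical kernel: 32 registers per thread, no shared memory.
    sizes, top = best_block_sizes(regs_per_thread=32, smem_per_block=0)
    print(sizes, top)
```

Under these assumed limits, several block sizes tie at the highest theoretical occupancy, and the model alone cannot say which of them runs fastest. Choosing among them otherwise requires timing each candidate, which is the repeated kernel execution that autotuning incurs.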
