A Tool for Automatically Suggesting Source-Code Optimizations for Complex GPU Kernels

Saeed Taheri
Apan Qasem
Martin Burtscher

Publication date

January 2016

Abstract

Abstract- Future computing systems, from handhelds to su-percomputers, will undoubtedly be more parallel and heter-ogeneous than today’s systems to provide more performance and energy efficiency. Thus, GPUs are increasingly being used to accelerate general-purpose applications, including applications with data-dependent, irregular control flow and memory access patterns. However, the growing com-plexity, exposed memory hierarchy, incoherence, heteroge-neity, and parallelism will make accelerator-based systems progressively more difficult to program. In the foreseeable future, the vast majority of programmers will no longer be able to extract additional performance or energy-savings from next-generation systems because the programming will b...

Extracted data

We use cookies to provide a better user experience.

Data Protection

A Tool for Automatically Suggesting Source-Code Optimizations for Complex GPU Kernels

Abstract

Extracted data

A Tool for Automatically Suggesting Source-Code Optimizations for Complex GPU Kernels

Abstract

Extracted data

Related items

Related items