Scratchpad Sharing in GPUs

Jatala, Vishwesh
Anantpur, Jayvant
Karkare, Amey

Publication date

January 2017

DOI

Abstract

General-Purpose Graphics Processing Unit (GPGPU) applications exploit on-chip scratchpad memory available in the Graphics Processing Units (GPUs) to improve performance. The amount of thread level parallelism (TLP) present in the GPU is limited by the number of resident threads, which in turn depends on the availability of scratchpad memory in its streaming multiprocessor (SM). Since the scratchpad memory is allocated at thread block granularity, part of the memory may remain unutilized. In this article, we propose architectural and compiler optimizations to improve the scratchpad memory utilization. Our approach, called Scratchpad Sharing, addresses scratchpad under-utilization by launching additional thread blocks in each SM. These thread...

Extracted data

We use cookies to provide a better user experience.

Data Protection

Scratchpad Sharing in GPUs

Abstract

Extracted data

Scratchpad Sharing in GPUs

Abstract

Extracted data

Related items

Related items