Convolution is a common operation in deep neural networks (DNNs) and is often responsible for performance bottlenecks during training and inference. Existing approaches to accelerating convolution aim to reduce computational complexity. However, these strategies often enlarge the memory footprint with extra memory accesses, leaving substantial room for performance improvement. This paper presents a novel approach to optimizing memory access for convolution operations, specifically targeting GPU execution. Our approach leverages two optimization techniques to reduce the number of memory operations for convolutions performed along the width and height dimensions. For convolution computations on the width dimen...
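The abstract above is truncated before it describes the two techniques, so the following is only a generic illustration of the memory-reuse idea behind such optimizations, not the paper's actual GPU kernels. The function names (`conv1d_naive`, `conv1d_reuse`) are hypothetical. A naive direct convolution re-reads each input element once per overlapping output window, while a reuse-oriented formulation reads each input element exactly once and scatters it into every output it contributes to — the same principle that register- or shared-memory tiling exploits on a GPU:

```python
import numpy as np

def conv1d_naive(x, w):
    """Direct 1-D correlation: each output re-reads its K-wide input window.

    Returns (output, number_of_input_reads). Reads grow as N * K.
    """
    K = len(w)
    N = len(x) - K + 1
    y = np.empty(N)
    reads = 0
    for i in range(N):
        acc = 0.0
        for k in range(K):
            acc += x[i + k] * w[k]  # one read of x per multiply-accumulate
            reads += 1
        y[i] = acc
    return y, reads

def conv1d_reuse(x, w):
    """Reuse-oriented formulation: each input element is read exactly once
    and contributes to all K outputs whose windows overlap it.

    Returns (output, number_of_input_reads). Reads grow only as len(x).
    """
    K = len(w)
    N = len(x) - K + 1
    y = np.zeros(N)
    reads = 0
    for j, xv in enumerate(x):
        reads += 1  # x[j] is read exactly once
        # scatter x[j] into every output position i with i <= j <= i + K - 1
        for k in range(K):
            i = j - k
            if 0 <= i < N:
                y[i] += xv * w[k]
    return y, reads
```

For a length-10 input and a 3-tap filter, the naive version performs 24 input reads while the reuse version performs 10, with identical results; on a GPU the analogous saving comes from keeping the overlapping window in registers or shared memory instead of re-fetching it from global memory.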
This work is focused on the pruning of some convolutional neural networks (CNNs) and improving their...
Transpose convolution has shown prominence in many deep learning applications. However, transpose co...
The focus of this paper is speeding up the application of convolutional neural networks. While deliv...
The depthwise separable convolution is widely used to reduce the computation overhead of multi-chann...
Convolutional neural network (CNN) is an important deep learning method. The convolution operation t...
Graphics processing units (GPUs) achieve high throughput with hundreds of cores for concurrent exec...
The main contribution of this paper is to show efficient implementations of the convolution-pooling ...
Convolution is the most computationally intensive task of the Convolutional Neural Network (CNN). It...
Convolutional neural networks (CNNs) have recently attracted considerable attention due to their out...
We present an implementation of the overlap-and-save method, a method for the convolution of very lo...
Recently, convolutional neural networks (CNN) have been widely used in image processing and computer...
Artificial intelligence has developed rapidly in recent ye...
The research domain of Multimedia Content Analysis (MMCA) considers all aspects of the automated ext...
Recently, machine learning, especially deep learning, has been a core algorithm to be widely used in...
With the increasing sophistication of image processing algorithms, and because of its low computatio...