OpenCNN: A Winograd Minimal Filtering Algorithm Implementation in CUDA

López Castro, Roberto
Andrade, Diego
Fraguela, Basilio B.

Open PDF

Open link

Publication date

January 2021

DOI

10.3390/math9172033

Publisher

MDPI AG

Language

English

Abstract

[Abstract] Improving the performance of the convolution operation has become a key target for High Performance Computing (HPC) developers due to its prevalence in deep learning applied mainly to video processing. The improvement is being pushed by algorithmic and implementation innovations. Algorithmically, the convolution can be solved as it is mathematically enunciated, but other methods allow to transform it into a Fast Fourier Transform (FFT) or a GEneral Matrix Multiplication (GEMM). In this latter group, the Winograd algorithm is a state-of-the-art variant that is specially suitable for smaller convolutions. In this paper, we present openCNN, an optimized CUDA C++ implementation of the Winograd convolution algorithm. Our approach achi...

Extracted data

We use cookies to provide a better user experience.

Data Protection

OpenCNN: A Winograd Minimal Filtering Algorithm Implementation in CUDA

Abstract

Extracted data

OpenCNN: A Winograd Minimal Filtering Algorithm Implementation in CUDA

Abstract

Extracted data

Related items

Related items