The growth in size and complexity of convolutional neural networks (CNNs) is forcing networks to be partitioned across multiple accelerators during training, with backpropagation computations pipelined over these accelerators. Pipelining results in the use of stale weights. Existing approaches to pipelined training avoid or limit the use of stale weights with techniques that either underutilize accelerators or increase training memory footprint. This paper contributes a pipelined backpropagation scheme that uses stale weights to maximize accelerator utilization while keeping memory overhead modest. It explores the impact of stale weights on statistical efficiency and performance using four CNNs (LeNet-5, AlexNet, VGG, and ResNet) and shows...
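To make the source of staleness concrete, the sketch below simulates gradient descent in which the update applied at step t uses a gradient computed with weights that are D steps old, where D stands in for the pipeline depth. This is only an illustration of the effect the paper studies, not the paper's pipelined scheme; the names (stale_sgd, loss_grad, pipeline_depth) and the toy linear-regression objective are assumptions introduced here for exposition.

```python
import numpy as np

def loss_grad(w, x, y):
    # Gradient of 0.5 * (w @ x - y)^2 for a linear model.
    return (w @ x - y) * x

def stale_sgd(steps=200, pipeline_depth=4, lr=0.1, dim=8, seed=0):
    """Toy SGD where each update uses weights that are `pipeline_depth` steps old,
    mimicking the staleness introduced by pipelining backpropagation."""
    rng = np.random.default_rng(seed)
    w_true = rng.normal(size=dim)          # target weights generating the data
    w = np.zeros(dim)
    history = [w.copy()]                   # past weight versions, oldest first
    for _ in range(steps):
        x = rng.normal(size=dim)
        y = w_true @ x
        # The gradient is evaluated at a stale weight version, not the current one.
        stale_w = history[max(0, len(history) - 1 - pipeline_depth)]
        w = w - lr * loss_grad(stale_w, x, y)
        history.append(w.copy())
    return np.linalg.norm(w - w_true)

print("final error with stale gradients:", stale_sgd())
print("final error without staleness  :", stale_sgd(pipeline_depth=0))
```

Setting pipeline_depth=0 reduces the loop to ordinary SGD, so comparing the two runs isolates the effect of weight staleness that the paper's scheme must tolerate.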