Existing approaches that partition a convolutional neural network (CNN) onto multiple accelerators and pipeline the training computations avoid or limit the use of stale weights, and they either underutilize accelerators or increase memory footprint. We explore the impact of stale weights on the statistical efficiency and performance in a pipelined backpropagation scheme that maximizes accelerator utilization and keeps memory overhead modest. We use LeNet-5, AlexNet, VGG and ResNet, and show that when pipelining is limited to early layers in a network, training with stale weights converges and results in models with comparable inference accuracies to those resulting from non-pipelined training. However, when pipelining is deeper in the netw...
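The staleness effect described above can be illustrated with a toy experiment (a hypothetical sketch, not the paper's code): gradient descent on a simple quadratic loss where the gradient is computed from a weight snapshot that is several updates old, mimicking what an early pipeline stage sees before the corresponding backward pass arrives.

```python
# Toy sketch of training with stale weights (hypothetical example, not the
# paper's implementation). Loss is 0.5 * w**2, so the gradient at w is w.
# `staleness` models how many updates behind the pipelined stage's weights are.

def train(staleness, steps=200, lr=0.1):
    w = 5.0            # scalar stand-in for a layer's weights
    history = [w]      # snapshots of past weights, one per update
    for t in range(steps):
        # an early pipeline stage computes its gradient with weights
        # that are `staleness` updates old
        w_stale = history[max(0, t - staleness)]
        grad = w_stale  # d/dw of 0.5 * w**2, evaluated at the stale weights
        w = w - lr * grad
        history.append(w)
    return w

fresh = train(staleness=0)   # conventional, non-pipelined update
stale = train(staleness=3)   # pipelined update with 3-step-old weights
# For modest staleness and learning rate, both runs still converge toward 0,
# consistent with the observation that limited pipelining preserves convergence.
```

Larger staleness or learning rates can destabilize this recursion, which mirrors the paper's finding that pipelining deeper in the network (more staleness) hurts statistical efficiency.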
Training of convolutional neural networks (CNNs) on embedded platforms to support on-device learning...
While deep neural networks (DNNs) have been shown to be successful in several domains like computer visio...
This paper presents a convolutional neural network (CNN) accelerator that can skip zero weights and ...
The growth in size and complexity of convolutional neural networks (CNNs) is forcing the partitionin...
Deep neural network models are commonly used in various real-life applications due to their high pre...
To break the three lockings during the backpropagation (BP) process for neural network training, multipl...
DNNs have been finding a growing number of applications including image classification, speech recog...
This work is focused on the pruning of some convolutional neural networks (CNNs) and improving their...
Deep Neural Networks (DNNs) have begun to permeate all corners of electronic society due to their hi...
Hardware accelerators for neural network inference can exploit common data properties for performanc...
Motivated by the goal of enabling energy-efficient and/or lower-cost hardware implementations of dee...
We provide an efficient implementation of the backpropagation algorithm, specialized to the case whe...
Convolutional neural networks (CNNs) outperform traditional machine learning algorithms across a wid...
Convolutional Neural Networks (CNNs) are brain-inspired computational models designed to recognize p...