Asynchronous distributed algorithms are a popular way to reduce synchronization costs in large-scale optimization, and in particular for neural network training. However, for nonsmooth and nonconvex objectives, few convergence guarantees exist beyond cases where closed-form proximal operator solutions are available. As training most popular deep neural networks corresponds to optimizing nonsmooth and nonconvex objectives, there is a pressing need for such convergence guarantees. In this paper, we analyze for the first time the convergence of stochastic asynchronous optimization for this general class of objectives. In particular, we focus on stochastic subgradient methods allowing for block variable partitioning, where the shared model is ...
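To make the setting concrete, below is a minimal sketch (not the paper's implementation) of shared-memory asynchronous stochastic subgradient descent with block variable partitioning: each worker thread repeatedly reads a possibly stale snapshot of the shared model, computes a stochastic subgradient of a nonsmooth nonconvex toy objective, and writes only its own block of coordinates. The clipped-absolute-loss objective, block layout, step size, and thread count are all illustrative assumptions, not choices from the paper.

```python
import threading
import numpy as np

# Synthetic problem setup (all sizes and constants here are illustrative).
n_samples, dim, n_blocks, n_steps, lr = 1024, 64, 4, 20000, 1e-2
rng0 = np.random.default_rng(0)
A = rng0.standard_normal((n_samples, dim))
x_true = rng0.standard_normal(dim)
x_true *= 0.5 / np.linalg.norm(x_true)              # keep residuals mostly inside the clip region
b = A @ x_true + 0.01 * rng0.standard_normal(n_samples)

x = np.zeros(dim)                                    # shared model, updated in place by all workers
blocks = np.array_split(np.arange(dim), n_blocks)    # block variable partitioning

def objective(v):
    """Clipped absolute loss: nonsmooth (|.|) and nonconvex (the clipping)."""
    return np.minimum(np.abs(A @ v - b), 1.0).mean()

def stochastic_subgradient(v, i):
    """A subgradient of min(|a_i^T v - b_i|, 1) at v for one sampled data point."""
    r = A[i] @ v - b[i]
    return np.sign(r) * A[i] if abs(r) < 1.0 else np.zeros(dim)

def worker(block, seed):
    rng = np.random.default_rng(seed)                # per-thread sampling stream
    for _ in range(n_steps):
        snap = x.copy()                              # possibly stale read of the shared model
        g = stochastic_subgradient(snap, rng.integers(n_samples))
        x[block] -= lr * g[block]                    # each worker writes only its own block

threads = [threading.Thread(target=worker, args=(blk, s)) for s, blk in enumerate(blocks)]
for t in threads:
    t.start()
for t in threads:
    t.join()
print(f"objective at 0: {objective(np.zeros(dim)):.4f}  after training: {objective(x):.4f}")
```

The sketch only illustrates the update pattern (stale reads, per-block writes, no synchronization between workers); because of Python's GIL it does not demonstrate actual speed-up, and it omits the momentum term and the probabilistic scheduling model analyzed in the paper.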