Asynchronous distributed algorithms are a popular way to reduce synchronization costs in large-scale optimization, and in particular for neural network training. However, for nonsmooth and nonconvex objectives, few convergence guarantees exist beyond cases where closed-form proximal operator solutions are available. Since training the most popular deep neural networks amounts to optimizing nonsmooth and nonconvex objectives, there is a pressing need for such convergence guarantees. In this paper, we analyze for the first time the convergence of stochastic asynchronous optimization for this general class of objectives. In particular, we focus on stochastic subgradient methods allowing for block variable partitioning, where...
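A minimal sketch of the kind of method this abstract describes, assuming a Hogwild!-style shared-memory setting: each worker owns one block of coordinates, samples a data point, computes a stochastic subgradient of an illustrative nonsmooth objective (L1-regularized hinge loss) restricted to its block, and writes the update back without synchronization. The objective, step size, and helper names are assumptions for illustration, not the paper's exact algorithm.

```python
# Sketch of an asynchronous stochastic subgradient method with block
# variable partitioning (lock-free, shared-memory; illustrative only).
import threading
import numpy as np

def subgradient_block(x, sample, block):
    """Stochastic subgradient of an L1-regularized hinge loss,
    restricted to the coordinates in `block` (illustrative nonsmooth objective)."""
    a, y = sample                          # feature vector, label in {-1, +1}
    g = np.zeros_like(x)
    if 1.0 - y * (a @ x) > 0.0:            # hinge-loss subgradient
        g[block] = -y * a[block]
    g[block] += 0.1 * np.sign(x[block])    # subgradient of 0.1 * ||x||_1
    return g[block]

def worker(x, data, block, steps, lr):
    rng = np.random.default_rng()
    for _ in range(steps):
        sample = data[rng.integers(len(data))]
        # stale read of the shared iterate, lock-free write to this block
        x[block] -= lr * subgradient_block(x.copy(), sample, block)

def async_block_subgradient(data, dim, n_workers=4, steps=1000, lr=1e-2):
    x = np.zeros(dim)                                    # shared iterate
    blocks = np.array_split(np.arange(dim), n_workers)   # partition the coordinates
    threads = [threading.Thread(target=worker, args=(x, data, b, steps, lr))
               for b in blocks]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return x

# Usage on synthetic data.
rng = np.random.default_rng(0)
A = rng.normal(size=(200, 20))
labels = np.sign(rng.normal(size=200))
x_hat = async_block_subgradient(list(zip(A, labels)), dim=20)
```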
Stochastic Gradient Descent (SGD) is very useful in optimization problems with high-dimensional non-...
This paper proposes a new family of algorithms for training neural networks (NNs). These...
With increasing data and model complexities, the time required to train neural networks has become p...
Deep neural network models can achieve greater performance in numerous machine learning tasks by rai...
Stochastic Gradient Descent (SGD) is a fundamental algorithm in machine learning, representing the o...
This paper explores asynchronous stochastic optimization for sequence training of deep neural netwo...
This paper proposes an efficient asynchronous stochastic second order learning algorithm for distrib...
Speeding up gradient based methods has been a subject of interest over the past years with many prac...
The widely-adopted practice is to train deep learning models with specialized hardware accelerators,...
We provide the first theoretical analysis on the convergence rate of asynchronous mini-batch gradie...
We develop a Distributed Event-Triggered Stochastic GRAdient Descent (DETSGRAD) algorithm for solvin...
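The snippet above only names DETSGRAD; the following is a minimal single-process simulation of the general event-triggered idea behind that name, under assumptions of my own (local quadratic objectives, a fixed broadcast threshold, averaging of the last broadcast copies). It is not the paper's algorithm, only an illustration of how event-triggered communication reduces the number of parameter exchanges compared with communicating at every step.

```python
# Illustrative event-triggered distributed SGD: agents broadcast their
# parameters only when they have drifted from the last broadcast copy
# by more than a threshold (all problem details are placeholders).
import numpy as np

def event_triggered_sgd(n_agents=4, dim=10, steps=500, lr=0.05, threshold=0.1, seed=0):
    rng = np.random.default_rng(seed)
    targets = rng.normal(size=(n_agents, dim))    # each agent's local optimum
    x = np.zeros((n_agents, dim))                 # local iterates
    last_broadcast = np.zeros((n_agents, dim))    # last value each agent sent
    broadcasts = 0
    for _ in range(steps):
        for i in range(n_agents):
            # noisy gradient of the local quadratic 0.5 * ||x_i - target_i||^2
            grad = (x[i] - targets[i]) + 0.1 * rng.normal(size=dim)
            # consensus step uses the *broadcast* copies, not the true iterates
            x[i] -= lr * grad + lr * (x[i] - last_broadcast.mean(axis=0))
            # event trigger: broadcast only when the local drift is large enough
            if np.linalg.norm(x[i] - last_broadcast[i]) > threshold:
                last_broadcast[i] = x[i].copy()
                broadcasts += 1
    return x, broadcasts

x_final, sent = event_triggered_sgd()
print(f"broadcasts sent: {sent} (vs. {4 * 500} with per-step communication)")
```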
Parallel implementations of stochastic gradient descent (SGD) have received significant research att...