We study the asynchronous stochastic gradient descent algorithm for distributed training over n workers whose computation and communication frequencies vary over time. In this algorithm, workers compute stochastic gradients in parallel at their own pace and return them to the server without any synchronization. Existing convergence rates of this algorithm for non-convex smooth objectives depend on the maximum gradient delay τ_{max} and show that an ϵ-stationary point is reached after O(σ^2 ϵ^{−2} + τ_{max} ϵ^{−1}) iterations, where σ^2 denotes the variance of the stochastic gradients. In this work (i) we obtain a tighter convergence rate of O(σ^2 ϵ^{−2} + √(τ_{max} τ_{avg}) ϵ^{−1}) without any change to the algorithm, where τ_{avg} is the average delay …
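To make the setting concrete, the following is a minimal, hypothetical simulation sketch of asynchronous SGD with a parameter server: each worker holds a possibly stale copy of the parameters, finishes at its own pace, returns a stochastic gradient computed at that stale copy, and then pulls the current parameters. The quadratic objective, noise level, worker speeds, and step size are illustrative assumptions and not taken from the paper.

```python
import numpy as np

# Sketch of asynchronous SGD with a parameter server (assumed setup).
# Illustrative objective: f(x) = 0.5 * ||x||^2, so grad f(x) = x,
# with additive Gaussian gradient noise of variance sigma^2.

rng = np.random.default_rng(0)
d, n_workers, n_steps = 10, 4, 2000
lr, sigma = 0.05, 0.1

x = rng.normal(size=d)                              # server parameters
worker_copy = [x.copy() for _ in range(n_workers)]  # stale iterates held by workers
worker_step = [0] * n_workers                       # server step at which each copy was pulled
delays = []

for t in range(n_steps):
    # Heterogeneous speeds (assumed): slower workers finish less often.
    w = rng.choice(n_workers, p=[0.4, 0.3, 0.2, 0.1])

    # Worker w returns a stochastic gradient evaluated at its stale copy.
    grad = worker_copy[w] + sigma * rng.normal(size=d)

    delays.append(t - worker_step[w])  # gradient delay tau_t of this update
    x -= lr * grad                     # server applies the update without synchronization

    # Worker w pulls the fresh parameters and starts its next computation.
    worker_copy[w] = x.copy()
    worker_step[w] = t + 1

print(f"final ||grad f(x)|| = {np.linalg.norm(x):.4f}")
print(f"tau_max = {max(delays)}, tau_avg = {np.mean(delays):.2f}")
```

In this toy run the single slow worker inflates τ_{max} while most updates arrive with small delay, so τ_{avg} stays much smaller, which is the gap the refined rate above exploits.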