SGD with Momentum (SGDM) is a widely used family of algorithms for large-scale optimization of machine learning problems. Yet, when optimizing generic convex functions, no advantage is known for any SGDM algorithm over plain SGD. Moreover, even the most recent results require changes to the SGDM algorithms, like averaging of the iterates and a projection onto a bounded domain, which are rarely used in practice. In this paper, we focus on the convergence rate of the last iterate of SGDM. For the first time, we prove that for any constant momentum factor, there exists a Lipschitz and convex function for which the last iterate of SGDM suffers from a suboptimal convergence rate of $\Omega(\frac{\ln T}{\sqrt{T}})$ after $T$ iterations. Based on ...
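Since the abstract centers on the behavior of the last iterate of SGDM, a small sketch may help fix notation. Below is a minimal heavy-ball-style SGDM loop on the Lipschitz convex function $f(x) = |x|$ with noisy subgradients, tracking both the last iterate and the running average of the iterates. The step-size schedule, momentum constant, and test function are illustrative assumptions, not constructions from the paper; in particular, the $\Omega(\frac{\ln T}{\sqrt{T}})$ lower-bound instance and the FTRL-based variants with increasing momentum are not reproduced here.

```python
import numpy as np

# Minimal sketch of SGD with a constant momentum factor (heavy-ball form).
# All constants and the objective are illustrative, not from the paper.

rng = np.random.default_rng(0)

def noisy_subgradient(x, noise_std=1.0):
    # Subgradient of f(x) = |x|, corrupted by zero-mean Gaussian noise.
    return np.sign(x) + rng.normal(0.0, noise_std)

def sgdm(x0, T, beta=0.9, eta0=1.0):
    x, m, avg = x0, 0.0, 0.0
    for t in range(1, T + 1):
        g = noisy_subgradient(x)
        m = beta * m + g                  # momentum buffer
        x = x - (eta0 / np.sqrt(t)) * m   # O(1/sqrt(t)) step size
        avg += (x - avg) / t              # running average of the iterates
    return x, avg                         # last iterate vs. averaged iterate

last, averaged = sgdm(x0=5.0, T=10_000)
print(f"f(last iterate)     = {abs(last):.4f}")
print(f"f(averaged iterate) = {abs(averaged):.4f}")
```

Comparing the two returned points illustrates the distinction the abstract draws: classical guarantees apply to the averaged iterate, while practice typically uses the last iterate, whose rate is the subject of the lower bound above.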
We aim to make stochastic gradient descent (SGD) adaptive to (i) the noise $\sigma^2$ in the stochas...
The article examines in some detail the convergence rate and mean-square-error performance of moment...
The vast majority of convergence rates analysis for stochastic gradient methods in the literature fo...
Recently, Stochastic Gradient Descent (SGD) and its variants have become the dominant methods in the...
Stochastic Gradient Descent (SGD) and its variants are the most used algorithms in machine learning ...
The stochastic momentum method is a commonly used acceleration technique for solving large-scale sto...
Momentum is known to accelerate the convergence of gradient descent in strongly convex settings with...
The momentum acceleration technique is widely adopted in many optimization algorithms. However, ther...
In this paper, we propose SGEM, Stochastic Gradient with Energy and Momentum, to solve a large class...
Stochastic Gradient Descent (SGD) is the workhorse for training large-scale machine learning applica...
Momentum methods have been shown to accelerate the convergence of the standard gradient descent algo...
Stochastic mo...
Stochastic gradient descent (SGD) and its variants are the main workhorses for solving large-scale o...
We study the convergence of accelerated stochastic gradient descent for strongly convex objectives u...