With advances in deep learning, exponential data growth, and increasing model complexity, developing efficient optimization methods is attracting much research attention. Several implementations favor Conjugate Gradient (CG) and Stochastic Gradient Descent (SGD) as practical and elegant routes to quick convergence; however, these optimizers also exhibit significant limitations across deep learning applications. Recent research explores higher-order optimization methods as better approaches, but these pose considerable computational challenges in practice. Comparing first- and higher-order optimization methods, our experiments in this paper reveal that Levenberg-Marquardt (LM) sig...
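As a rough illustration of the kind of first- versus higher-order comparison described above, the following minimal Python sketch fits a toy exponential model y = a*exp(b*x) with a first-order method (nonlinear Conjugate Gradient via scipy.optimize.minimize) and with Levenberg-Marquardt (scipy.optimize.least_squares, method="lm"). The toy problem, data, and starting point are assumptions for illustration only; this is not the paper's benchmark or code.

import numpy as np
from scipy.optimize import least_squares, minimize

# Toy data for y = a * exp(b * x) with a = 2.0, b = 1.5 plus a little noise
# (illustrative assumption, not the paper's experimental setup).
rng = np.random.default_rng(0)
x = np.linspace(0.0, 1.0, 50)
y = 2.0 * np.exp(1.5 * x) + 0.05 * rng.normal(size=x.size)

def residuals(p):
    a, b = p
    return a * np.exp(b * x) - y

def loss(p):
    # Standard nonlinear least-squares objective.
    r = residuals(p)
    return 0.5 * np.sum(r ** 2)

def grad(p):
    # Gradient J^T r, where J is the Jacobian of the residuals.
    a, b = p
    r = residuals(p)
    J = np.stack([np.exp(b * x), a * x * np.exp(b * x)], axis=1)
    return J.T @ r

x0 = np.array([1.0, 1.0])

# First-order baseline: nonlinear Conjugate Gradient uses only gradients.
res_cg = minimize(loss, x0, jac=grad, method="CG")

# Higher-order alternative: Levenberg-Marquardt (damped Gauss-Newton)
# exploits the least-squares structure via the residual Jacobian.
res_lm = least_squares(residuals, x0, method="lm")

print("CG estimate:", res_cg.x, "iterations:", res_cg.nit)
print("LM estimate:", res_lm.x, "function evals:", res_lm.nfev)

Both calls should recover parameters close to (2.0, 1.5) on this toy problem; the point is only to show the two families of methods applied to the same objective, not to reproduce the paper's results.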
Optimization and machine learning are both extremely active research topics. In this thesis, we expl...
While state-of-the-art machine learning models are deep, large-scale, sequential and highly nonconve...
The goal of this paper is to debunk and dispel the magic behind black-box optimizers and stochastic ...
Machine learning is a technology developed for extracting predictive models from data so as to be ...
The success of deep learning has shown impressive empirical breakthroughs, but many theoretical ques...
Optimization is the key component of deep learning. Increasing depth, which is vital for reaching a...
In the past decade, neural networks have demonstrated impressive performance in supervised learning....
In Theory IIb we characterize with a mix of theory and experiments the optimization of deep convolut...
The interplay between optimization and machine learning is one of the most important developments in...
The deep learning community has devised a diverse set of methods to make gradient optimization, usin...
Machine learning has been a computer science buzzword for years. The technology has a lot of potent...
Over the past decade, deep neural networks have solved ever more complex tasks across many fronts in...
In a typical Numerical Methods class, students learn that gradient descent is not an efficient optimiz...
Learning a deep neural network requires solving a challenging optimization problem: it is a high-dim...
While evolutionary algorithms (EAs) have long offered an alternative approach to optimization, in re...