In this work, we introduce AdaCN, a novel adaptive cubic Newton method for nonconvex stochastic optimization. AdaCN dynamically captures the curvature of the loss landscape through a diagonally approximated Hessian plus the norm of the difference between the previous two estimates. It requires only first-order gradients and updates with linear complexity in both time and memory. To reduce the variance introduced by the stochastic nature of the problem, AdaCN uses the first and second moments to implement exponential moving averages over the iteratively updated stochastic gradients and approximated stochastic Hessians, respectively. We validate AdaCN in extensive experiments, showing that it outperforms other stochastic first-order methods (...
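The abstract names the ingredients (a diagonal Hessian approximation, cubic regularization, and Adam-style moment averaging) but not the exact update rule. The following is a minimal sketch of how such a step could look, assuming a secant-style diagonal curvature estimate and the closed-form minimizer of a per-coordinate cubic model; the function name, hyperparameters, and the secant heuristic are illustrative assumptions, not the paper's definitions.

```python
import numpy as np

def adacn_style_step(x, grad, state, lr=1.0, beta1=0.9, beta2=0.999, M=1.0, eps=1e-8):
    """One AdaCN-style update (illustrative sketch, not the paper's exact rule)."""
    if state.get("prev_grad") is None:
        # First call: initialize moment buffers; no curvature information yet.
        state.update(prev_grad=grad, prev_x=x,
                     m=np.zeros_like(x), v=np.zeros_like(x), t=0)
    # Secant-style diagonal curvature estimate from the last two gradients
    # (an assumption; the abstract only says "diagonally approximated Hessian").
    dx = x - state["prev_x"]
    dg = grad - state["prev_grad"]
    h_hat = np.abs(dg) / (np.abs(dx) + eps)
    state["t"] += 1
    # Exponential moving averages of the stochastic gradient (first moment)
    # and of the diagonal Hessian estimate (second moment), as in the abstract.
    state["m"] = beta1 * state["m"] + (1 - beta1) * grad
    state["v"] = beta2 * state["v"] + (1 - beta2) * h_hat
    m_hat = state["m"] / (1 - beta1 ** state["t"])  # Adam-style bias correction
    v_hat = state["v"] / (1 - beta2 ** state["t"])
    # Closed-form minimizer of the per-coordinate cubic model
    #   g*s + 0.5*h*s**2 + (M/6)*|s|**3,  i.e.  s = -2g / (h + sqrt(h**2 + 2*M*|g|)).
    step = -2.0 * m_hat / (v_hat + np.sqrt(v_hat**2 + 2.0 * M * np.abs(m_hat)) + eps)
    state["prev_grad"], state["prev_x"] = grad, x
    return x + lr * step, state
```

Note that the per-coordinate cubic step s = -2g / (h + sqrt(h^2 + 2M|g|)) reduces to the Newton step -g/h as M -> 0 and stays bounded when the curvature estimate h is small, which is why cubic regularization is attractive in the nonconvex stochastic setting.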
Incorporating curvature information in stochastic methods has been a challenging task. This paper pr...
In this article, we present three smoothed functional (SF) algorithms for simulation optimization. Wh...
In view of a direct and simple improvement of vanilla SGD, this paper presents...
Stochastic gradient descent is the method of choice for solving large-scale optimization problems in...
We study stochastic Cubic Newton methods for solving general possibly non-convex minimization proble...
This paper studies some asymptotic properties of adaptive algorithms widely used in...
Incorporating second-order curvature information into machine learning optimization algorithms can b...
We introduce Adam, an algorithm for first-order gradient-based optimization of stochastic objective ...
Gradient-based optimization algorithms, in particular their stochastic counterparts, have become by ...
This thesis presents a family of adaptive curvature methods for gradient-based stochastic ...
We provide new adaptive first-order methods for constrained convex optimization. Our main algorithms...
Recent work has established an empirically successful framework for adapting learning rates for stoc...
We consider the fundamental problem in nonconvex optimization of efficiently reaching a stationary p...
We present new algorithms for simulation optimization using random directions stochastic approximati...
While first-order methods are popular for solving optimization problems that arise in large-scale de...