Hessian-based analysis and computation are widely used in scientific computing. However, due to the (incorrect, but in our experience widespread) belief that Hessian-based computations are infeasible for large machine learning (ML) problems, the majority of work in ML (except on quite small problems) relies only on first-order methods. Yet, using sub-sampling and randomized numerical linear algebra, second-order information can be extracted efficiently for large-scale machine learning problems. In this thesis, we consider three use cases of second-order methods: (i) for non-convex optimization and/or ML problems, we propose inexact variants of three classic Newton-type methods: Trust Region method, C...
We consider stochastic second-order methods for minimizing smooth and strongly-convex functions unde...
Training deep neural networks consumes increasing computational resource shares in many compute cent...
Second-order information, in the form of Hessian- or Inverse-Hessian-vector products, is a fundament...
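For context, such Hessian-vector products can be formed without ever materializing the Hessian, using forward-over-reverse automatic differentiation. The sketch below is a minimal illustration of that idea in JAX; the least-squares loss and all names are assumptions for the example, not taken from the work summarized above.

```python
# Minimal sketch: a Hessian-vector product via forward-over-reverse autodiff.
# The objective and variable names are illustrative assumptions.
import jax
import jax.numpy as jnp

def loss(w, X, y):
    # simple least-squares objective, purely for illustration
    return 0.5 * jnp.mean((X @ w - y) ** 2)

def hvp(w, v, X, y):
    # H(w) @ v is the directional derivative of the gradient along v
    grad_fn = lambda w_: jax.grad(loss)(w_, X, y)
    return jax.jvp(grad_fn, (w,), (v,))[1]

# usage: the 10x10 Hessian is never formed explicitly
kx, ky = jax.random.split(jax.random.PRNGKey(0))
X = jax.random.normal(kx, (100, 10))
y = jax.random.normal(ky, (100,))
w = jnp.zeros(10)
v = jnp.ones(10)
print(hvp(w, v, X, y))  # shape (10,)
```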
We consider variants of trust-region and adaptive cubic regularization methods for non-convex optimi...
In this dissertation, we are concerned with the advancement of optimization algorithms for training ...
There are several benefits of taking the Hessian of the objective function into account when designi...
Neural networks are an important class of highly flexible and powerful models inspired by the struct...
We propose a fast second-order method that can be used as a drop-in replacement for current deep lea...
Learning non-use-case-specific models has been shown to be a challenging task in Deep Learning (DL)...
Newton methods can be applied in many supervised learning approaches. However, for large-scale data,...
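One common way to make a Newton step affordable at scale, in the spirit of sub-sampled methods, is to touch the Hessian only through products with vectors evaluated on a random subsample of the data, and to solve the Newton system inexactly with conjugate gradient. The sketch below is a generic illustration under those assumptions; the batch size, damping, and all names are hypothetical and do not describe the specific algorithm of any abstract listed here.

```python
# Hedged sketch of one sub-sampled Newton-CG step (illustrative only).
# The Hessian appears solely through matrix-vector products on a subsample.
import jax
import jax.numpy as jnp
from jax.scipy.sparse.linalg import cg

def loss(w, X, y):
    return 0.5 * jnp.mean((X @ w - y) ** 2)      # illustrative objective

def hvp(w, v, X, y):
    grad_fn = lambda w_: jax.grad(loss)(w_, X, y)
    return jax.jvp(grad_fn, (w,), (v,))[1]

def subsampled_newton_step(w, X, y, key, batch=256, damping=1e-3):
    idx = jax.random.choice(key, X.shape[0], shape=(batch,), replace=False)
    Xs, ys = X[idx], y[idx]
    g = jax.grad(loss)(w, X, y)                          # full gradient
    matvec = lambda v: hvp(w, v, Xs, ys) + damping * v   # sub-sampled H + damping
    p, _ = cg(matvec, g, maxiter=20)                     # inexact solve of (H + eps*I) p = g
    return w - p
```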
Trust-region (TR) and adaptive regularization using cubics (ARC) have proven to have some very appea...
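The mechanism shared by TR-type methods is to compare the decrease predicted by a local quadratic model against the decrease actually achieved, then grow or shrink the trust radius accordingly. The sketch below uses common textbook thresholds and illustrative names; it is not the specific update rule of the works summarized here.

```python
# Illustrative sketch of the standard trust-region acceptance/radius update.
# Thresholds (0.1, 0.25, 0.75) are common textbook defaults, chosen for illustration.
import jax.numpy as jnp

def tr_update(f, w, p, g, hvp_fn, radius):
    predicted = -(g @ p + 0.5 * p @ hvp_fn(p))    # model decrease m(0) - m(p)
    actual = f(w) - f(w + p)                      # true decrease in the objective
    rho = actual / predicted
    if rho < 0.25:
        radius *= 0.25                            # poor agreement: shrink the region
    elif rho > 0.75 and jnp.linalg.norm(p) >= 0.99 * radius:
        radius *= 2.0                             # good agreement on the boundary: expand
    accept = rho > 0.1
    return accept, radius
```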
Incorporating second-order curvature information into machine learning optimization algorithms can b...
While first-order methods are popular for solving optimization problems that arise in large-scale de...
In this work, we develop first-order (Hessian-free) and zero-order (derivative-free) implementations...
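A "first-order (Hessian-free)" curvature probe of this kind can be built from gradient evaluations alone, since H(w) v is approximately (grad f(w + eps*v) - grad f(w)) / eps, and a "zero-order" variant replaces the gradients themselves with function differences. The sketch below only illustrates those identities; eps and all names are assumptions, not the implementation of the work above.

```python
# Sketch of Hessian-free and derivative-free curvature/gradient probes.
# No second derivatives are used; the step sizes are illustrative choices.
import jax
import jax.numpy as jnp

def loss(w, X, y):
    return 0.5 * jnp.mean((X @ w - y) ** 2)      # illustrative objective

def hvp_fd(w, v, X, y, eps=1e-4):
    # first-order (Hessian-free): Hv from two gradient calls
    g_plus = jax.grad(loss)(w + eps * v, X, y)
    g_zero = jax.grad(loss)(w, X, y)
    return (g_plus - g_zero) / eps

def grad_fd(w, X, y, eps=1e-5):
    # zero-order (derivative-free): gradient estimate via central differences
    f = lambda w_: loss(w_, X, y)
    e = jnp.eye(w.shape[0])
    return jnp.array([(f(w + eps * e[i]) - f(w - eps * e[i])) / (2 * eps)
                      for i in range(w.shape[0])])
```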
Efficiently approximating local curvature information of the loss function is a key tool for optimiz...
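One widely used way to approximate local curvature cheaply is Hutchinson-style randomized estimation: products of the Hessian with random sign vectors give an unbiased estimate of its diagonal (or trace). The sketch below assumes JAX-style autodiff and illustrative names; it is a generic example of the technique, not the method of the abstract above.

```python
# Sketch of a Hutchinson-style estimate of the Hessian diagonal:
# for Rademacher z, E[z * (H z)] = diag(H). All names are illustrative.
import jax
import jax.numpy as jnp

def loss(w, X, y):
    return 0.5 * jnp.mean((X @ w - y) ** 2)      # illustrative objective

def hvp(w, v, X, y):
    grad_fn = lambda w_: jax.grad(loss)(w_, X, y)
    return jax.jvp(grad_fn, (w,), (v,))[1]

def hessian_diag_estimate(w, X, y, key, num_samples=16):
    keys = jax.random.split(key, num_samples)
    est = jnp.zeros_like(w)
    for k in keys:
        z = jax.random.rademacher(k, w.shape, dtype=w.dtype)  # random +/-1 vector
        est = est + z * hvp(w, z, X, y)
    return est / num_samples
```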