Deep Learning has recently become one of the most widely used techniques in the field of Machine Learning. Optimising these models, however, is very difficult, and in order to scale training to large datasets and model sizes, practitioners use first-order optimisation methods. One of the main challenges of using the more sophisticated second-order optimisation methods is that the curvature matrices of the loss surfaces of neural networks are usually intractable, which remains an open avenue for research. In this work, we investigate the Gauss-Newton matrix for neural networks and its application in different areas of Machine Learning. Firstly, we analyse the structure of the Hessian and Gauss-Newton matrices for Feed Forward Ne...
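As a minimal illustration of the Gauss-Newton matrix discussed above: for a model f(x; w) with loss L, the generalized Gauss-Newton matrix is G = Jᵀ H_L J, where J is the Jacobian of the model outputs with respect to the parameters and H_L is the Hessian of the loss with respect to the outputs. The sketch below (all names illustrative, not from the text) uses a linear model with squared loss, where the GGN coincides with the exact Hessian, so the construction can be checked directly:

```python
import numpy as np

# Sketch of the generalized Gauss-Newton (GGN) matrix, G = J^T H_L J,
# for a linear model f(x; w) = x @ w with loss L = 0.5 * ||f - y||^2.
# For this special case the GGN equals the exact Hessian, X^T X.

rng = np.random.default_rng(0)
X = rng.normal(size=(10, 3))   # 10 samples, 3 parameters
y = rng.normal(size=10)

J = X                          # Jacobian of model outputs w.r.t. w (linear model)
H_L = np.eye(len(y))           # Hessian of 0.5*||f - y||^2 w.r.t. outputs: identity
G = J.T @ H_L @ J              # generalized Gauss-Newton matrix

# Sanity check: for a linear model the GGN is the true Hessian.
assert np.allclose(G, X.T @ X)
```

For nonlinear networks J depends on w and the true Hessian gains an extra term involving second derivatives of f; the GGN drops that term, which is what makes it positive semi-definite and tractable to approximate.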
Uncertainty estimates are crucial in many deep learning problems, e.g. for active learning or safety...
The Levenberg-Marquardt (LM) learning algorithm is a popular method for training neural networks;...
Second-order optimization methods applied to train deep neural networks use the curvature informat...
We present an efficient block-diagonal approximation to the Gauss-Newton matrix for feedforward neur...
In this dissertation, we are concerned with the advancement of optimization algorithms for training ...
We design four novel approximations of the Fisher Information Matrix (FIM) that plays a central role...
For a long time, second-order optimization methods have been regarded as computationally inefficient...
Neural networks are an important class of highly flexible and powerful models inspired by the struct...
Second-order optimization methods have the ability to accelerate convergence by modifying the gradie...
The current scalable Bayesian methods for Deep Neural Networks (DNNs) often rely on the Fisher Infor...
The stochastic gradient method is currently the prevailing technology for training neural networks. ...
While first-order methods are popular for solving optimization problems that arise in large-scale de...
We analyse the dynamics of a number of second order on-line learning algorithms training multi-layer...
We introduce the Kronecker factored online Laplace approximation for overcoming catastrophic forget...