Several methods for training feed-forward neural networks require second-order information from the Hessian matrix of the error function. Although it is possible to calculate the Hessian matrix exactly, it is often not desirable because of the computation and memory requirements involved. Some learning techniques do, however, only need the Hessian matrix times a vector. This paper presents a method to calculate the Hessian matrix times a vector in O(N) time, where N is the number of variables in the network. This is of the same order as the calculation of the gradient of the error function. The usefulness of this algorithm is demonstrated by improvements to existing learning techniques.
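The O(N) Hessian-times-vector computation described above can be illustrated with the R-operator trick: differentiate the usual forward and backward passes in the direction of the vector v. The sketch below is a minimal illustration for a one-hidden-layer tanh network with squared error; the network architecture, variable names, and the finite-difference check are our own choices, not taken from the paper.

```python
import numpy as np

def gradient(params, x, t):
    """Ordinary forward pass and backprop gradient for a one-hidden-layer
    tanh network with squared error E = 0.5 * ||y - t||^2."""
    W1, b1, W2, b2 = params
    h = np.tanh(W1 @ x + b1)        # hidden activations
    y = W2 @ h + b2                 # network output
    d2 = y - t                      # dE/dy
    d1 = (W2.T @ d2) * (1 - h**2)   # error back-propagated to hidden layer
    return (np.outer(d1, x), d1, np.outer(d2, h), d2)

def hessian_vector_product(params, v, x, t):
    """Exact H @ v in O(N): run a directional-derivative (R-)pass alongside
    the forward and backward passes, where R{f} = d/dr f(w + r*v) at r=0."""
    W1, b1, W2, b2 = params
    V1, c1, V2, c2 = v              # direction, same shapes as the parameters
    # Forward pass and its R-derivative.
    h = np.tanh(W1 @ x + b1)
    y = W2 @ h + b2
    Ra1 = V1 @ x + c1               # R{W1 x + b1}
    Rh = (1 - h**2) * Ra1           # R{tanh(a1)} = tanh'(a1) * R{a1}
    Ry = V2 @ h + W2 @ Rh + c2      # product rule on W2 h + b2
    # Backward pass and its R-derivative.
    d2 = y - t
    Rd2 = Ry
    Rd1 = ((V2.T @ d2 + W2.T @ Rd2) * (1 - h**2)
           + (W2.T @ d2) * (-2.0 * h * Rh))
    # R applied to each gradient component gives the corresponding slice of Hv.
    return (np.outer(Rd1, x), Rd1,
            np.outer(Rd2, h) + np.outer(d2, Rh), Rd2)
```

The R-pass touches each weight a constant number of times, so its cost matches one gradient evaluation; a central finite difference of the gradient, (g(w + eps*v) - g(w - eps*v)) / (2*eps), gives an independent check of the result.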
A statistically-based algorithm for pruning weights from feed-forward networks is presented. This a...
We derive two second-order algorithms, based on the conjugate gradient method, for online training o...
Efficiently approximating local curvature information of the loss function is a key tool for optimiz...
Since the discovery of the back-propagation method, many modified and new algorithms have been propo...
We extend here a general mathematical model for feed-forward neural networks. Such a network is repr...
Training algorithms for Multilayer Perceptrons optimize the set of W weights and biase...
The Levenberg-Marquardt (LM) learning algorithm is a popular algorithm for training neural networks;...
For training fully-connected neural networks (FCNNs), we propose a practical approximate second-orde...
We analyse the dynamics of a number of second order on-line learning algorithms training multi-layer...
Just storing the Hessian H (the matrix of second derivatives ∂²E/∂wᵢ∂wⱼ of the error E with respec...
We propose a very simple, and well-principled way of computing the optimal step size in gradient desce...
Minimization methods for training feed-forward networks with Backpropagation are compared. Feedforwa...
A method and apparatus for supervised neural learning of time dependent trajectories exploits the co...