Deep learning uses neural networks that are parameterised by their weights. These networks are usually trained by tuning the weights to directly minimise a given loss function. In this paper we propose to re-parameterise the weights into targets for the firing strengths of the individual nodes in the network. Given a set of targets, it is possible to calculate the weights which make the firing strengths best meet those targets. We argue that training with targets addresses the problem of exploding gradients, through a process we call cascade untangling, and makes the loss-function surface smoother to traverse, leading to easier and faster training and potentially better generalisation of the neural network. It also allows for easier learning of deeper and recurrent network structures.
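The conversion from targets to weights can be read as a per-layer least-squares problem: given the activations feeding a layer and target firing strengths for its nodes, solve for the weights that best reproduce those targets. The sketch below illustrates this in NumPy under stated assumptions; the function name weights_from_targets, the ridge term, and the use of pre-activation targets are illustrative choices, not the paper's exact formulation.

import numpy as np

def weights_from_targets(layer_input, targets, ridge=1e-3):
    """Solve for weights W so that layer_input @ W best matches the
    target firing strengths, in the least-squares sense.

    layer_input: (n_samples, n_in) activations entering the layer
    targets:     (n_samples, n_out) desired pre-activation values
    ridge:       small Tikhonov term for numerical stability (assumed)
    """
    A = layer_input
    # Regularised normal equations: (A^T A + ridge*I) W = A^T targets
    gram = A.T @ A + ridge * np.eye(A.shape[1])
    return np.linalg.solve(gram, A.T @ targets)

# Toy usage: one layer with 10 inputs and 5 nodes, random targets
rng = np.random.default_rng(0)
X = rng.normal(size=(64, 10))    # activations feeding the layer
T = rng.normal(size=(64, 5))     # target firing strengths per node
W = weights_from_targets(X, T)
print(np.abs(X @ W - T).mean())  # residual of the least-squares fit

Because each layer's solve depends on the firing strengths of the layer before it, re-solving the weights after every target update is what keeps downstream layers consistent with upstream changes, which is the intuition behind the cascade-untangling effect described above.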