The stochastic gradient method is currently the prevailing technology for training neural networks. Compared with classical gradient descent, the computation of the true gradient as an average over the data is replaced by a single randomly chosen element of the sum. When dealing with massive data, this bold approximation reduces the number of elementary gradient evaluations and lowers the cost of each iteration. The price to be paid is the appearance of oscillations and slow convergence, which often requires an excessive number of iterations. The aim of this thesis is to design an approach that is both: (i) more robust, using the fundamental methods that have been successfully proven in classical optimization, i...
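To make the contrast concrete, the following is a minimal sketch (not taken from the thesis) of the two update rules on a least-squares toy problem: full-batch gradient descent averages the gradient over all data points, while stochastic gradient descent uses one randomly chosen term of the sum per iteration. The data, step sizes, decay schedule, and iteration counts are illustrative assumptions.

    # Sketch only: full-batch gradient descent vs. stochastic gradient descent
    # on a least-squares problem (1/2n) * ||Ax - b||^2. All parameters are illustrative.
    import numpy as np

    rng = np.random.default_rng(0)
    n, d = 1000, 20
    A = rng.standard_normal((n, d))
    x_true = rng.standard_normal(d)
    b = A @ x_true + 0.1 * rng.standard_normal(n)

    def full_gradient(x):
        # True gradient: an average over all n samples (one full pass per evaluation).
        return A.T @ (A @ x - b) / n

    def sample_gradient(x, i):
        # Gradient of a single term of the sum: unbiased but noisy estimate of the full gradient.
        return A[i] * (A[i] @ x - b[i])

    # Classical gradient descent: expensive iterations, smooth decrease.
    x_gd = np.zeros(d)
    for _ in range(200):
        x_gd -= 0.01 * full_gradient(x_gd)

    # Stochastic gradient descent: cheap iterations, oscillatory path;
    # a decaying step size damps the oscillations near the solution.
    x_sgd = np.zeros(d)
    for t in range(2000):
        i = rng.integers(n)
        x_sgd -= 0.01 / (1 + 0.001 * t) * sample_gradient(x_sgd, i)

    print("GD error: ", np.linalg.norm(x_gd - x_true))
    print("SGD error:", np.linalg.norm(x_sgd - x_true))

Per iteration, SGD touches a single sample instead of all n, which is the cost advantage the abstract describes; the noise it introduces is what motivates the more robust schemes studied in the thesis.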
Deep neural networks (DNNs) are currently predominantly trained using first-order methods. Some of t...
Which numerical methods are ideal for training a neural network? In this report four different optim...
The notable changes over the current version: - worked example of convergence rates showing SAG can ...
We design four novel approximations of the Fisher Information Matrix (FIM) that plays a central role...
Second-order optimization methods have the ability to accelerate convergence by modifying the gradie...
Several studies have shown the ability of natural gradient descent to minimize the objective functio...
Neural networks are an important class of highly flexible and powerful models inspired by the struct...
Second-order optimization methods applied to train deep neural networks use the curvature informat...
For a long time, second-order optimization methods have been regarded as computationally inefficient...
The current scalable Bayesian methods for Deep Neural Networks (DNNs) often rely on the Fisher Infor...
In this dissertation, we are concerned with the advancement of optimization algorithms for training ...
Second-order optimizers are thought to hold the potential to speed up neural network training, but d...
Neural network models have become extremely widespread in recent years because of ...
We report research results on the training and use of an artificial neural network for the precondit...
Deep learning has recently become one of the most widely used techniques in the fiel...