Given a training set, a loss function, and a neural network architecture, it is often taken for granted that optimal network parameters exist, and a common practice is to apply available optimization algorithms to search for them. In this work, we show that the existence of an optimal solution is not always guaranteed, especially in the context of {\em sparse} ReLU neural networks. In particular, we first show that optimization problems involving deep networks with certain sparsity patterns do not always have optimal parameters, and that optimization algorithms may then diverge. Via a new topological relation between sparse ReLU neural networks and their linear counterparts, we derive, using existing tools from real ...
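The non-existence phenomenon is easiest to see on the linear counterpart invoked above. As a minimal illustrative sketch (the LU-type sparsity pattern and the 2x2 "swap" target below are our own choices for illustration, not taken verbatim from the paper), the following NumPy snippet exhibits a fixed support pattern for a two-factor product whose loss can be driven arbitrarily close to 0, but only along parameter sequences whose entries blow up, so no optimal parameters exist:

```python
import numpy as np

# Target matrix: the 2x2 "swap" matrix, which admits no exact factorization
# A = L @ U with L lower-triangular and U upper-triangular:
#   (L @ U)[0, 0] = L[0, 0] * U[0, 0] = 0 forces L[0, 0] = 0 or U[0, 0] = 0,
#   but L[0, 0] = 0 makes (L @ U)[0, 1] = 0 != 1, and U[0, 0] = 0 makes
#   (L @ U)[1, 0] = 0 != 1. Hence the loss ||L @ U - A||_F^2 has infimum 0
#   over this sparsity pattern, yet that infimum is not attained.
A = np.array([[0.0, 1.0],
              [1.0, 0.0]])

def factors(eps):
    """Explicit family with LU supports whose product converges to A as eps -> 0."""
    L = np.array([[eps,       0.0],
                  [1.0 / eps, 1.0]])
    U = np.array([[eps, 1.0 / eps],
                  [0.0, -1.0 / eps**2]])
    return L, U

for eps in [1e-1, 1e-2, 1e-3]:
    L, U = factors(eps)
    err = np.linalg.norm(L @ U - A)                              # equals eps**2
    size = np.sqrt(np.linalg.norm(L)**2 + np.linalg.norm(U)**2)  # blows up
    print(f"eps={eps:.0e}  ||LU - A||_F = {err:.2e}  parameter norm = {size:.2e}")
```

Because the loss approaches its infimum only along parameter sequences of diverging norm, any algorithm that drives the loss toward that infimum must itself diverge, which is the failure mode the abstract describes for the corresponding sparse architectures.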