We propose a novel low-rank initialization framework for training low-rank deep neural networks -- networks whose weight parameters are re-parameterized as products of two low-rank matrices. The most successful existing approach, spectral initialization, draws a sample from the initialization distribution of the full-rank setting and then optimally approximates the full-rank initialization parameters in the Frobenius norm with a pair of low-rank initialization matrices via singular value decomposition. Our method is inspired by the insight that approximating the function computed by each layer matters more than approximating the parameter values. We provably demonstrate that there is a significant gap between these two ...
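A minimal sketch of the spectral initialization baseline described above, assuming a Kaiming/He-style Gaussian draw for the full-rank parameters (the exact full-rank distribution, and the even split of singular values between the two factors, are our assumptions) and NumPy's SVD. By the Eckart-Young theorem, the returned factors give the best rank-r approximation of the full-rank draw in the Frobenius norm:

```python
import numpy as np

def spectral_init(m, n, rank, rng=None):
    """Sketch of spectral initialization for a low-rank layer.

    Draws a full-rank weight matrix W (Kaiming/He-style Gaussian, an
    assumption) and returns factors (A, B) such that A @ B is the best
    rank-`rank` approximation of W in the Frobenius norm.
    """
    rng = np.random.default_rng() if rng is None else rng
    W = rng.normal(0.0, np.sqrt(2.0 / n), size=(m, n))  # full-rank draw

    # Truncated SVD: keep the top-`rank` singular triplets.
    U, S, Vt = np.linalg.svd(W, full_matrices=False)
    sqrt_S = np.sqrt(S[:rank])
    # Splitting each singular value evenly (sqrt on each side) is one
    # common convention; other splits leave A @ B unchanged.
    A = U[:, :rank] * sqrt_S           # shape (m, rank)
    B = sqrt_S[:, None] * Vt[:rank]    # shape (rank, n)
    return A, B

A, B = spectral_init(m=512, n=256, rank=32)
# A @ B is the optimal rank-32 Frobenius-norm approximation of the
# original full-rank draw; the low-rank layer then uses the pair (A, B)
# in place of a single full-rank weight matrix.
```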
Exciting new work on the generalization bounds for neural networks (NN) given by Neyshabur et al., ...
Proper initialization of neural networks is critical for successful training of their weig...
The importance of weight initialization when building a deep learning model is often underappreciate...
Randomly initialized neural networks are known to become harder to train with ...
Neural networks have achieved tremendous success in a large variety of applications. However, their ...
In this thesis, we consider resource limitations on machine learning algorithms in a variety of sett...
Training a neural network (NN) depends on multiple factors, including but not limited to the initial...
Low-rankness plays an important role in traditional machine learning, but is not so popular in deep ...
A new method of initializing the weights in deep neural networks is proposed. The method follows two...
We analyze deep ReLU neural networks trained with mini-batch stochastic gradient descent and weight d...
Batch Normalization is an essential component of all state-of-the-art neural network architectures....
We propose a new method for creating computationally efficient convolutional neural networks (CNNs) ...
We provide novel guaranteed approaches for training feedforward neural networks with sparse connecti...
Neural networks have gained widespread use in many machine learning tasks due to their state-of-the-...