While deep neural networks are a highly successful model class, their large memory footprint puts considerable strain on energy consumption, communication bandwidth, and storage requirements. Consequently, model size reduction has become a central goal in deep learning. A typical approach is to train a set of deterministic weights while applying techniques such as pruning and quantization, so that the empirical weight distribution becomes amenable to Shannon-style coding schemes. However, as shown in this paper, relaxing weight determinism and using a full variational distribution over weights allows for more efficient coding schemes and consequently higher compression rates. In particular, following the classical bits-back a...
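The key quantity in the bits-back argument above is the Kullback-Leibler divergence between the variational posterior over weights and the encoding prior: that divergence, not the entropy of a deterministic weight vector, is the expected message length. Below is a minimal sketch of this calculation, assuming a diagonal Gaussian posterior q(w) = N(mu, sigma^2) per weight and a shared zero-mean Gaussian prior; the function name, prior choice, and toy numbers are illustrative assumptions, not the paper's actual coding scheme.

```python
import numpy as np

def bits_back_code_length(mu, sigma, prior_sigma=1.0):
    """Expected bits-back message length (in bits) for weights with a
    diagonal-Gaussian variational posterior q(w) = N(mu, sigma^2) and a
    shared Gaussian prior p(w) = N(0, prior_sigma^2).

    Following the bits-back argument, communicating a sample from q costs
    KL(q || p) nats in expectation; dividing by ln 2 converts to bits.
    """
    kl_nats = (np.log(prior_sigma / sigma)
               + (sigma**2 + mu**2) / (2.0 * prior_sigma**2)
               - 0.5)
    return kl_nats.sum() / np.log(2.0)

# Toy comparison: a near-deterministic posterior costs many bits per weight,
# while a posterior that stays close to the prior costs almost nothing.
rng = np.random.default_rng(0)
mu = rng.normal(scale=0.1, size=10_000)            # learned weight means
tight = bits_back_code_length(mu, np.full_like(mu, 1e-3))
loose = bits_back_code_length(mu, np.full_like(mu, 0.8))
print(f"near-deterministic weights: {tight / 1e3:.1f} kbits")
print(f"high-variance posterior:    {loose / 1e3:.1f} kbits")
```

The comparison makes the trade-off concrete: allowing posterior variance (weight non-determinism) directly reduces the KL term and hence the code length, which is the efficiency gain the abstract refers to.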
Lossy gradient compression has become a practical tool to overcome the communication bottleneck in c...
Latent variable models have been successfully applied in lossless compression with the bits-back cod...
Studies on generalization performance of machine learning algorithms under the...
Modern iterations of deep learning models contain millions (billions) of unique parameters, each repr...
The Minimum Description Length principle (MDL) can be used to train the hidden units of a neural net...
Neural networks have gained widespread use in many machine learning tasks due to their state-of-the-...
This article considers the subject of information losses arising from the finite data sets used in t...
In federated learning (FL), a global model is trained at a Parameter Server (PS) by aggregating mode...
Compressed communication, in the form of sparsification or quantization of stochastic gradients, is ...
The increase in sophistication of neural network models in recent years has exponentially expanded m...
We tackle the problem of producing compact models, maximizing their accuracy f...
In recent years, neural networks have grown in popularity, mostly thanks to the advances in the fiel...
The success of overparameterized deep neural networks (DNNs) poses a great challenge to deploy compu...