Neural networks have gained widespread use in many machine learning tasks due to their state-of-the-art performance. However, the cost of this progress lies in the ever-increasing size and computational demands of the resulting models. As such, neural network compression, the process of reducing the size, power consumption, or any other cost of interest of a model, has become an important practical step when deploying trained models to perform inference tasks. In this dissertation, we explore a particular compression mechanism, the low-rank decomposition, and its extensions for the purposes of neural network compression. We study important aspects of low-rank compression: how to select the decomposition ranks across the...
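The low-rank mechanism referenced above can be illustrated with a short sketch (NumPy; the layer dimensions and the rank below are illustrative assumptions, not values from the dissertation): a dense m-by-n weight matrix is replaced by two rank-r factors, cutting the stored parameter count from m*n down to r*(m+n).

```python
import numpy as np

rng = np.random.default_rng(0)
m, n, r = 512, 256, 32  # illustrative layer sizes and decomposition rank

W = rng.standard_normal((m, n))           # dense layer weight matrix
U, s, Vt = np.linalg.svd(W, full_matrices=False)
W1 = U[:, :r] * s[:r]                     # m x r factor
W2 = Vt[:r, :]                            # r x n factor (W is approximated by W1 @ W2)

full_params = m * n                       # 131072 parameters in the dense layer
lowrank_params = W1.size + W2.size        # 24576 parameters in the two factors
print(full_params, lowrank_params)        # prints "131072 24576", roughly a 5.3x reduction
```

Choosing r per layer trades accuracy against this parameter saving, which is precisely the rank-selection question the abstract raises.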
While deep neural networks are a highly successful model class, their large memory footprint puts co...
In this paper, we study the compression of a target two-layer neural network with N nodes into a com...
Artificial intelligence algorithms have experienced a dramatic improvement in the last decade ...
Recently, the deep neural network (DNN) has become one of the most advanced and powerful methods use...
Low-rankness plays an important role in traditional machine learning, but is not so popular in deep ...
© 2017 IEEE. Deep compression refers to removing the redundancy of parameters and feature maps for d...
Low-rank compression is an important model compression strategy for obtaining compact neural network...
A common technique for compressing a neural network is to compute the rank-k ℓ2 approximation A_k of ...
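The rank-k ℓ2 (Frobenius-norm) approximation mentioned in this snippet is, by the Eckart-Young theorem, obtained exactly by truncating the singular value decomposition. A minimal sketch, with a random matrix standing in for a weight matrix (the sizes are illustrative assumptions):

```python
import numpy as np

rng = np.random.default_rng(1)
A = rng.standard_normal((100, 80))       # stand-in for a layer's weight matrix
k = 10

U, s, Vt = np.linalg.svd(A, full_matrices=False)
A_k = (U[:, :k] * s[:k]) @ Vt[:k, :]     # best rank-k approximation (Eckart-Young)

err = np.linalg.norm(A - A_k)            # Frobenius (l2) approximation error
bound = np.sqrt(np.sum(s[k:] ** 2))      # minimum achievable over all rank-k matrices
print(np.isclose(err, bound))            # prints "True": truncated SVD attains the optimum
```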
Compressing neural networks is a key step when deploying models for real-time or embedded applicatio...
Compression of a neural network can help in speeding up both the training and the inference of the n...
In recent years, neural networks have grown in popularity, mostly thanks to the advances in the fiel...
In recent years, deep neural networks have revolutionized machine learning tasks. However, the des...
Lossy gradient compression has become a practical tool to overcome the communication bottleneck in c...