Transformers and their variants achieve excellent results in various computer vision and natural language processing tasks, but high computational costs and reliance on large training datasets restrict their deployment in resource-constrained settings. Low-rank approximation of model weights has been effective in compressing CNN models, but its application to transformers has been less explored and is less effective. Existing methods require the complete dataset to fine-tune compressed models, which is both time-consuming and data-hungry. This paper reveals that the features (i.e., activations) are low-rank, but model weights are surprisingly not low-rank. Hence, AAFM is proposed, which adaptively determines the compressed model structure an...
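The observation above suggests a simple compression recipe, sketched below: measure the spectral decay of a weight matrix versus the features it produces, then factor the layer in feature space rather than weight space. This is a minimal illustrative sketch, not the paper's AAFM implementation; the layer shape, the synthetic low-rank inputs, and the target rank k are all assumptions chosen to make the contrast visible.

import torch

torch.manual_seed(0)
d_in, d_out, n, k, r = 512, 512, 4096, 64, 32  # k: hypothetical target rank

# Stand-in weight matrix and inputs with (assumed) low intrinsic dimension r.
W = torch.randn(d_out, d_in) / d_in ** 0.5
X = torch.randn(n, r) @ torch.randn(r, d_in) + 0.05 * torch.randn(n, d_in)
Y = X @ W.T  # features (activations) of the linear layer

def topk_energy(M, k):
    # Fraction of squared spectral energy captured by the top-k singular values.
    s = torch.linalg.svdvals(M)
    return ((s[:k] ** 2).sum() / (s ** 2).sum()).item()

# Features concentrate energy in few directions; the weight matrix does not.
print(f"weight  top-{k} singular-value energy: {topk_energy(W, k):.3f}")
print(f"feature top-{k} singular-value energy: {topk_energy(Y, k):.3f}")

# Feature-space compression: project outputs onto the top-k principal
# directions U_k of Y, factoring the layer as W ~ U_k (U_k^T W).
U, _, _ = torch.linalg.svd(Y.T @ Y)
U_k = U[:, :k]          # (d_out, k) principal directions of the features
W1 = U_k.T @ W          # (k, d_in)  first small layer
W2 = U_k                # (d_out, k) second small layer
Y_hat = (X @ W1.T) @ W2.T
err = ((Y - Y_hat).norm() / Y.norm()).item()
print(f"relative reconstruction error at rank {k}: {err:.3f}")

The design choice here is that the factorization is driven by the activations: one d_in-to-d_out layer becomes d_in-to-k and k-to-d_out layers, cutting parameters from d_in*d_out to k*(d_in+d_out), which pays off precisely because the features, not the weights, are low-rank.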
Given a large Transformer model, how can we obtain a small and computationally efficient model which...
The requirement to repeatedly move large feature maps off- and on-chip during inference with convolu...
Model compression is very important for the efficient deployment of deep neural network (DNN) models...
Deep compression refers to removing the redundancy of parameters and feature maps for d...
In recent years, Vision Transformers (ViTs) have emerged as a promising approach for various compute...
Convolutional neural networks (CNN) have demonstrated outstanding Compressed Sensing (CS) performanc...
Recently, the deep neural network (DNN) has become one of the most advanced and powerful methods use...
Neural networks have gained widespread use in many machine learning tasks due to their state-of-the-...
More transformer blocks with residual connections have recently achieved impressive results on vario...
We tackle the problem of producing compact models, maximizing their accuracy f...
The success of convolutional neural networks (CNNs) in various applications is accompanied by a sign...
Modern iterations of deep learning models contain millions (billions) of unique parameters, each repr...
While Deep Neural Networks (DNNs) have achieved tremendous success for large vocabulary continuous ...
In recent years, deep neural networks have revolutionized machine learning tasks. However, the des...
Recently, deep models have shown tremendous improvements in neural machine translation (NMT). Howeve...