Fine-tuning large pre-trained models on downstream tasks has recently been adopted across a variety of domains. However, updating the entire parameter set of a large pre-trained model is costly. Although recently proposed parameter-efficient transfer learning (PETL) techniques allow updating only a small subset of parameters (e.g., 2% of the parameters) inside a pre-trained backbone network for a new task, they reduce the training memory requirement by no more than 30%. This is because computing gradients for the trainable parameters still requires backpropagation through the large pre-trained backbone model. To address this, we propose Ladder Side-Tuning (LST), a new PETL technique that reduces training memory requirements by more subst...
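A minimal PyTorch sketch may make the memory argument concrete: if the trainable parameters live in a small side network that reads only detached backbone activations, the backward pass never enters the frozen backbone, so the backbone's activations need not be stored for gradient computation. The abstract above is truncated before any architectural details, so the SideBlock module, the learned gating scalar, the layer sizes, and the LadderSideTuner wrapper below are illustrative assumptions rather than the paper's exact design.

```python
# Sketch of a ladder-style side network over a frozen backbone (assumed design).
import torch
import torch.nn as nn

class SideBlock(nn.Module):
    """A small trainable block in the side network (illustrative assumption)."""
    def __init__(self, backbone_dim: int, side_dim: int):
        super().__init__()
        self.down = nn.Linear(backbone_dim, side_dim)  # project backbone activation down
        self.mix = nn.Linear(side_dim, side_dim)       # cheap side computation
        self.gate = nn.Parameter(torch.zeros(1))       # learned mixing weight

    def forward(self, side_state, backbone_activation):
        # Fuse the detached backbone activation into the side stream.
        g = torch.sigmoid(self.gate)
        fused = g * side_state + (1 - g) * self.down(backbone_activation)
        return torch.relu(self.mix(fused))

class LadderSideTuner(nn.Module):
    def __init__(self, backbone_layers, backbone_dim=768, side_dim=96, num_classes=2):
        super().__init__()
        self.backbone_layers = backbone_layers         # frozen pre-trained layers
        for p in self.backbone_layers.parameters():
            p.requires_grad_(False)                    # no gradients for the backbone
        self.input_proj = nn.Linear(backbone_dim, side_dim)
        self.side_blocks = nn.ModuleList(
            SideBlock(backbone_dim, side_dim) for _ in backbone_layers
        )
        self.head = nn.Linear(side_dim, num_classes)

    def forward(self, x):
        side = self.input_proj(x.detach())
        h = x
        for layer, block in zip(self.backbone_layers, self.side_blocks):
            with torch.no_grad():                      # backbone runs purely as a feature extractor
                h = layer(h)
            # h carries no autograd history (no_grad + detach), so backprop never
            # enters the backbone; only the small side network stores activations.
            side = block(side, h.detach())
        return self.head(side.mean(dim=1))             # pool over sequence, classify

# Usage: a toy 4-layer frozen backbone adapted with a tiny side network.
backbone = nn.ModuleList(
    nn.TransformerEncoderLayer(d_model=768, nhead=8, batch_first=True)
    for _ in range(4)
)
model = LadderSideTuner(backbone)
logits = model(torch.randn(2, 16, 768))                # (batch, seq_len, hidden)
loss = logits.sum()
loss.backward()                                        # gradients touch only side-network parameters
```

The key choice in this sketch is cutting the autograd graph at every ladder connection: training memory then scales with the small side network rather than with the large backbone, which is the effect the abstract contrasts with conventional PETL methods.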
Backpropagation learning algorithms typically collapse the network's structure into a single ve...
The scale of large pre-trained models (PTMs) poses significant challenges in adapting to downstream ...
The recent success of large and deep neural network models has motivated the training of even larger...
Parameter-efficient fine-tuning methods (PEFTs) offer the promise of adapting large pre-trained mode...
Transformer-based pre-trained models with millions of parameters require large storage. Recent appro...
The current modus operandi in adapting pre-trained models involves updating all the backbone paramet...
Standard fine-tuning of large pre-trained language models (PLMs) for downstream tasks requires updat...
Transfer-learning methods aim to improve performance in a data-scarce target domain using a model pr...
In this work, we suggest Kernel Filtering Linear Overparameterization (KFLO), where a linear cascade...
Recent work has explored the potential to adapt a pre-trained vision transformer (ViT) by updating o...
Parameter fine-tuning is a transfer learning approach whereby learned parameters from pre-trained so...
Transfer learning on edge is challenging due to on-device limited resources. Existing work addresses...
We analyze the learning dynamics of neural language and translation models using Loss Change Allocat...
Deep learning training consumes ever-increasing time and resources, and that is due to the complexity...
Recently, the pretrain-finetuning paradigm has attracted considerable attention in the graph learning communi...