The current modus operandi in adapting pre-trained models involves updating all the backbone parameters, i.e., full fine-tuning. This paper introduces Visual Prompt Tuning (VPT) as an efficient and effective alternative to full fine-tuning for large-scale Transformer models in vision. Taking inspiration from recent advances in efficiently tuning large language models, VPT introduces only a small number of trainable parameters (less than 1% of model parameters) in the input space while keeping the model backbone frozen. Via extensive experiments on a wide variety of downstream recognition tasks, we show that VPT achieves significant performance gains compared to other parameter-efficient tuning protocols. Most importantly, VPT even outperforms...
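To make the mechanism concrete, the following is a minimal PyTorch sketch of the VPT idea described above, not the authors' implementation: learnable prompt tokens are prepended to the patch-token sequence of a frozen Transformer encoder, and only the prompts and a task head are trained. The module name PromptedEncoder, the prompt count, and the use of a generic nn.TransformerEncoder as a stand-in for a pre-trained ViT are all illustrative assumptions.

```python
import torch
import torch.nn as nn

class PromptedEncoder(nn.Module):
    def __init__(self, embed_dim=768, depth=12, num_heads=12,
                 num_prompts=10, num_classes=100):
        super().__init__()
        # Stand-in for a pre-trained ViT backbone (assumed frozen here).
        layer = nn.TransformerEncoderLayer(embed_dim, num_heads, batch_first=True)
        self.backbone = nn.TransformerEncoder(layer, depth)
        for p in self.backbone.parameters():
            p.requires_grad = False  # backbone stays frozen

        # The only new parameters: prompts in the input space plus a linear head.
        self.prompts = nn.Parameter(torch.randn(1, num_prompts, embed_dim) * 0.02)
        self.head = nn.Linear(embed_dim, num_classes)

    def forward(self, patch_tokens):                   # (B, N, D) patch embeddings
        b = patch_tokens.size(0)
        prompts = self.prompts.expand(b, -1, -1)
        x = torch.cat([prompts, patch_tokens], dim=1)  # prepend prompt tokens
        x = self.backbone(x)
        return self.head(x.mean(dim=1))                # simple pooled classifier

model = PromptedEncoder()
trainable = sum(p.numel() for p in model.parameters() if p.requires_grad)
total = sum(p.numel() for p in model.parameters())
print(f"trainable fraction: {trainable / total:.4%}")  # well under 1%
```

Counting parameters at the end illustrates the abstract's claim: with a handful of prompt tokens and a small head, the trainable fraction stays far below 1% of the model.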
Large language models (LLMs) and vision language models (VLMs) demonstrate excellent performance on ...
The size of vision models has grown exponentially over the last few years, especially after the emer...
The objective of this work is to explore how to effectively and efficiently adapt pre-trained visual...
Existing fine-tuning methods either tune all parameters of the pre-trained model (full fine-tuning),...
Recent advancements have illuminated the efficacy of some tensorization-decomposition Parameter-Effi...
Attention-based neural networks such as the Vision Transformer (ViT) have recently attained state-of...
Visual Parameter-Efficient Fine-Tuning (PEFT) has become a powerful alternative for full fine-tuning...
Recent work has explored the potential to adapt a pre-trained vision transformer (ViT) by updating o...
Pre-Trained Vision-Language Models (VL-PTMs) have shown promising capabilities in grounding natural ...
We introduce BitFit, a sparse-finetuning method where only the bias-terms of the model (or a subset ...
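As a point of comparison with prompt-based methods, here is an illustrative sketch of the BitFit-style setup described in this abstract, again not the paper's code: every weight matrix is frozen and only bias terms remain trainable. The generic nn.TransformerEncoder used as the pre-trained backbone is an assumption for the sake of a self-contained example.

```python
import torch.nn as nn

# Stand-in for a pre-trained Transformer backbone.
layer = nn.TransformerEncoderLayer(d_model=768, nhead=12, batch_first=True)
encoder = nn.TransformerEncoder(layer, num_layers=12)

for name, param in encoder.named_parameters():
    # Only parameters whose name ends in "bias" (attention, MLP, and
    # LayerNorm biases) stay trainable; all weight matrices are frozen.
    param.requires_grad = name.endswith("bias")

trainable = sum(p.numel() for p in encoder.parameters() if p.requires_grad)
total = sum(p.numel() for p in encoder.parameters())
print(f"bias-only trainable fraction: {trainable / total:.4%}")
```

The design choice is deliberately simple: selecting parameters by name keeps the method architecture-agnostic, which is what makes bias-only tuning such a strong sparse-finetuning baseline.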
Prompt tuning has become a new paradigm for model tuning and it has demonstrated success in natural ...
The advent of hyper-scale and general-purpose pre-trained models is shifting the paradigm of buildin...
We investigate the efficacy of visual prompting to adapt large-scale models in vision. Following the...
Models should have the ability to adapt to unseen data during test-time to avoid performance drop ca...
Since the rise of powerful large-scale pre-trained Vision-Language (VL) models, such as CLIP and ALI...