Large pre-trained models such as CLIP or ALIGN offer consistent accuracy across a range of data distributions when performing zero-shot inference (i.e., without fine-tuning on a specific dataset). Although existing fine-tuning methods substantially improve accuracy on a given target distribution, they often reduce robustness to distribution shifts. We address this tension by introducing a simple and effective method for improving robustness while fine-tuning: ensembling the weights of the zero-shot and fine-tuned models (WiSE-FT). Compared to standard fine-tuning, WiSE-FT provides large accuracy improvements under distribution shift, while preserving high accuracy on the target distribution. On ImageNet and five derived distribution shifts,...
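The weight-space ensembling described above can be sketched in a few lines. The snippet below is a minimal illustration, not the authors' released implementation: it assumes both models share the same architecture and floating-point parameters, and the mixing coefficient alpha is an assumed hyperparameter (alpha=0 recovers the zero-shot weights, alpha=1 the fine-tuned weights).

    import copy
    import torch

    def wise_ft(zero_shot_model, fine_tuned_model, alpha=0.5):
        # Interpolate parameters of the zero-shot and fine-tuned models.
        # Assumes identical architectures and float-valued state dicts.
        zs_state = zero_shot_model.state_dict()
        ft_state = fine_tuned_model.state_dict()
        merged = {
            key: (1 - alpha) * zs_state[key] + alpha * ft_state[key]
            for key in zs_state
        }
        # Load the interpolated weights into a copy of the zero-shot model.
        ensembled = copy.deepcopy(zero_shot_model)
        ensembled.load_state_dict(merged)
        return ensembled

In practice one would evaluate the ensembled model over a grid of alpha values and pick the trade-off between target-distribution and distribution-shift accuracy.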
Few-shot learning (FSL) methods typically assume clean support sets with accurately labeled samples ...
In machine learning, we traditionally evaluate the performance of a single model, averaged over a co...
We consider transfer learning approaches that fine-tune a pretrained deep neural network on a target...
Robustness to natural distribution shifts has seen remarkable progress thanks to recent pre-training...
The conventional recipe for maximizing model accuracy is to (1) train multiple models with various h...
Large pre-trained, zero-shot capable models have shown considerable success both for standard transf...
Recent studies have shown that CLIP has achieved remarkable success in performing zero-shot inferenc...
Existing fine-tuning methods either tune all parameters of the pre-trained model (full fine-tuning),...
We often see undesirable tradeoffs in robust machine learning where out-of-distribution (OOD) accura...
Cross-domain few-shot learning (CD-FSL), where there are few target samples under extreme difference...
Certified robustness in machine learning has primarily focused on adversarial perturbations of the i...
Nowadays, owing to the superior capacity of the large pre-trained language models (PLM), the PLM-bas...
When deployed in the real world, machine learning models inevitably encounter changes in the data di...
Empirical risk minimization (ERM) is known in practice to be non-robust to distributional shift wher...
Real world uses of deep learning require predictable model behavior under distribution shifts. Model...