Recently, large-scale pre-trained vision-language models (e.g., CLIP and ALIGN) have demonstrated remarkable effectiveness in learning transferable visual representations. To transfer the knowledge encoded in these models to downstream tasks, several fine-tuning approaches, including prompt-tuning and adapter-based methods, have been developed to adapt vision-language models under supervision. However, these methods depend on annotated samples, which are labor-intensive and time-consuming to acquire, limiting scalability. To address this issue, in this work we design an unsupervised fine-tuning approach for vision-language models called the Unsupervised Prototype Adapter (UP-Adapter). ...
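The abstract above names the UP-Adapter only in outline. As a minimal sketch of the general idea it gestures at (building class prototypes from pseudo-labeled, frozen CLIP features and blending prototype similarity with zero-shot logits, in the spirit of cache-style adapters such as Tip-Adapter), the following Python is illustrative only; the function names, the top-k confidence selection, and the blending weight `alpha` are assumptions, not the paper's actual method.

```python
# Hypothetical sketch of an unsupervised prototype adapter over frozen CLIP
# features. Assumes image_feats (N, D) and text_feats (C, D) are precomputed,
# L2-normalized CLIP embeddings; all names and hyperparameters are illustrative.
import torch
import torch.nn.functional as F

def pseudo_label(image_feats, text_feats):
    """Zero-shot pseudo-labels and confidences from cosine similarity."""
    logits = image_feats @ text_feats.t()            # (N, C) cosine similarities
    conf, labels = logits.softmax(dim=-1).max(dim=-1)
    return labels, conf

def build_prototypes(image_feats, labels, conf, num_classes, top_k=16):
    """Average the top-k most confident pseudo-labeled samples per class."""
    protos = []
    for c in range(num_classes):
        idx = (labels == c).nonzero(as_tuple=True)[0]
        if idx.numel() == 0:                          # no sample for this class
            protos.append(torch.zeros(image_feats.size(1),
                                      device=image_feats.device))
            continue
        k = min(top_k, idx.numel())
        top = idx[conf[idx].topk(k).indices]
        protos.append(F.normalize(image_feats[top].mean(0), dim=-1))
    return torch.stack(protos)                        # (C, D)

def adapter_logits(query_feats, prototypes, text_feats, alpha=1.0):
    """Blend zero-shot CLIP logits with prototype-similarity logits."""
    zero_shot = query_feats @ text_feats.t()
    proto_sim = query_feats @ prototypes.t()
    return zero_shot + alpha * proto_sim
```

In this sketch no labels are used anywhere: supervision comes entirely from CLIP's own zero-shot predictions, which is what makes the fine-tuning unsupervised.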
Pre-Trained Vision-Language Models (VL-PTMs) have shown promising capabilities in grounding natural ...
Contrastive language-image pretraining (CLIP) links vision and language modalities into a unified em...
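For context on the unified embedding space this snippet refers to: CLIP scores an image against a set of text prompts by cosine similarity between their embeddings. A minimal zero-shot classification example with the openai/CLIP reference library (https://github.com/openai/CLIP) follows; the image path and class names are placeholders.

```python
# Zero-shot classification with CLIP: embed the image and candidate prompts
# into the shared space, then rank classes by cosine similarity.
import torch
import clip
from PIL import Image

device = "cuda" if torch.cuda.is_available() else "cpu"
model, preprocess = clip.load("ViT-B/32", device=device)

image = preprocess(Image.open("example.jpg")).unsqueeze(0).to(device)
classes = ["dog", "cat", "car"]  # placeholder label set
prompts = clip.tokenize([f"a photo of a {c}" for c in classes]).to(device)

with torch.no_grad():
    image_feat = model.encode_image(image)
    text_feat = model.encode_text(prompts)
    image_feat /= image_feat.norm(dim=-1, keepdim=True)
    text_feat /= text_feat.norm(dim=-1, keepdim=True)
    probs = (100.0 * image_feat @ text_feat.T).softmax(dim=-1)

print(probs)  # per-class probabilities from similarity in the shared space
```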
As transformers evolve, pre-trained models have advanced at a breakneck pace in recent years. They h...
Vision-language models such as CLIP are pretrained on large volumes of internet-sourced image and te...
Since the rise of powerful large-scale pre-trained Vision-Language (VL) models, such as CLIP and ALI...
Although massive pre-trained vision-language models like CLIP show impressive generalization capabil...
The emergence of vision-language models (VLMs), such as CLIP, has spurred a significant research eff...
Contrastive Language-Image Pre-training (CLIP) has drawn increasing attention recently for its trans...
Recent advances in large-scale vision-language models have achieved impressive performance in v...
Large vision-language representation learning models like CLIP have demonstrated impressive performa...
Recent advances in pre-training vision-language models like CLIP have shown great potential in learn...
Large pre-trained vision-language models like CLIP have shown great potential in learning representa...
Pre-trained vision-language (VL) models have seen a rise in recent years, achieving state-of-the-art...
Large-scale pre-trained Vision-Language Models (VLMs), such as CLIP, establish the correlation betwe...
Large-scale pretraining is fast becoming the norm in Vision-Language (VL) modeling. However, prevail...