Fine-grained object retrieval aims to learn discriminative representation to retrieve visually similar objects. However, existing top-performing works usually impose pairwise similarities on the semantic embedding spaces or design a localization sub-network to continually fine-tune the entire model in limited data scenarios, thus resulting in convergence to suboptimal solutions. In this paper, we develop Fine-grained Retrieval Prompt Tuning (FRPT), which steers a frozen pre-trained model to perform the fine-grained retrieval task from the perspectives of sample prompting and feature adaptation. Specifically, FRPT only needs to learn fewer parameters in the prompt and adaptation instead of fine-tuning the entire model, thus solving the issue...
Enhancing the zero-shot performance of instruction-following models requires heavy computation, eith...
Fine-grained image recognition is a longstanding computer vision challenge that focuses on different...
Delicate attention of the discriminative regions plays a critical role in Fine-Grained Visual Catego...
We present a new paradigm for fine-tuning large-scale vision-language pre-trained models on downstre...
Finetuning can be used to tackle domain specific tasks by transferring knowledge learned from pre-tr...
In this paper we develop a representation for fine-grained retrieval. Given a query, we want to retr...
Recent studies show that prompt tuning can better leverage the power of large language models than f...
Recognizing discriminative details such as eyes and beaks is important for distinguishing fine-grain...
Prompt tuning attempts to update few task-specific parameters in pre-trained models. It has achieved...
Dense retrieval (DR) converts queries and documents into dense embeddings and measures the similarit...
With the development of computational power and techniques for data collection, deep learning demons...
Recent advances on fine-grained image retrieval prefer learning convolutional neural network (CNN) w...
Existing fine-tuning methods either tune all parameters of the pre-trained model (full fine-tuning),...
Fine-tuning from a collection of models pre-trained on different domains (a “model zoo”) is emerging...
Image retrieval targets to find images from a database that are visually similar to the query image....
Enhancing the zero-shot performance of instruction-following models requires heavy computation, eith...
Fine-grained image recognition is a longstanding computer vision challenge that focuses on different...
Delicate attention of the discriminative regions plays a critical role in Fine-Grained Visual Catego...
We present a new paradigm for fine-tuning large-scale vision-language pre-trained models on downstre...
Finetuning can be used to tackle domain specific tasks by transferring knowledge learned from pre-tr...
In this paper we develop a representation for fine-grained retrieval. Given a query, we want to retr...
Recent studies show that prompt tuning can better leverage the power of large language models than f...
Recognizing discriminative details such as eyes and beaks is important for distinguishing fine-grain...
Prompt tuning attempts to update few task-specific parameters in pre-trained models. It has achieved...
Dense retrieval (DR) converts queries and documents into dense embeddings and measures the similarit...
With the development of computational power and techniques for data collection, deep learning demons...
Recent advances on fine-grained image retrieval prefer learning convolutional neural network (CNN) w...
Existing fine-tuning methods either tune all parameters of the pre-trained model (full fine-tuning),...
Fine-tuning from a collection of models pre-trained on different domains (a “model zoo”) is emerging...
Image retrieval targets to find images from a database that are visually similar to the query image....
Enhancing the zero-shot performance of instruction-following models requires heavy computation, eith...
Fine-grained image recognition is a longstanding computer vision challenge that focuses on different...
Delicate attention of the discriminative regions plays a critical role in Fine-Grained Visual Catego...