Deeply learned representations have achieved superior image retrieval performance in a retrieve-then-rerank manner. Recent state-of-the-art single stage model, which heuristically fuses local and global features, achieves promising trade-off between efficiency and effectiveness. However, we notice that efficiency of existing solutions is still restricted because of their multi-scale inference paradigm. In this paper, we follow the single stage art and obtain further complexity-effectiveness balance by successfully getting rid of multi-scale testing. To achieve this goal, we abandon the widely-used convolution network giving its limitation in exploring diverse visual patterns, and resort to fully attention based framework for robust represen...
Re-Identification (Re-ID) is a fundamental computer vision task, which refers to associating targets...
Visual attention selects data considered as “interesting” by humans, and it is modeled in the field ...
The need for Explainable AI is increasing with the development of deep learning. The saliency maps d...
In the recent advancements attention mechanism in deep learning had played a vital role in proving b...
The classification of gigapixel histopathology images with deep multiple instance learning models ha...
Vision Transformers achieved outstanding performance in many computer vision tasks. Early Vision Tra...
Vision Transformer (ViT) attains state-of-the-art performance in visual recognition, and the variant...
Local feature provides compact and invariant image representation for various visual tasks. Current ...
Deep neural networks have reached human-level performance on many computer vision tasks. However, th...
Image matching is a central component in many computer vision applications. The field has progressed...
We propose a unified look at jointly learning multiple vision tasks and visual domains through unive...
We address the problem of visual instance search, which consists to retrieve all the images within a...
This paper addresses the problem of very large-scale image retrieval, focusing on improving its accu...
In this paper, we propose a general framework for image classification using the attention mechanism...
Image representation plays an important part in many real-world multimedia ap- plications. Over the ...
Re-Identification (Re-ID) is a fundamental computer vision task, which refers to associating targets...
Visual attention selects data considered as “interesting” by humans, and it is modeled in the field ...
The need for Explainable AI is increasing with the development of deep learning. The saliency maps d...
In the recent advancements attention mechanism in deep learning had played a vital role in proving b...
The classification of gigapixel histopathology images with deep multiple instance learning models ha...
Vision Transformers achieved outstanding performance in many computer vision tasks. Early Vision Tra...
Vision Transformer (ViT) attains state-of-the-art performance in visual recognition, and the variant...
Local feature provides compact and invariant image representation for various visual tasks. Current ...
Deep neural networks have reached human-level performance on many computer vision tasks. However, th...
Image matching is a central component in many computer vision applications. The field has progressed...
We propose a unified look at jointly learning multiple vision tasks and visual domains through unive...
We address the problem of visual instance search, which consists to retrieve all the images within a...
This paper addresses the problem of very large-scale image retrieval, focusing on improving its accu...
In this paper, we propose a general framework for image classification using the attention mechanism...
Image representation plays an important part in many real-world multimedia ap- plications. Over the ...
Re-Identification (Re-ID) is a fundamental computer vision task, which refers to associating targets...
Visual attention selects data considered as “interesting” by humans, and it is modeled in the field ...
The need for Explainable AI is increasing with the development of deep learning. The saliency maps d...