Image search can be tackled using deep features from pre-trained Convolutional Neural Networks (CNN). The feature map from the last convolutional layer of a CNN encodes descriptive information from which a discriminative global descriptor can be obtained. We propose a new representation of co-occurrences from deep convolutional features to extract additional relevant information from this last convolutional layer. Combining this co-occurrence map with the feature map, we achieve an improved image representation. We present two different methods to get the co-occurrence representation, the first one based on direct aggregation of activations, and the second one, based on a trainable co-occurrence representation. The image descriptors derived...
© 2017 Association for Computing Machinery. The Bag-of-Words (BoW) models using the SIFT descriptors...
Image representations extracted from convolutional neural networks (CNNs) outdo hand-crafted feature...
Image retrieval in realistic scenarios targets large dynamic datasets of unlabeled images. In these ...
International audienceRecently, image representation built upon Convolutional Neural Network (CNN) h...
International audienceRecently, image representation built upon Convolutional Neural Network (CNN) h...
In recent years, instance-level-image retrieval has attracted massive attention. Several researchers...
International audienceRecently, image representation built upon Convolutional Neural Network (CNN) h...
The recent advances brought by deep learning allowed to improve the performance in image retrieval t...
International audiencePatch-level descriptors underlie several important computer vision tasks, such...
© 2015 IEEE. In this paper we evaluate the quality of the activation layers of a convolutional neura...
In recent years, the expansion of the Internet has brought an explosion of visual information, inclu...
2016 IEEE.A major component of a generic image retrieval pipeline is producing concise and effective...
The purpose of mid-level visual element discovery is to find clusters of image patches that are repr...
Image representations extracted from convolutional neural networks (CNNs) outdo hand-crafted feature...
This paper addresses the problem of very large-scale image retrieval, focusing on improving its accu...
© 2017 Association for Computing Machinery. The Bag-of-Words (BoW) models using the SIFT descriptors...
Image representations extracted from convolutional neural networks (CNNs) outdo hand-crafted feature...
Image retrieval in realistic scenarios targets large dynamic datasets of unlabeled images. In these ...
International audienceRecently, image representation built upon Convolutional Neural Network (CNN) h...
International audienceRecently, image representation built upon Convolutional Neural Network (CNN) h...
In recent years, instance-level-image retrieval has attracted massive attention. Several researchers...
International audienceRecently, image representation built upon Convolutional Neural Network (CNN) h...
The recent advances brought by deep learning allowed to improve the performance in image retrieval t...
International audiencePatch-level descriptors underlie several important computer vision tasks, such...
© 2015 IEEE. In this paper we evaluate the quality of the activation layers of a convolutional neura...
In recent years, the expansion of the Internet has brought an explosion of visual information, inclu...
2016 IEEE.A major component of a generic image retrieval pipeline is producing concise and effective...
The purpose of mid-level visual element discovery is to find clusters of image patches that are repr...
Image representations extracted from convolutional neural networks (CNNs) outdo hand-crafted feature...
This paper addresses the problem of very large-scale image retrieval, focusing on improving its accu...
© 2017 Association for Computing Machinery. The Bag-of-Words (BoW) models using the SIFT descriptors...
Image representations extracted from convolutional neural networks (CNNs) outdo hand-crafted feature...
Image retrieval in realistic scenarios targets large dynamic datasets of unlabeled images. In these ...