This paper presents a generic framework in which images are modelled as order-less sets of weighted visual features. Each visual feature is associated with a weight factor that may inform its relevance. This framework can be applied to various bag-of-features approaches such as the bag-of-visual-word or the Fisher kernel representations. We suggest that if dense sampling is used, different schemes to weight local features can be evaluated, leading to results that are often better than the combination of multiple sampling schemes, at a much lower computational cost, because the features are extracted only once. This allows our framework to be a test-bed for saliency estimation methods in image categorisation tasks. We explored two main possi...
In this paper, a novel multi-scale, statistical approach for natural image representation is present...
Based on the bottom-up human visual attention model, a procedure for feature description and image r...
Visual Saliency aims to detect the most important regions of an image from a perceptual point of vie...
This work explores attention models to weight the contribution of local convolutional representation...
University of Minnesota Ph.D. dissertation. December 2012. Major: Computer Science. Advisor: Nikolao...
Saliency algorithms in content-based image retrieval are employed to retrieve the most important reg...
Visual saliency computation is about detecting and understanding salient regions and elements in a v...
To detect visually salient elements of complex natural scenes, computational bottom-up saliency mode...
People perceive any kind of information with different level of attention and involvement. It is due...
In the first seconds of observation of an image, several visual attention processes are involved in ...
Image representation for content-based scene recognition is considered as one of the most challengin...
Visual Category Recognition aims at fast classification of objects, as well as scenery, action, and ...
Narayan V, Tscherepanow M, Wrede B. A saliency map based on sampling an image into random rectangula...
Incorporating models of human perception into the process of scene interpretation and object recogni...
A salient image region is defined as an image part that is clearly different from its surround in te...
In this paper, a novel multi-scale, statistical approach for natural image representation is present...
Based on the bottom-up human visual attention model, a procedure for feature description and image r...
Visual Saliency aims to detect the most important regions of an image from a perceptual point of vie...
This work explores attention models to weight the contribution of local convolutional representation...
University of Minnesota Ph.D. dissertation. December 2012. Major: Computer Science. Advisor: Nikolao...
Saliency algorithms in content-based image retrieval are employed to retrieve the most important reg...
Visual saliency computation is about detecting and understanding salient regions and elements in a v...
To detect visually salient elements of complex natural scenes, computational bottom-up saliency mode...
People perceive any kind of information with different level of attention and involvement. It is due...
In the first seconds of observation of an image, several visual attention processes are involved in ...
Image representation for content-based scene recognition is considered as one of the most challengin...
Visual Category Recognition aims at fast classification of objects, as well as scenery, action, and ...
Narayan V, Tscherepanow M, Wrede B. A saliency map based on sampling an image into random rectangula...
Incorporating models of human perception into the process of scene interpretation and object recogni...
A salient image region is defined as an image part that is clearly different from its surround in te...
In this paper, a novel multi-scale, statistical approach for natural image representation is present...
Based on the bottom-up human visual attention model, a procedure for feature description and image r...
Visual Saliency aims to detect the most important regions of an image from a perceptual point of vie...