When free-viewing scenes, the first few fixations of human observers are driven in part by bottom-up attention. We seek to characterize this process by extracting all information from images that can be used to predict fixation densities (Kuemmerer et al., PNAS, 2015). If we ignore time and observer identity, the average amount of information is slightly larger than 2 bits per image for the MIT 1003 dataset. The minimum amount of information is 0.3 bits and the maximum is 5.2 bits. Before the rise of deep neural networks, the best models were able to capture one third of this information on average. We developed new saliency algorithms based on high-performing convolutional neural networks such as AlexNet or VGG-19 that have been shown to provide gene...
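The information figures quoted above are log-likelihood differences measured in bits. As a minimal sketch, assuming the model and a baseline are each given as a 2D array of fixation densities and that fixations are (row, column) pixel indices, the gain of a model over a baseline could be computed as below. The function name `information_gain_bits` and the toy 4x4 arrays are hypothetical and chosen only for illustration; this is not the authors' evaluation code.

```python
import numpy as np

def information_gain_bits(model_density, baseline_density, fixations):
    """Mean log2-likelihood gain of a model over a baseline at the fixated pixels.

    Hypothetical helper for illustration: densities are 2D arrays of
    non-negative values, fixations is a sequence of (row, col) indices.
    """
    model = np.asarray(model_density, dtype=float)
    baseline = np.asarray(baseline_density, dtype=float)
    # Normalize each map so that it sums to one (a probability distribution over pixels).
    model = model / model.sum()
    baseline = baseline / baseline.sum()
    rows, cols = np.asarray(fixations, dtype=int).T
    # Difference of log-likelihoods in bits, averaged over fixations.
    gain = np.log2(model[rows, cols]) - np.log2(baseline[rows, cols])
    return float(gain.mean())

# Toy example (assumed data): a uniform baseline and a model that puts
# half of its probability mass on pixel (1, 2).
uniform = np.ones((4, 4))
peaked = np.ones((4, 4))
peaked[1, 2] = 15.0  # 15 / (15 + 15) = 0.5 after normalization
fixations = [(1, 2), (1, 2)]

gain = information_gain_bits(peaked, uniform, fixations)
no_gain = information_gain_bits(uniform, uniform, fixations)
```

In this toy case the model assigns probability 1/2 to the fixated pixel and the baseline assigns 1/16, so the gain is log2(8) = 3 bits, while a model identical to the baseline gains nothing.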
Recent results suggest that state-of-the-art saliency models perform far from optimal in predicting ...
Eye tracking has become the de facto standard measure of visual attention in tasks that rang...
The problem of predicting where people look, or equivalently salient region detection, has been r...
Learning what properties of an image are associated with human gaze placement is important both for ...
Where humans choose to look can tell us a lot about behaviour in a variety of tasks. Over the last d...
Deep convolutional neural networks have demonstrated high performance for fixation prediction in r...
Predicting where humans choose to fixate can help in understanding a variety of human behaviour. The la...
Understanding where people look in images is an important problem in computer vision. Despite signif...
Deep saliency models represent the current state-of-the-art for predicting where humans look in real...
Understanding and predicting the human visual attention mechanism is an active area of research in t...
Estimating the focus of attention of a person looking at an image or a video is a crucial step which...
Under natural viewing conditions, human observers shift their gaze to allocate processing resources ...
Prediction of visual saliency in images and video is a highly researched topic...