Gaze reflects how humans process visual scenes and is therefore increasingly used in computer vision systems. Previous works demonstrated the potential of gaze for object-centric tasks, such as object localization and recognition, but it remains unclear if gaze can also be beneficial for scene-centric tasks, such as image captioning. We present a new perspective on gaze-assisted image captioning by studying the interplay between human gaze and the attention mechanism of deep neural networks. Using a public large-scale gaze dataset, we first assess the relationship between state-of-the-art object and scene recognition models, bottom-up visual saliency, and human gaze. We then propose a novel split attention model for image captioning. Our mo...
Humans and other primates shift their gaze to allocate processing resources to a subset of the visua...
Estimating the focus of attention of a person looking at an image or a video is a crucial step which...
Image and video captioning are important tasks in visual data analytics, as they concern the capabil...
Image captioning has recently been gaining a lot of attention thanks to the impressive achievements ...
In this work, we present a novel dataset consisting of eye movements and verbal descriptions recorde...
Deep Convolutional Neural Networks (DCNNs) were originally inspired by principles of biological visi...
Attention mechanisms have recently been introduced in deep learning for various tasks in natural lan...
Inspired by recent work in machine translation and object detection, we introduce an attention based...
Humans have the remarkable ability to follow the gaze of other people to identify what they are look...
What does human gaze reveal about a user's intents and to what extent can these intents be inferred...
Visual attention plays an important role in understanding images and demonstrates its effectiveness in ...
We posit that a person's gaze behavior while freely viewing a scene contains an abundance of informa...
Automatic generation of captions for a given image is an active research area in Artificial Intel...