Gaze reflects how humans process visual scenes and is therefore increasingly used in computer vision systems. Previous works demonstrated the potential of gaze for object-centric tasks, such as object localization and recognition, but it remains unclear if gaze can also be beneficial for scene-centric tasks, such as image captioning. We present a new perspective on gaze-assisted image captioning by studying the interplay between human gaze and the attention mechanism of deep neural networks. Using a public large-scale gaze dataset, we first assess the relationship between state-of-the-art object and scene recognition models, bottom-up visual saliency, and human gaze. We then propose a novel split attention model for image captioning. Our mo...
Thanks to deep learning, computer vision has advanced by a large margin. Attention mechanism, inspir...
Previous studies of eye gaze have shown that when looking at images containing human faces, observer...
Deep Convolutional Neural Networks (DCNNs) were originally inspired by principles of biological visi...
Gaze reflects how humans process visual scenes and is therefore increasingly used in computer vision...
In this work, we present a novel dataset consisting of eye movements and verbal descriptions recorde...
Image captioning has been recently gaining a lot of attention thanks to the impressive achievements ...
Data-driven saliency has recently gained a lot of attention thanks to the use of convolutional neura...
Image and video captioning are important tasks in visual data analytics, as they concern the capabil...
Image captioning has been recently gaining a lot of attention thanks to the impressive achievements ...
Image and video captioning are important tasks in visual data analytics, as they concern the capabil...
We posit that a person's gaze behavior while freely viewing a scene contains an abundance of informa...
We posit that a person's gaze behavior while freely viewing a scene contains an abundance of informa...
Image captioning has been recently gaining a lot of attention thanks to the impressive achievements ...
Data-driven saliency has recently gained a lot of attention thanks to the use of Convolutional Neura...
Data-driven saliency has recently gained a lot of attention thanks to the use of Convolutional Neura...
Thanks to deep learning, computer vision has advanced by a large margin. Attention mechanism, inspir...
Previous studies of eye gaze have shown that when looking at images containing human faces, observer...
Deep Convolutional Neural Networks (DCNNs) were originally inspired by principles of biological visi...
Gaze reflects how humans process visual scenes and is therefore increasingly used in computer vision...
In this work, we present a novel dataset consisting of eye movements and verbal descriptions recorde...
Image captioning has been recently gaining a lot of attention thanks to the impressive achievements ...
Data-driven saliency has recently gained a lot of attention thanks to the use of convolutional neura...
Image and video captioning are important tasks in visual data analytics, as they concern the capabil...
Image captioning has been recently gaining a lot of attention thanks to the impressive achievements ...
Image and video captioning are important tasks in visual data analytics, as they concern the capabil...
We posit that a person's gaze behavior while freely viewing a scene contains an abundance of informa...
We posit that a person's gaze behavior while freely viewing a scene contains an abundance of informa...
Image captioning has been recently gaining a lot of attention thanks to the impressive achievements ...
Data-driven saliency has recently gained a lot of attention thanks to the use of Convolutional Neura...
Data-driven saliency has recently gained a lot of attention thanks to the use of Convolutional Neura...
Thanks to deep learning, computer vision has advanced by a large margin. Attention mechanism, inspir...
Previous studies of eye gaze have shown that when looking at images containing human faces, observer...
Deep Convolutional Neural Networks (DCNNs) were originally inspired by principles of biological visi...