The performance of predicting human fixations in videos has been much enhanced with the help of development of the convolutional neural networks (CNN). In this paper, we propose a novel end-to-end neural network “SalSAC” for video saliency prediction, which uses the CNN-LSTM-Attention as the basic architecture and utilizes the information from both static and dynamic aspects. To better represent the static information of each frame, we first extract multi-level features of same size from different layers of the encoder CNN and calculate the corresponding multi-level attentions, then we randomly shuffle these attention maps among levels and multiply them to the extracted multi-level features respectively. Through this way, we leverage the at...
Visual saliency prediction using transformers - Convolutional neural networks (CNNs) have significan...
International audiencePrediction of visual saliency in images and video is a highly researched topic...
This paper presents a novel deep architecture for saliency prediction. Current state of the art mode...
The performance of predicting human fixations in videos has been much enhanced with the help of deve...
In this work, we contribute to video saliency research in two ways. First, we introduce a new benchm...
This paper investigates modifying an existing neural network architecture for static saliency predic...
In this paper, we propose a novel 3D CNN architecture that enables us to train an effective video sa...
pp 508-513International audienceWhen viewing video sequences, the human visual system (HVS) tends to...
Semantics and motion are two cues of essence for the success in video salient object detection. Most...
Data-driven saliency has recently gained a lot of attention thanks to the use of Convolutional Neura...
Estimating the focus of attention of a person looking at an image or a video is a crucial step which...
The prediction of salient areas in images has been traditionally addressed with hand-crafted feature...
Data-driven saliency has recently gained a lot of attention thanks to the use of convolutional neura...
This paper focuses on the problem of visual saliency prediction, predicting regions of an image that...
Predicting visual attention is a very active field in the computer vision community. Visual attentio...
Visual saliency prediction using transformers - Convolutional neural networks (CNNs) have significan...
International audiencePrediction of visual saliency in images and video is a highly researched topic...
This paper presents a novel deep architecture for saliency prediction. Current state of the art mode...
The performance of predicting human fixations in videos has been much enhanced with the help of deve...
In this work, we contribute to video saliency research in two ways. First, we introduce a new benchm...
This paper investigates modifying an existing neural network architecture for static saliency predic...
In this paper, we propose a novel 3D CNN architecture that enables us to train an effective video sa...
pp 508-513International audienceWhen viewing video sequences, the human visual system (HVS) tends to...
Semantics and motion are two cues of essence for the success in video salient object detection. Most...
Data-driven saliency has recently gained a lot of attention thanks to the use of Convolutional Neura...
Estimating the focus of attention of a person looking at an image or a video is a crucial step which...
The prediction of salient areas in images has been traditionally addressed with hand-crafted feature...
Data-driven saliency has recently gained a lot of attention thanks to the use of convolutional neura...
This paper focuses on the problem of visual saliency prediction, predicting regions of an image that...
Predicting visual attention is a very active field in the computer vision community. Visual attentio...
Visual saliency prediction using transformers - Convolutional neural networks (CNNs) have significan...
International audiencePrediction of visual saliency in images and video is a highly researched topic...
This paper presents a novel deep architecture for saliency prediction. Current state of the art mode...