In this paper, we propose a novel 3D CNN architecture that enables us to train an effective video saliency prediction model. The model is designed to capture important motion information using multiple adjacent frames. Our model performs a cubic convolution on a set of consecutive frames to extract spatio-temporal fea- tures. This enables us to predict the saliency map for any given frame using past frames. We comprehensively investigate the performance of our model with respect to state-of-the-art video saliency models. Experimental results on three large-scale datasets, DHF1K, UCF-SPORTS and DAVIS, demonstrate the competitiveness of our approach
During recent years remarkable progress has been made in visual saliency modeling. Our interest is i...
The objective of this thesis is to study the capabilities of 3D convolutional neural networks (CNN) ...
Deep learning approaches have been established as the main methodology for video classification and ...
In this paper, we propose a novel 3D CNN architecture that enables us to train an effective video sa...
Visual saliency is a probabilistic estimate of how likely a given spatial area in an image or video ...
Human activity recognition in videos with convolutional neural network (CNN) features has received i...
Visual saliency is a probabilistic estimate of how likely a given spa-tial area in an image or video...
The performance of predicting human fixations in videos has been much enhanced with the help of deve...
International audienceVisual attention is one of the most important mechanisms in the human visual p...
© 2016 IEEE. Human activity recognition in videos with convolutional neural network (CNN) features h...
In this work, we contribute to video saliency research in two ways. First, we introduce a new benchm...
Estimating the focus of attention of a person looking at an image or a video is a crucial step which...
Abstract — We describe a new 3D saliency prediction model that accounts for diverse low-level lumina...
International audiencePrediction of visual saliency in images and video is a highly researched topic...
International audienceIn this paper, we present a video-based emotion recognition neural network ope...
During recent years remarkable progress has been made in visual saliency modeling. Our interest is i...
The objective of this thesis is to study the capabilities of 3D convolutional neural networks (CNN) ...
Deep learning approaches have been established as the main methodology for video classification and ...
In this paper, we propose a novel 3D CNN architecture that enables us to train an effective video sa...
Visual saliency is a probabilistic estimate of how likely a given spatial area in an image or video ...
Human activity recognition in videos with convolutional neural network (CNN) features has received i...
Visual saliency is a probabilistic estimate of how likely a given spa-tial area in an image or video...
The performance of predicting human fixations in videos has been much enhanced with the help of deve...
International audienceVisual attention is one of the most important mechanisms in the human visual p...
© 2016 IEEE. Human activity recognition in videos with convolutional neural network (CNN) features h...
In this work, we contribute to video saliency research in two ways. First, we introduce a new benchm...
Estimating the focus of attention of a person looking at an image or a video is a crucial step which...
Abstract — We describe a new 3D saliency prediction model that accounts for diverse low-level lumina...
International audiencePrediction of visual saliency in images and video is a highly researched topic...
International audienceIn this paper, we present a video-based emotion recognition neural network ope...
During recent years remarkable progress has been made in visual saliency modeling. Our interest is i...
The objective of this thesis is to study the capabilities of 3D convolutional neural networks (CNN) ...
Deep learning approaches have been established as the main methodology for video classification and ...