Human action recognition is valuable for numerous practical applications, e.g., gaming, video surveillance, and video search. In this paper we hypothesize that the classification of actions can be boosted by designing a smart feature pooling strategy under the prevalently used bag-of-words-based representation. Founded on automatic video saliency analysis, we propose the spatial-temporal attention-aware pooling scheme for feature pooling. First, the video saliencies are predicted using the video saliency model, and the localized spatial-temporal features are pooled at different saliency levels and video-saliency-guided channels are formed. Saliency-aware matching kernels are thus derived as the similarity measurement of these channels. Intu...
Conference of 2013 14th IEEE International Conference on Computer Vision, ICCV 2013 ; Conference Dat...
With the availability of cheap video recording devices, fast internet access and huge storage spaces...
With the availability of cheap video recording devices, fast internet access and huge storage spaces...
Human actions are spatio-temporal patterns. A popular representation is to describe the action by fe...
We introduce a simple yet effective network that embeds a novel Discriminative Feature Pooling (DFP)...
Several spatiotemporal feature point detectors have been recently used in video analysis for action ...
Recognizing actions is one of the important challenges in computer vision with respect to video data...
International audienceWe address the problem of action recognition in unconstrained videos. We propo...
International audienceWe address the problem of action recognition in unconstrained videos. We propo...
This paper presents a novel framework for human action recognition based on salient object detection...
This paper presents a novel framework for human action recognition based on salient object detection...
This paper presents a novel framework for human action recognition based on salient object detection...
Conference of 2013 14th IEEE International Conference on Computer Vision, ICCV 2013 ; Conference Dat...
Conference of 2013 14th IEEE International Conference on Computer Vision, ICCV 2013 ; Conference Dat...
Human action recognition is challenging mainly due to intro-variety, inter-ambiguity and clutter bac...
Conference of 2013 14th IEEE International Conference on Computer Vision, ICCV 2013 ; Conference Dat...
With the availability of cheap video recording devices, fast internet access and huge storage spaces...
With the availability of cheap video recording devices, fast internet access and huge storage spaces...
Human actions are spatio-temporal patterns. A popular representation is to describe the action by fe...
We introduce a simple yet effective network that embeds a novel Discriminative Feature Pooling (DFP)...
Several spatiotemporal feature point detectors have been recently used in video analysis for action ...
Recognizing actions is one of the important challenges in computer vision with respect to video data...
International audienceWe address the problem of action recognition in unconstrained videos. We propo...
International audienceWe address the problem of action recognition in unconstrained videos. We propo...
This paper presents a novel framework for human action recognition based on salient object detection...
This paper presents a novel framework for human action recognition based on salient object detection...
This paper presents a novel framework for human action recognition based on salient object detection...
Conference of 2013 14th IEEE International Conference on Computer Vision, ICCV 2013 ; Conference Dat...
Conference of 2013 14th IEEE International Conference on Computer Vision, ICCV 2013 ; Conference Dat...
Human action recognition is challenging mainly due to intro-variety, inter-ambiguity and clutter bac...
Conference of 2013 14th IEEE International Conference on Computer Vision, ICCV 2013 ; Conference Dat...
With the availability of cheap video recording devices, fast internet access and huge storage spaces...
With the availability of cheap video recording devices, fast internet access and huge storage spaces...