In this paper, we present a unified representation based on the spatio-temporal steerable pyramid (STSP) for the holistic representation of human actions. A video sequence is viewed as a spatio-temporal volume preserving all the appearance and motion information of an action in it. By decomposing the spatio-temporal volumes into band-passed sub-volumes, the spatio-temporal Laplacian pyramid provides an effective technique for multi-scale analysis of video sequences, and spatio-temporal patterns with different scales could be well localized and captured. To efficiently explore the underlying local spatio-temporal orientation structures at multiple scales, a bank of three-dimensional separable steerable filters are conducted on each of the su...
We propose a novel method to model human actions by explicitly coding motion and structure features ...
In this paper we propose a new method for human action categorization by using an effective combinat...
In most of the existing work for activity recognition, 3D ConvNets show promising performance for le...
In this paper, we propose a novel holistic representation based on the spatio-temporal steerable pyr...
We present a novel descriptor, called spatio-temporal Laplacian pyramid coding (STLPC), for holistic...
Abstract—We present a novel descriptor, called spatio-temporal Laplacian pyramid coding (STLPC), for...
Historically, researchers in the field have spent a great deal of effort to create image representat...
Abstract—This paper provides a unified framework for the interrelated topics of action spotting, the...
In this paper, we present a new geometric-temporal representation for visual action recognition base...
The automatic analysis of video sequences with individuals performing some actions is currently rec...
Human action recognition, as one of the most important topics in computer vision, has been extensive...
In this paper we propose a novel method for human action recognition based on boosted key-frame sele...
Human action recognition has drawn much attention in the field of video analysis. In this paper, we ...
This paper introduces a method for human action recognition based on optical flow motion features ex...
This thesis presents a framework for automatic recognition of human actions in uncontrolled, realist...
We propose a novel method to model human actions by explicitly coding motion and structure features ...
In this paper we propose a new method for human action categorization by using an effective combinat...
In most of the existing work for activity recognition, 3D ConvNets show promising performance for le...
In this paper, we propose a novel holistic representation based on the spatio-temporal steerable pyr...
We present a novel descriptor, called spatio-temporal Laplacian pyramid coding (STLPC), for holistic...
Abstract—We present a novel descriptor, called spatio-temporal Laplacian pyramid coding (STLPC), for...
Historically, researchers in the field have spent a great deal of effort to create image representat...
Abstract—This paper provides a unified framework for the interrelated topics of action spotting, the...
In this paper, we present a new geometric-temporal representation for visual action recognition base...
The automatic analysis of video sequences with individuals performing some actions is currently rec...
Human action recognition, as one of the most important topics in computer vision, has been extensive...
In this paper we propose a novel method for human action recognition based on boosted key-frame sele...
Human action recognition has drawn much attention in the field of video analysis. In this paper, we ...
This paper introduces a method for human action recognition based on optical flow motion features ex...
This thesis presents a framework for automatic recognition of human actions in uncontrolled, realist...
We propose a novel method to model human actions by explicitly coding motion and structure features ...
In this paper we propose a new method for human action categorization by using an effective combinat...
In most of the existing work for activity recognition, 3D ConvNets show promising performance for le...