We first propose a new spatio-temporal context distri-bution feature of interest points for human action recog-nition. Each action video is expressed as a set of relative XYT coordinates between pairwise interest points in a lo-cal region. We learn a global GMM (referred to as Uni-versal Background Model, UBM) using the relative coordi-nate features from all the training videos, and then repre-sent each video as the normalized parameters of a video-specific GMM adapted from the global GMM. In order to capture the spatio-temporal relationships at different lev-els, multiple GMMs are utilized to describe the context dis-tributions of interest points over multi-scale local regions. To describe the appearance information of an action video, we ...
Automated human action recognition plays a critical role in the development of human-machine communi...
This thesis focuses on video understanding for human action and interaction recognition. We start by...
This paper considers the recognition of realistic human actions in videos based on spatio-temporal i...
The performance of action recognition in video sequences depends significantly on the representation...
The human action classification task is a widely researched topic and is still an open problem. Many...
Common approaches to human action recognition from images rely on local descriptors for classificati...
How should a video be represented? We propose a new representation for videos based on mid-level dis...
Human action recognition is challenging mainly due to intro-variety, inter-ambiguity and clutter bac...
In this paper we propose a novel framework for action recognition based on multiple features for imp...
This paper presents a generic method for recognising and localising human actions in video based sol...
"Actions in the wild" is the term given to examples of human motion that are performed in natural se...
This paper deals with human action classification by utilizing spatio-temporal (ST) co-occurrences b...
In this paper, we present a novel approach for automatically learning a compact and yet discriminati...
We present a novel model for human action categoriza-tion. A video sequence is represented as a coll...
A common approach to human action recognition from still images consists in computing local descript...
Automated human action recognition plays a critical role in the development of human-machine communi...
This thesis focuses on video understanding for human action and interaction recognition. We start by...
This paper considers the recognition of realistic human actions in videos based on spatio-temporal i...
The performance of action recognition in video sequences depends significantly on the representation...
The human action classification task is a widely researched topic and is still an open problem. Many...
Common approaches to human action recognition from images rely on local descriptors for classificati...
How should a video be represented? We propose a new representation for videos based on mid-level dis...
Human action recognition is challenging mainly due to intro-variety, inter-ambiguity and clutter bac...
In this paper we propose a novel framework for action recognition based on multiple features for imp...
This paper presents a generic method for recognising and localising human actions in video based sol...
"Actions in the wild" is the term given to examples of human motion that are performed in natural se...
This paper deals with human action classification by utilizing spatio-temporal (ST) co-occurrences b...
In this paper, we present a novel approach for automatically learning a compact and yet discriminati...
We present a novel model for human action categoriza-tion. A video sequence is represented as a coll...
A common approach to human action recognition from still images consists in computing local descript...
Automated human action recognition plays a critical role in the development of human-machine communi...
This thesis focuses on video understanding for human action and interaction recognition. We start by...
This paper considers the recognition of realistic human actions in videos based on spatio-temporal i...