Action recognition and pose estimation from video are closely related tasks for understanding human motion, most methods, however, learn separate models and combine them sequentially. In this paper, we propose a framework to in-tegrate training and testing of the two tasks. A spatial-temporal And-Or graph model is introduced to represent ac-tion at three scales. Specifically the action is decomposed into poses which are further divided to mid-level ST-parts and then parts. The hierarchical structure of our model captures the geometric and appearance variations of pose at each frame and lateral connections between ST-parts at adjacent frames capture the action-specific motion informa-tion. The model parameters for three scales are learned di...
Detecting actions in videos is still a demanding task due to large intra-class variation caused by v...
A grand challenge of computer vision is to enable machines to ``see people\u27\u27. A solution to th...
Human action recognition aims to classify a given video according to which type of action it contain...
In the community of computer vision, human pose estimation and human action recognition are two clas...
We address action recognition in videos by modeling the spatial-temporal structures of human poses. ...
Action recognition from videos has many potential applications. However, there are many unresolved c...
International audienceActivity recognition in video sequences is a difficult problem due to the comp...
This paper presents a unified framework for recognizing human action in video using human pose estim...
Abstract. Action detection is of great importance in understanding human motion from video. Compared...
Abstract Action recognition and pose estimation are two closely related topics in understanding huma...
Abstract. Action recognition from 3d pose data has gained increasing attention since the data is rea...
International audienceMost state-of-the-art methods for action recognition rely on a two-stream arch...
We present a novel method for learning human motion models from unsegmented videos. We propose a uni...
Action recognition from 3d pose data has gained increasing attention since the data is readily avail...
International audienceHuman action recognition in videos is still an important while challenging tas...
Detecting actions in videos is still a demanding task due to large intra-class variation caused by v...
A grand challenge of computer vision is to enable machines to ``see people\u27\u27. A solution to th...
Human action recognition aims to classify a given video according to which type of action it contain...
In the community of computer vision, human pose estimation and human action recognition are two clas...
We address action recognition in videos by modeling the spatial-temporal structures of human poses. ...
Action recognition from videos has many potential applications. However, there are many unresolved c...
International audienceActivity recognition in video sequences is a difficult problem due to the comp...
This paper presents a unified framework for recognizing human action in video using human pose estim...
Abstract. Action detection is of great importance in understanding human motion from video. Compared...
Abstract Action recognition and pose estimation are two closely related topics in understanding huma...
Abstract. Action recognition from 3d pose data has gained increasing attention since the data is rea...
International audienceMost state-of-the-art methods for action recognition rely on a two-stream arch...
We present a novel method for learning human motion models from unsegmented videos. We propose a uni...
Action recognition from 3d pose data has gained increasing attention since the data is readily avail...
International audienceHuman action recognition in videos is still an important while challenging tas...
Detecting actions in videos is still a demanding task due to large intra-class variation caused by v...
A grand challenge of computer vision is to enable machines to ``see people\u27\u27. A solution to th...
Human action recognition aims to classify a given video according to which type of action it contain...