As an important branch of video analysis, human action recognition has attracted extensive research attention in computer vision and artificial intelligence communities. In this paper, we propose to model the temporal evolution of multi-temporal-scale atoms for action recognition. An action can be considered as a temporal sequence of action units. These action units which we referred to as action atoms, can capture the key semantic and characteristic spatiotemporal features of actions in different temporal scales. We first investigate Res3D, a powerful 3D CNN architecture and create the variants of Res3D for different temporal scale. In each temporal scale, we design some practices to transfer the knowledge learned from RGB to optical flow ...
<p>Recognizing human actions in videos is a challenging problem owning to complex motion appearance,...
International audienceIn this paper, we introduce a new approach for Activities of Daily Living (ADL...
This paper introduces a fusion convolutional architecture for efficient learning of spatio-temporal ...
This thesis focuses on video understanding for human action and interaction recognition. We start by...
Recognizing actions according to video features is an important problem in a wide scope of applicati...
Spatiotemporal and motion feature representations are the key to video action recognition. Typical p...
Video-based action recognition with deep neural networks has shown remarkable progress. However, mos...
Action recognition requires the accurate analysis of action elements in the form of a video clip and...
Human action recognition plays a crucial role in various applications, including video surveillance,...
This thesis contributes to the literature of understanding and recognizing human activities in video...
International audienceTypical human actions last several seconds and exhibit characteristic spatio-t...
From wearable devices to depth cameras, researchers have exploited various multimodal data to recogn...
Automatic action and gesture recognition research field has growth in interest over the last few yea...
Human action recognition is an important task in computer vision. Extracting discriminative spatial ...
This paper introduces a fusion convolutional architecture for efficient learning of spatio-temporal ...
<p>Recognizing human actions in videos is a challenging problem owning to complex motion appearance,...
International audienceIn this paper, we introduce a new approach for Activities of Daily Living (ADL...
This paper introduces a fusion convolutional architecture for efficient learning of spatio-temporal ...
This thesis focuses on video understanding for human action and interaction recognition. We start by...
Recognizing actions according to video features is an important problem in a wide scope of applicati...
Spatiotemporal and motion feature representations are the key to video action recognition. Typical p...
Video-based action recognition with deep neural networks has shown remarkable progress. However, mos...
Action recognition requires the accurate analysis of action elements in the form of a video clip and...
Human action recognition plays a crucial role in various applications, including video surveillance,...
This thesis contributes to the literature of understanding and recognizing human activities in video...
International audienceTypical human actions last several seconds and exhibit characteristic spatio-t...
From wearable devices to depth cameras, researchers have exploited various multimodal data to recogn...
Automatic action and gesture recognition research field has growth in interest over the last few yea...
Human action recognition is an important task in computer vision. Extracting discriminative spatial ...
This paper introduces a fusion convolutional architecture for efficient learning of spatio-temporal ...
<p>Recognizing human actions in videos is a challenging problem owning to complex motion appearance,...
International audienceIn this paper, we introduce a new approach for Activities of Daily Living (ADL...
This paper introduces a fusion convolutional architecture for efficient learning of spatio-temporal ...