In this work, we focus on semi-supervised learning for video action detection which utilizes both labeled as well as unlabeled data. We propose a simple end-to-end consistency based approach which effectively utilizes the unlabeled data. Video action detection requires both, action class prediction as well as a spatio-temporal localization of actions. Therefore, we investigate two types of constraints, classification consistency, and spatio-temporal consistency. The presence of predominant background and static regions in a video makes it challenging to utilize spatio-temporal consistency for action detection. To address this, we propose two novel regularization constraints for spatio-temporal consistency; 1) temporal coherency, and 2) grad...
Temporal action detection in long, untrimmed videos is an important yet challenging task that requir...
Temporal action detection in long, untrimmed videos is an important yet challenging task that requir...
This paper addresses the problem of automatic temporal annotation of realistic human actions in vide...
International audienceSpatio-temporal action detection in videos is typically addressed in a fully-s...
This paper strives for spatio-temporal localization of human actions in videos. In the literature, t...
The rise of deep learning has facilitated remarkable progress in video understanding. This thesis ad...
In recent years, many research works have been car-ried out to recognize human actions from video cl...
Automatically recognizing and localizing wide ranges of human actions are crucial for video understa...
Human action recognition in videos draws strong research interest in computer vision because of its ...
The rise of deep learning has facilitated remarkable progress in video understanding. This thesis ad...
Detecting and recognizing human actions is of great importance to video analytics due to its numerou...
Understanding human actions in videos is of great interest in various scenarios ranging from surveil...
Human behavior understanding is a fundamental problem of computer vision. It is an important compone...
Abstract—Actions are spatio-temporal patterns. Similar to the slid-ing window-based object detection...
International audienceAutomatically recognizing and localizing wide ranges of human actions are cruc...
Temporal action detection in long, untrimmed videos is an important yet challenging task that requir...
Temporal action detection in long, untrimmed videos is an important yet challenging task that requir...
This paper addresses the problem of automatic temporal annotation of realistic human actions in vide...
International audienceSpatio-temporal action detection in videos is typically addressed in a fully-s...
This paper strives for spatio-temporal localization of human actions in videos. In the literature, t...
The rise of deep learning has facilitated remarkable progress in video understanding. This thesis ad...
In recent years, many research works have been car-ried out to recognize human actions from video cl...
Automatically recognizing and localizing wide ranges of human actions are crucial for video understa...
Human action recognition in videos draws strong research interest in computer vision because of its ...
The rise of deep learning has facilitated remarkable progress in video understanding. This thesis ad...
Detecting and recognizing human actions is of great importance to video analytics due to its numerou...
Understanding human actions in videos is of great interest in various scenarios ranging from surveil...
Human behavior understanding is a fundamental problem of computer vision. It is an important compone...
Abstract—Actions are spatio-temporal patterns. Similar to the slid-ing window-based object detection...
International audienceAutomatically recognizing and localizing wide ranges of human actions are cruc...
Temporal action detection in long, untrimmed videos is an important yet challenging task that requir...
Temporal action detection in long, untrimmed videos is an important yet challenging task that requir...
This paper addresses the problem of automatic temporal annotation of realistic human actions in vide...