International audienceCurrent state-of-the-art approaches for spatio-temporal action detection rely on detections at the frame level that are then linked or tracked across time. In this paper, we leverage the temporal continuity of videos instead of operating at the frame level. We propose the ACtion Tubelet detector (ACT-detector) that takes as input a sequence of frames and outputs tubelets, ie., sequences of bounding boxes with associated scores. The same way state-of-the-art object detectors rely on anchor boxes, our ACT-detector is based on anchor cuboids. We build upon the state-of-the-art SSD framework. Convolutional features are extracted for each frame, while scores and regressions are based on the temporal stacking of these featu...
This paper considers the problem of localizing actions in videos as sequences of bounding boxes. The...
<p>This paper considers the problem of localizing actions in videos as sequences of bounding boxes. ...
International audienceThis paper considers the problem of action localization, where the objective i...
International audienceCurrent state-of-the-art approaches for spatio-temporal action detection rely ...
International audienceCurrent state-of-the-art approaches for spatio-temporal action detection rely ...
Inspired by the recent spatio-temporal action localization efforts with tubelets (sequences of bound...
Inspired by the recent spatio-temporal action localization efforts with tubelets (sequences of bound...
The rise of deep learning has facilitated remarkable progress in video understanding. This thesis ad...
The rise of deep learning has facilitated remarkable progress in video understanding. This thesis ad...
In this work, we propose an approach to the spatiotemporal localisation (detection) and classificati...
This paper considers the problem of localizing actions in videos as sequences of bounding boxes. The...
In this work, we propose an approach to the spatiotemporal localisation (detection) and classificati...
In this work, we propose an approach to the spatiotemporal localisation (detection) and classificati...
International audienceThis paper considers the problem of localizing actions in videos as sequences ...
International audienceThis paper considers the problem of localizing actions in videos as sequences ...
This paper considers the problem of localizing actions in videos as sequences of bounding boxes. The...
<p>This paper considers the problem of localizing actions in videos as sequences of bounding boxes. ...
International audienceThis paper considers the problem of action localization, where the objective i...
International audienceCurrent state-of-the-art approaches for spatio-temporal action detection rely ...
International audienceCurrent state-of-the-art approaches for spatio-temporal action detection rely ...
Inspired by the recent spatio-temporal action localization efforts with tubelets (sequences of bound...
Inspired by the recent spatio-temporal action localization efforts with tubelets (sequences of bound...
The rise of deep learning has facilitated remarkable progress in video understanding. This thesis ad...
The rise of deep learning has facilitated remarkable progress in video understanding. This thesis ad...
In this work, we propose an approach to the spatiotemporal localisation (detection) and classificati...
This paper considers the problem of localizing actions in videos as sequences of bounding boxes. The...
In this work, we propose an approach to the spatiotemporal localisation (detection) and classificati...
In this work, we propose an approach to the spatiotemporal localisation (detection) and classificati...
International audienceThis paper considers the problem of localizing actions in videos as sequences ...
International audienceThis paper considers the problem of localizing actions in videos as sequences ...
This paper considers the problem of localizing actions in videos as sequences of bounding boxes. The...
<p>This paper considers the problem of localizing actions in videos as sequences of bounding boxes. ...
International audienceThis paper considers the problem of action localization, where the objective i...