International audienceCurrent state-of-the-art approaches for spatio-temporal action detection rely on detections at the frame level that are then linked or tracked across time. In this paper, we leverage the temporal continuity of videos instead of operating at the frame level. We propose the ACtion Tubelet detector (ACT-detector) that takes as input a sequence of frames and outputs tubelets, ie., sequences of bounding boxes with associated scores. The same way state-of-the-art object detectors rely on anchor boxes, our ACT-detector is based on anchor cuboids. We build upon the state-of-the-art SSD framework. Convolutional features are extracted for each frame, while scores and regressions are based on the temporal stacking of these featu...
This thesis tackles the spatiotemporal action detection problem from an online, efficient, and real-...
This thesis tackles the spatiotemporal action detection problem from an online, efficient, and real-...
This thesis tackles the spatiotemporal action detection problem from an online, efficient, and real-...
International audienceCurrent state-of-the-art approaches for spatio-temporal action detection rely ...
International audienceCurrent state-of-the-art approaches for spatio-temporal action detection rely ...
Inspired by the recent spatio-temporal action localization efforts with tubelets (sequences of bound...
Inspired by the recent spatio-temporal action localization efforts with tubelets (sequences of bound...
The rise of deep learning has facilitated remarkable progress in video understanding. This thesis ad...
The rise of deep learning has facilitated remarkable progress in video understanding. This thesis ad...
International audienceThis paper considers the problem of action localization, where the objective i...
International audienceThis paper considers the problem of action localization, where the objective i...
International audienceThis paper considers the problem of action localization, where the objective i...
In this work, we propose an approach to the spatiotemporal localisation (detection) and classificati...
International audienceWe propose an effective approach for action localization, both in the spatial ...
International audienceWe propose an effective approach for action localization, both in the spatial ...
This thesis tackles the spatiotemporal action detection problem from an online, efficient, and real-...
This thesis tackles the spatiotemporal action detection problem from an online, efficient, and real-...
This thesis tackles the spatiotemporal action detection problem from an online, efficient, and real-...
International audienceCurrent state-of-the-art approaches for spatio-temporal action detection rely ...
International audienceCurrent state-of-the-art approaches for spatio-temporal action detection rely ...
Inspired by the recent spatio-temporal action localization efforts with tubelets (sequences of bound...
Inspired by the recent spatio-temporal action localization efforts with tubelets (sequences of bound...
The rise of deep learning has facilitated remarkable progress in video understanding. This thesis ad...
The rise of deep learning has facilitated remarkable progress in video understanding. This thesis ad...
International audienceThis paper considers the problem of action localization, where the objective i...
International audienceThis paper considers the problem of action localization, where the objective i...
International audienceThis paper considers the problem of action localization, where the objective i...
In this work, we propose an approach to the spatiotemporal localisation (detection) and classificati...
International audienceWe propose an effective approach for action localization, both in the spatial ...
International audienceWe propose an effective approach for action localization, both in the spatial ...
This thesis tackles the spatiotemporal action detection problem from an online, efficient, and real-...
This thesis tackles the spatiotemporal action detection problem from an online, efficient, and real-...
This thesis tackles the spatiotemporal action detection problem from an online, efficient, and real-...