Inspired by the recent spatio-temporal action localization efforts with tubelets (sequences of bounding boxes), we present a new spatio-temporal action localization detector Segment-tube, which consists of sequences of per-frame segmentation masks. The proposed Segment-tube detector can temporally pinpoint the starting/ending frame of each action category in the presence of preceding/subsequent interference actions in untrimmed videos. Simultaneously, the Segment-tube detector produces per-frame segmentation masks instead of bounding boxes, offering superior spatial accuracy to tubelets. This is achieved by alternating iterative optimization between temporal action localization and spatial action segmentation. Experimental results on three ...
International audienceThis paper considers the problem of action localization, where the objective i...
International audienceThis paper considers the problem of action localization, where the objective i...
This paper strives for spatio-temporal localization of human actions in videos. In the literature, t...
Inspired by the recent spatio-temporal action localization efforts with tubelets (sequences of bound...
International audienceCurrent state-of-the-art approaches for spatio-temporal action detection rely ...
International audienceCurrent state-of-the-art approaches for spatio-temporal action detection rely ...
This paper presents a computationally efficient approach for temporal action detection in untrimmed ...
Current state-of-the-art human action recognition is focused on the classification of temporally tri...
We propose a temporal action detection by spatial segmentation framework, which simultaneously categ...
This paper considers the problem of action localization, where the objective is to determine when an...
This paper considers the problem of localizing actions in videos as sequences of bounding boxes. The...
International audienceThis paper considers the problem of action localization, where the objective i...
International audienceCurrent state-of-the-art approaches for spatio-temporal action detection rely ...
International audienceThis paper considers the problem of action localization, where the objective i...
International audienceThis paper considers the problem of action localization, where the objective i...
International audienceThis paper considers the problem of action localization, where the objective i...
International audienceThis paper considers the problem of action localization, where the objective i...
This paper strives for spatio-temporal localization of human actions in videos. In the literature, t...
Inspired by the recent spatio-temporal action localization efforts with tubelets (sequences of bound...
International audienceCurrent state-of-the-art approaches for spatio-temporal action detection rely ...
International audienceCurrent state-of-the-art approaches for spatio-temporal action detection rely ...
This paper presents a computationally efficient approach for temporal action detection in untrimmed ...
Current state-of-the-art human action recognition is focused on the classification of temporally tri...
We propose a temporal action detection by spatial segmentation framework, which simultaneously categ...
This paper considers the problem of action localization, where the objective is to determine when an...
This paper considers the problem of localizing actions in videos as sequences of bounding boxes. The...
International audienceThis paper considers the problem of action localization, where the objective i...
International audienceCurrent state-of-the-art approaches for spatio-temporal action detection rely ...
International audienceThis paper considers the problem of action localization, where the objective i...
International audienceThis paper considers the problem of action localization, where the objective i...
International audienceThis paper considers the problem of action localization, where the objective i...
International audienceThis paper considers the problem of action localization, where the objective i...
This paper strives for spatio-temporal localization of human actions in videos. In the literature, t...