This paper tackles the problem of localizing actions in long untrimmed videos. Different from existing works, which all use annotated untrimmed videos during training, we learn only from short trimmed videos. This enables learning from large-scale datasets originally designed for action classification. We propose a method to train an action localization network that segments a video into interpretable fragments, we call ActionBytes. Our method jointly learns to cluster ActionBytes and trains the localization network using the cluster assignments as pseudo-labels. By doing so, we train on short trimmed videos that become untrimmed for ActionBytes. In isolation, or when merged, the ActionBytes also serve as effective action proposals. Experim...
This paper addresses the problem of automatic temporal annotation of realistic human actions in vide...
This paper tackles the problem of spatio-temporal action localization in a video, without assuming t...
We present a novel probabilistic model for recognizing actions by identifying and extracting informa...
Learning to localize actions in long, cluttered, and untrimmed videos is a hard task, that in the li...
In this article, we proposed a technique for action localization and recognition from long untrimmed...
This paper is the first to address the problem of unsupervised action localization in videos. Given ...
This paper strives for spatio-temporal localization of human actions in videos. In the literature, t...
Human behavior understanding is a fundamental problem of computer vision. It is an important compone...
The goal of this paper is to determine the spatio-temporal location of actions in video. Where train...
We address the problem of fine-grained action localization from temporally untrimmed web videos. We ...
This paper strives to localize the temporal extent of an action in a long untrimmed video. Where exi...
This paper presents a computationally efficient approach for temporal action detection in untrimmed ...
Weakly supervised action recognition and localization for untrimmed videos is a challenging problem ...
We present a novel probabilistic model for recognizing actions by identifying and extracting informa...
We present a novel probabilistic model for recognizing actions by identifying and extracting informa...
This paper addresses the problem of automatic temporal annotation of realistic human actions in vide...
This paper tackles the problem of spatio-temporal action localization in a video, without assuming t...
We present a novel probabilistic model for recognizing actions by identifying and extracting informa...
Learning to localize actions in long, cluttered, and untrimmed videos is a hard task, that in the li...
In this article, we proposed a technique for action localization and recognition from long untrimmed...
This paper is the first to address the problem of unsupervised action localization in videos. Given ...
This paper strives for spatio-temporal localization of human actions in videos. In the literature, t...
Human behavior understanding is a fundamental problem of computer vision. It is an important compone...
The goal of this paper is to determine the spatio-temporal location of actions in video. Where train...
We address the problem of fine-grained action localization from temporally untrimmed web videos. We ...
This paper strives to localize the temporal extent of an action in a long untrimmed video. Where exi...
This paper presents a computationally efficient approach for temporal action detection in untrimmed ...
Weakly supervised action recognition and localization for untrimmed videos is a challenging problem ...
We present a novel probabilistic model for recognizing actions by identifying and extracting informa...
We present a novel probabilistic model for recognizing actions by identifying and extracting informa...
This paper addresses the problem of automatic temporal annotation of realistic human actions in vide...
This paper tackles the problem of spatio-temporal action localization in a video, without assuming t...
We present a novel probabilistic model for recognizing actions by identifying and extracting informa...