In this paper, we propose a weakly supervised temporal action localization method on untrimmed videos based on prototypical networks. We observe two challenges posed by weakly supervision, namely action-background separation and action relation construction. Unlike the previous method, we propose to achieve action-background separation only by the original videos. To achieve this, a clustering loss is adopted to separate actions from backgrounds and learn intra-compact features, which helps in detecting complete action instances. Besides, a similarity weighting module is devised to further separate actions from backgrounds. To effectively identify actions, we propose to construct relations among actions for prototype learning. A GCN-based p...
In this paper, we propose a two-stage temporal proposal algorithm for the action detection task of l...
This paper strives to localize the temporal extent of an action in a long untrimmed video. Where exi...
We address the problem of fine-grained action localization from temporally untrimmed web videos. We ...
Learning to localize actions in long, cluttered, and untrimmed videos is a hard task, that in the li...
The state-of-the-art of fully-supervised methods for temporal action localization from untrimmed vid...
Deep Learning (DL) based method for analysing dynamic graphical data has been a vital part of emergi...
This paper presents a computationally efficient approach for temporal action detection in untrimmed ...
Weakly supervised action recognition and localization for untrimmed videos is a challenging problem ...
Weakly-supervised temporal action localization is a very challenging problem because frame-wise labe...
Weakly-supervised temporal action localization is a very challenging problem because frame-wise labe...
This paper tackles the problem of localizing actions in long untrimmed videos. Different from existi...
In this paper, we propose a two-stage temporal proposal algorithm for the action detection task of l...
In this paper, we propose a two-stage temporal proposal algorithm for the action detection task of l...
Part 5: Classification - ClusteringInternational audienceWeakly-supervised temporal action localizat...
In this paper, we propose a two-stage temporal proposal algorithm for the action detection task of l...
In this paper, we propose a two-stage temporal proposal algorithm for the action detection task of l...
This paper strives to localize the temporal extent of an action in a long untrimmed video. Where exi...
We address the problem of fine-grained action localization from temporally untrimmed web videos. We ...
Learning to localize actions in long, cluttered, and untrimmed videos is a hard task, that in the li...
The state-of-the-art of fully-supervised methods for temporal action localization from untrimmed vid...
Deep Learning (DL) based method for analysing dynamic graphical data has been a vital part of emergi...
This paper presents a computationally efficient approach for temporal action detection in untrimmed ...
Weakly supervised action recognition and localization for untrimmed videos is a challenging problem ...
Weakly-supervised temporal action localization is a very challenging problem because frame-wise labe...
Weakly-supervised temporal action localization is a very challenging problem because frame-wise labe...
This paper tackles the problem of localizing actions in long untrimmed videos. Different from existi...
In this paper, we propose a two-stage temporal proposal algorithm for the action detection task of l...
In this paper, we propose a two-stage temporal proposal algorithm for the action detection task of l...
Part 5: Classification - ClusteringInternational audienceWeakly-supervised temporal action localizat...
In this paper, we propose a two-stage temporal proposal algorithm for the action detection task of l...
In this paper, we propose a two-stage temporal proposal algorithm for the action detection task of l...
This paper strives to localize the temporal extent of an action in a long untrimmed video. Where exi...
We address the problem of fine-grained action localization from temporally untrimmed web videos. We ...