We address the problem of fine-grained action localization from temporally untrimmed web videos. We assume that only weak video-level annotations are available for training. The goal is to use these weak labels to identify temporal segments corresponding to the actions, and learn models that generalize to unconstrained web videos. We find that web images queried by action names serve as well-localized highlights for many actions, but are noisily labeled. To solve this problem, we propose a simple yet effective method that takes weak video labels and noisy image labels as in-put, and generates localized action frames as output. This is achieved by cross-domain transfer between video frames and web images, using pre-trained deep convolutional...
We present a novel probabilistic model for recognizing actions by identifying and extracting informa...
We present a novel probabilistic model for recognizing actions by identifying and extracting informa...
The goal of this paper is to determine the spatio-temporal location of actions in video. Where train...
This paper tackles the problem of spatio-temporal action localization in a video, without assuming t...
Learning to localize actions in long, cluttered, and untrimmed videos is a hard task, that in the li...
In this thesis, weakly-supervised temporal activity localization and classification is considered wi...
In this thesis, weakly-supervised temporal activity localization and classification is considered wi...
This paper tackles the problem of localizing actions in long untrimmed videos. Different from existi...
In this thesis the problem of automatic human action recognition and localization in videos is studi...
Deep Learning (DL) based method for analysing dynamic graphical data has been a vital part of emergi...
This paper presents a computationally efficient approach for temporal action detection in untrimmed ...
This paper strives for spatio-temporal localization of human actions in videos. In the literature, t...
Human behavior understanding is a fundamental problem of computer vision. It is an important compone...
This paper is the first to address the problem of unsupervised action localization in videos. Given ...
This paper strives to localize the temporal extent of an action in a long untrimmed video. Where exi...
We present a novel probabilistic model for recognizing actions by identifying and extracting informa...
We present a novel probabilistic model for recognizing actions by identifying and extracting informa...
The goal of this paper is to determine the spatio-temporal location of actions in video. Where train...
This paper tackles the problem of spatio-temporal action localization in a video, without assuming t...
Learning to localize actions in long, cluttered, and untrimmed videos is a hard task, that in the li...
In this thesis, weakly-supervised temporal activity localization and classification is considered wi...
In this thesis, weakly-supervised temporal activity localization and classification is considered wi...
This paper tackles the problem of localizing actions in long untrimmed videos. Different from existi...
In this thesis the problem of automatic human action recognition and localization in videos is studi...
Deep Learning (DL) based method for analysing dynamic graphical data has been a vital part of emergi...
This paper presents a computationally efficient approach for temporal action detection in untrimmed ...
This paper strives for spatio-temporal localization of human actions in videos. In the literature, t...
Human behavior understanding is a fundamental problem of computer vision. It is an important compone...
This paper is the first to address the problem of unsupervised action localization in videos. Given ...
This paper strives to localize the temporal extent of an action in a long untrimmed video. Where exi...
We present a novel probabilistic model for recognizing actions by identifying and extracting informa...
We present a novel probabilistic model for recognizing actions by identifying and extracting informa...
The goal of this paper is to determine the spatio-temporal location of actions in video. Where train...