In this paper, we propose an approach to classify action sequences. We observe that in action sequences the critical features for discriminating between actions occur only within sub-regions of the image. Hence deep network approaches will address the entire image are at a disadvantage. This motivates our strategy which uses static and spatio-temporal visual cues to isolate static and spatio-temporal regions of interest (ROIs). We then use weakly supervised learning to train deep network classifiers using the ROIs as input. More specifically, we combine multiple instance learning (MIL) with convolutional neural networks (CNNs) to select discriminative action cues. This yields classifiers for static images, using the static ROIs, as well as ...
In recent years, deep learning techniques have excelled in video action recognition. However, curren...
Human action recognition plays a crucial role in various applications, including video surveillance,...
This paper proposes a new method, i.e., weighted hierarchical depth motion maps (WHDMM) + three-chan...
Action recognition requires the accurate analysis of action elements in the form of a video clip and...
In this article, a hierarchical method for action recognition based on temporal and spatial features...
Deep convolutional network models have dominated recent work in human action recognition as well as ...
<p>Recognizing human actions in videos is a challenging problem owning to complex motion appearance,...
In this study, we investigate the problem of automatic action recognition and classification of vid...
A common approach to human action recognition from still images consists in computing local descript...
This paper introduces a fusion convolutional architecture for efficient learning of spatio-temporal ...
This paper introduces a fusion convolutional architecture for efficient learning of spatio-temporal ...
Common approaches to human action recognition from images rely on local descriptors for classificati...
This thesis focuses on video understanding for human action and interaction recognition. We start by...
Recognizing actions is one of the important challenges in computer vision with respect to video data...
Action recognition has been an active research topic for over three decades. There are various appli...
In recent years, deep learning techniques have excelled in video action recognition. However, curren...
Human action recognition plays a crucial role in various applications, including video surveillance,...
This paper proposes a new method, i.e., weighted hierarchical depth motion maps (WHDMM) + three-chan...
Action recognition requires the accurate analysis of action elements in the form of a video clip and...
In this article, a hierarchical method for action recognition based on temporal and spatial features...
Deep convolutional network models have dominated recent work in human action recognition as well as ...
<p>Recognizing human actions in videos is a challenging problem owning to complex motion appearance,...
In this study, we investigate the problem of automatic action recognition and classification of vid...
A common approach to human action recognition from still images consists in computing local descript...
This paper introduces a fusion convolutional architecture for efficient learning of spatio-temporal ...
This paper introduces a fusion convolutional architecture for efficient learning of spatio-temporal ...
Common approaches to human action recognition from images rely on local descriptors for classificati...
This thesis focuses on video understanding for human action and interaction recognition. We start by...
Recognizing actions is one of the important challenges in computer vision with respect to video data...
Action recognition has been an active research topic for over three decades. There are various appli...
In recent years, deep learning techniques have excelled in video action recognition. However, curren...
Human action recognition plays a crucial role in various applications, including video surveillance,...
This paper proposes a new method, i.e., weighted hierarchical depth motion maps (WHDMM) + three-chan...