Three-dimensional convolutional neural networks (3D CNNs) have been explored to learn spatio-temporal information for video-based human action recognition. Expensive computational cost and memory demand resulted from standard 3D CNNs, however, hinder their application in practical scenarios. In this article, we address the aforementioned limitations by proposing a novel dual 3-D convolutional network (D3DNet) with two complementary lightweight branches. A coarse branch maintains large temporal receptive field by a fast temporal downsampling strategy and simulates the expensive 3-D convolutions using a combination of more efficient spatial convolutions and temporal convolutions. Meanwhile, a fine branch progressively downsamples the video in...
3-D convolutional neural networks (3-D-convNets) have been very recently proposed for action recogni...
Human action recognition is attempting to identify what kind of action is being performed in a given...
In this work, the authors propose several techniques for accelerating a modern action recognition pi...
Effective processing of video input is essential for the recognition of temporally varying events su...
Effective processing of video input is essential for the recognition of temporally varying events su...
Effective processing of video input is essential for the recognition of temporally varying events su...
Effective processing of video input is essential for the recognition of temporally varying events su...
Video-based action recognition with deep neural networks has shown remarkable progress. However, mos...
Effective processing of video input is essential for the recognition of temporally varying events su...
Effective processing of video input is essential for the recognition of temporally varying events su...
This paper introduces a fusion convolutional architecture for efficient learning of spatio-temporal ...
This paper introduces a fusion convolutional architecture for efficient learning of spatio-temporal ...
MasterThe recent 3D convolutional neural network (3D-CNN) is a promising candidate for solving the a...
Action recognition requires the accurate analysis of action elements in the form of a video clip and...
Human action recognition with color and depth sensors has received increasing attention in image pro...
3-D convolutional neural networks (3-D-convNets) have been very recently proposed for action recogni...
Human action recognition is attempting to identify what kind of action is being performed in a given...
In this work, the authors propose several techniques for accelerating a modern action recognition pi...
Effective processing of video input is essential for the recognition of temporally varying events su...
Effective processing of video input is essential for the recognition of temporally varying events su...
Effective processing of video input is essential for the recognition of temporally varying events su...
Effective processing of video input is essential for the recognition of temporally varying events su...
Video-based action recognition with deep neural networks has shown remarkable progress. However, mos...
Effective processing of video input is essential for the recognition of temporally varying events su...
Effective processing of video input is essential for the recognition of temporally varying events su...
This paper introduces a fusion convolutional architecture for efficient learning of spatio-temporal ...
This paper introduces a fusion convolutional architecture for efficient learning of spatio-temporal ...
MasterThe recent 3D convolutional neural network (3D-CNN) is a promising candidate for solving the a...
Action recognition requires the accurate analysis of action elements in the form of a video clip and...
Human action recognition with color and depth sensors has received increasing attention in image pro...
3-D convolutional neural networks (3-D-convNets) have been very recently proposed for action recogni...
Human action recognition is attempting to identify what kind of action is being performed in a given...
In this work, the authors propose several techniques for accelerating a modern action recognition pi...