Efficiency is an important issue in designing video architectures for action recognition. 3D CNNs have witnessed remarkable progress in action recognition from videos. However, compared with their 2D counterparts, 3D convolutions often introduce a large amount of parameters and cause high computational cost. To relieve this problem, we propose an efficient temporal module, termed as Temporal Enhancement-and-Interaction (TEI Module), which could be plugged into the existing 2D CNNs (denoted by TEINet). The TEI module presents a different paradigm to learn temporal features by decoupling the modeling of channel correlation and temporal interaction. First, it contains a Motion Enhanced Module (MEM) which is to enhance the motion-related featur...
The paucity of videos in current action classification datasets (UCF-101 and HMDB-51) has made it di...
Human activity recognition in videos with convolutional neural network (CNN) features has received i...
As a sub-field of video content analysis, action recognition has received extensive attention in rec...
Efficiency is an important issue in designing video architectures for action recognition. 3D CNNs ha...
This thesis focuses on video understanding for human action and interaction recognition. We start by...
This paper introduces a fusion convolutional architecture for efficient learning of spatio-temporal ...
This paper introduces a fusion convolutional architecture for efficient learning of spatio-temporal ...
Spatiotemporal and motion feature representations are the key to video action recognition. Typical p...
Video-based action recognition with deep neural networks has shown remarkable progress. However, mos...
IEEE The explosive growth in video streaming requires video understanding at high accuracy and low c...
MasterThis thesis proposes the mixed temporal kernel depthwise-separable convolution network that ex...
Research for computation-efficient video understanding is of great importance to real-world deployme...
In this dissertation, I present my work towards exploring temporal information for better video unde...
Effective processing of video input is essential for the recognition of temporally varying events su...
In recent years, deep learning techniques have excelled in video action recognition. However, curren...
The paucity of videos in current action classification datasets (UCF-101 and HMDB-51) has made it di...
Human activity recognition in videos with convolutional neural network (CNN) features has received i...
As a sub-field of video content analysis, action recognition has received extensive attention in rec...
Efficiency is an important issue in designing video architectures for action recognition. 3D CNNs ha...
This thesis focuses on video understanding for human action and interaction recognition. We start by...
This paper introduces a fusion convolutional architecture for efficient learning of spatio-temporal ...
This paper introduces a fusion convolutional architecture for efficient learning of spatio-temporal ...
Spatiotemporal and motion feature representations are the key to video action recognition. Typical p...
Video-based action recognition with deep neural networks has shown remarkable progress. However, mos...
IEEE The explosive growth in video streaming requires video understanding at high accuracy and low c...
MasterThis thesis proposes the mixed temporal kernel depthwise-separable convolution network that ex...
Research for computation-efficient video understanding is of great importance to real-world deployme...
In this dissertation, I present my work towards exploring temporal information for better video unde...
Effective processing of video input is essential for the recognition of temporally varying events su...
In recent years, deep learning techniques have excelled in video action recognition. However, curren...
The paucity of videos in current action classification datasets (UCF-101 and HMDB-51) has made it di...
Human activity recognition in videos with convolutional neural network (CNN) features has received i...
As a sub-field of video content analysis, action recognition has received extensive attention in rec...