Research for computation-efficient video understanding is of great importance to real-world deployment. However, most of high-performance approaches are too computationally expensive for practical application. Though several efficiency oriented works are proposed, they inevitably suffer degradation of performance in terms of accuracy. In this paper, we explore a new architecture EAC-Net, enjoying both high efficiency and high performance. Specifically, we propose Motion Guided Temporal Encode (MGTE) blocks for temporal modeling, which exploits motion information and temporal relations among neighbor frames. EAC-Net is then constructed by inserting multiple MGTE blocks to common 2D CNNs. Furthermore, we proposed Atrous Temporal Encode (ATE) ...
This thesis focuses on video understanding for human action and interaction recognition. We start by...
We address the challenge of learning good video representations by explicitly modeling the relations...
As a sub-field of video content analysis, action recognition has received extensive attention in rec...
Efficiency is an important issue in designing video architectures for action recognition. 3D CNNs ha...
Video-based action recognition with deep neural networks has shown remarkable progress. However, mos...
© 1991-2012 IEEE. Encouraged by the success of convolutional neural networks (CNNs) in image classif...
IEEE The explosive growth in video streaming requires video understanding at high accuracy and low c...
Despite the success of deep learning for static image understanding, it remains unclear what are the...
Spatiotemporal and motion feature representations are the key to video action recognition. Typical p...
The temporal events in video sequences often have long-term dependencies which are difficult to be h...
Encouraged by the success of Convolutional Neural Networks (CNNs) in image classification, recently ...
This paper introduces a fusion convolutional architecture for efficient learning of spatio-temporal ...
Effective processing of video input is essential for the recognition of temporally varying events su...
This paper introduces a fusion convolutional architecture for efficient learning of spatio-temporal ...
In this work, the authors propose several techniques for accelerating a modern action recognition pi...
This thesis focuses on video understanding for human action and interaction recognition. We start by...
We address the challenge of learning good video representations by explicitly modeling the relations...
As a sub-field of video content analysis, action recognition has received extensive attention in rec...
Efficiency is an important issue in designing video architectures for action recognition. 3D CNNs ha...
Video-based action recognition with deep neural networks has shown remarkable progress. However, mos...
© 1991-2012 IEEE. Encouraged by the success of convolutional neural networks (CNNs) in image classif...
IEEE The explosive growth in video streaming requires video understanding at high accuracy and low c...
Despite the success of deep learning for static image understanding, it remains unclear what are the...
Spatiotemporal and motion feature representations are the key to video action recognition. Typical p...
The temporal events in video sequences often have long-term dependencies which are difficult to be h...
Encouraged by the success of Convolutional Neural Networks (CNNs) in image classification, recently ...
This paper introduces a fusion convolutional architecture for efficient learning of spatio-temporal ...
Effective processing of video input is essential for the recognition of temporally varying events su...
This paper introduces a fusion convolutional architecture for efficient learning of spatio-temporal ...
In this work, the authors propose several techniques for accelerating a modern action recognition pi...
This thesis focuses on video understanding for human action and interaction recognition. We start by...
We address the challenge of learning good video representations by explicitly modeling the relations...
As a sub-field of video content analysis, action recognition has received extensive attention in rec...