Feature ranking from video-wide temporal evolution brings reliable information for complex action recognition. However, a video may contain similar features in the sequence of frames which deliver unnecessary information to the ranking function. This paper proposes a method to improve the rankpooling strategy which captures the optimized latent structure of the video sequence data. The optimization is followed by removing the redundant features from the sequence data. The cosine and correlation distance metrics are employed to detect the identical features and extract the most efficient information from the video frames. Then, the ranked features are generated from the optimized and clean sequence data. The proposed improvement is easy to i...
This PhD research has proposed new machine learning techniques to improve human action recognition b...
We introduce a simple yet effective network that embeds a novel Discriminative Feature Pooling (DFP)...
Local video features provide state-of-the-art performance for action recognition. While the accuracy...
We propose a function-based temporal pooling method that captures the latent structure of the video ...
We propose a function-based temporal pooling method that captures the latent structure of the video ...
© 2015 IEEE. In this paper we present a method to capture video-wide temporal information for action...
Most video based action recognition approaches create the video-level representation by temporally p...
Rank pooling is a temporal encoding method that summarizes the dynamics of a video sequence to a sin...
Deep learning models for video-based action recognition usually generate features for short clips (c...
The field of Action Recognition has seen a large increase in activity in recent years. Much of the p...
We introduce the concept of dynamic image, a novel compact representation of videos useful for video...
We introduce the concept of dynamic image, a novel compact representation of videos useful for video...
Human action recognition is valuable for numerous practical applications, e.g., gaming, video survei...
In this paper we propose a novel method for human action recognition based on boosted key-frame sele...
Human action recognition in videos draws strong research interest in computer vision because of its ...
This PhD research has proposed new machine learning techniques to improve human action recognition b...
We introduce a simple yet effective network that embeds a novel Discriminative Feature Pooling (DFP)...
Local video features provide state-of-the-art performance for action recognition. While the accuracy...
We propose a function-based temporal pooling method that captures the latent structure of the video ...
We propose a function-based temporal pooling method that captures the latent structure of the video ...
© 2015 IEEE. In this paper we present a method to capture video-wide temporal information for action...
Most video based action recognition approaches create the video-level representation by temporally p...
Rank pooling is a temporal encoding method that summarizes the dynamics of a video sequence to a sin...
Deep learning models for video-based action recognition usually generate features for short clips (c...
The field of Action Recognition has seen a large increase in activity in recent years. Much of the p...
We introduce the concept of dynamic image, a novel compact representation of videos useful for video...
We introduce the concept of dynamic image, a novel compact representation of videos useful for video...
Human action recognition is valuable for numerous practical applications, e.g., gaming, video survei...
In this paper we propose a novel method for human action recognition based on boosted key-frame sele...
Human action recognition in videos draws strong research interest in computer vision because of its ...
This PhD research has proposed new machine learning techniques to improve human action recognition b...
We introduce a simple yet effective network that embeds a novel Discriminative Feature Pooling (DFP)...
Local video features provide state-of-the-art performance for action recognition. While the accuracy...