We propose a function-based temporal pooling method that captures the latent structure of the video sequence data - e.g., how frame-level features evolve over time in a video. We show how the parameters of a function that has been fit to the video data can serve as a robust new video representation. As a specific example, we learn a pooling function via ranking machines. By learning to rank the frame-level features of a video in chronological order, we obtain a new representation that captures the video-wide temporal dynamics of a video, suitable for action recognition. Other than ranking functions, we explore different parametric models that could also explain the temporal changes in videos. The proposed functional pooling methods, and ran...
First-person videos have unique characteristics such as heavy egocentric motion, strong preceding e...
Recognizing multiple types of actions appearing in a continuous temporal order from a streaming vide...
We introduce a simple yet effective network that embeds a novel Discriminative Feature Pooling (DFP)...
We propose a function-based temporal pooling method that captures the latent structure of the video ...
Feature ranking from video-wide temporal evolution brings reliable information for complex action re...
© 2015 IEEE. In this paper we present a method to capture video-wide temporal information for action...
Most video based action recognition approaches create the video-level representation by temporally p...
Rank pooling is a temporal encoding method that summarizes the dynamics of a video sequence to a sin...
Deep learning models for video-based action recognition usually generate features for short clips (c...
Human action recognition is valuable for numerous practical applications, e.g., gaming, video survei...
We introduce the concept of dynamic image, a novel compact representation of videos useful for video...
We introduce the concept of dynamic image, a novel compact representation of videos useful for video...
This thesis focuses on video understanding for human action and interaction recognition. We start by...
In this dissertation, I present my work towards exploring temporal information for better video unde...
Recent studies have demonstrated the power of recurrent neural networks for machine translation, ima...
First-person videos have unique characteristics such as heavy egocentric motion, strong preceding e...
Recognizing multiple types of actions appearing in a continuous temporal order from a streaming vide...
We introduce a simple yet effective network that embeds a novel Discriminative Feature Pooling (DFP)...
We propose a function-based temporal pooling method that captures the latent structure of the video ...
Feature ranking from video-wide temporal evolution brings reliable information for complex action re...
© 2015 IEEE. In this paper we present a method to capture video-wide temporal information for action...
Most video based action recognition approaches create the video-level representation by temporally p...
Rank pooling is a temporal encoding method that summarizes the dynamics of a video sequence to a sin...
Deep learning models for video-based action recognition usually generate features for short clips (c...
Human action recognition is valuable for numerous practical applications, e.g., gaming, video survei...
We introduce the concept of dynamic image, a novel compact representation of videos useful for video...
We introduce the concept of dynamic image, a novel compact representation of videos useful for video...
This thesis focuses on video understanding for human action and interaction recognition. We start by...
In this dissertation, I present my work towards exploring temporal information for better video unde...
Recent studies have demonstrated the power of recurrent neural networks for machine translation, ima...
First-person videos have unique characteristics such as heavy egocentric motion, strong preceding e...
Recognizing multiple types of actions appearing in a continuous temporal order from a streaming vide...
We introduce a simple yet effective network that embeds a novel Discriminative Feature Pooling (DFP)...