Video sequence classification is the task of assigning semantic labels to video sequences. It has a wide range of applications, including human action recognition, video genre classification, and abnormal behavior detection. The essential cues for video classification are spatial structures and their changes along the time axis. This thesis presents two methods for pre-processing the input video sequence. First, an efficient optical flow estimation method is proposed to estimate pixel-level velocity from two adjacent frames. This method applies max pooling and min pooling to construct a hierarchical feature structure for coarse-to-fine patch matching. Optical flow is then estimated from the matching set via an interpolation a...
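As a rough illustration of the pooling hierarchy mentioned in the abstract above, the following sketch builds a coarse-to-fine pyramid of max-pooled and min-pooled maps from a single grayscale frame. It is not the thesis's implementation: the function names `pool2x2` and `build_pyramid`, the 2x2 non-overlapping window, the number of levels, and the use of raw intensities as the feature are all assumptions made for the example, and the patch matching and interpolation steps are not shown.

```python
import numpy as np


def pool2x2(x, reduce_fn):
    """Pool a 2-D array over non-overlapping 2x2 blocks.

    reduce_fn is np.max or np.min. Odd trailing rows/columns are cropped.
    (Hypothetical helper; the window size is an assumption.)
    """
    h, w = x.shape
    h2, w2 = h - h % 2, w - w % 2
    blocks = x[:h2, :w2].reshape(h2 // 2, 2, w2 // 2, 2)
    return reduce_fn(blocks, axis=(1, 3))


def build_pyramid(frame, levels=3):
    """Return a list of (max_map, min_map) pairs, finest level first.

    Each coarser level halves the resolution, so matching could start at
    the last (coarsest) entry and be refined towards the first.
    """
    max_map = min_map = np.asarray(frame, dtype=np.float32)
    pyramid = [(max_map, min_map)]
    for _ in range(levels - 1):
        max_map = pool2x2(max_map, np.max)
        min_map = pool2x2(min_map, np.min)
        pyramid.append((max_map, min_map))
    return pyramid


if __name__ == "__main__":
    frame = np.arange(16).reshape(4, 4)  # toy 4x4 "frame"
    for level, (mx, mn) in enumerate(build_pyramid(frame, levels=3)):
        print(level, mx.shape, mn.shape)
```

Keeping both the max and the min map at each level preserves the range of values inside every pooled block, which is one plausible reason for pairing the two operators; the abstract itself does not state the motivation.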
We propose a set of kinematic features that are derived from the optical flow for human action recog...
The task of aligning multiple audio visual sequences with similar contents needs careful synchronisa...
Feature point detection and local feature extraction are the two critical steps in trajectory-based ...
Classification of human actions from real-world video data is one of the most important topics in co...
The development of the Internet makes the number of online videos increase dramatically, which bring...
A real world scene may contain several objects with different spatial and temporal characteristics. ...
Recognizing actions is one of the important challenges in computer vision with respect to video data...
Project page: https://rohitgirdhar.github.io/ActionVLAD/ In this work, we intro...
This paper presents and investigates a set of local space-time descriptors for representing and reco...
This work deals with audio-visual video recognition using machine learning. A general audio-visual v...
Representation learning is a fundamental research problem in the area of machine learning, refining ...