Automatic understanding of videos is one of the most active areas of computer vision research. It has applications in video surveillance, human computer interaction, video sports analysis, virtual and augmented reality, video retrieval etc. In this dissertation, we address four important tasks in video understanding, namely action recognition, temporal action localization, spatial-temporal action detection and video object/action segmentation. This dissertation makes contributions to above tasks by proposing. First, for video action recognition, we propose a category level feature learning method. Our proposed method automatically identifies such pairs of categories using a criterion of mutual pairwise proximity in the (kernelized) feature ...
In this thesis the problem of automatic human action recognition and localization in videos is studi...
Action recognition requires the accurate analysis of action elements in the form of a video clip and...
Despite outstanding performance in image recognition, convolutional neural networks (CNNs) do not ye...
Deep learning has been demonstrated to achieve excellent results for image classification and object...
Classification of human actions from real-world video data is one of the most important topics in co...
In this work, we propose an approach to the spatiotemporal localisation (detection) and classificati...
Technological innovation in the field of video action recognition drives the development of video-ba...
In this project, our work can be divided into two parts: RGB-D based action recognition in trimmed v...
This thesis focuses on video understanding for human action and interaction recognition. We start by...
As imaging systems become ubiquitous, the ability to recognize human actions is becoming increasingl...
Temporal action detection in long, untrimmed videos is an important yet challenging task that requir...
Temporal action detection in long, untrimmed videos is an important yet challenging task that requir...
As imaging systems become ubiquitous, the ability to recognize human actions is becoming increasingl...
With an exponential growth in the number of video capturing devices and digital video content, autom...
Despite their great predictive capability, Convolutional Neural Networks (CNNs) are computational-ex...
In this thesis the problem of automatic human action recognition and localization in videos is studi...
Action recognition requires the accurate analysis of action elements in the form of a video clip and...
Despite outstanding performance in image recognition, convolutional neural networks (CNNs) do not ye...
Deep learning has been demonstrated to achieve excellent results for image classification and object...
Classification of human actions from real-world video data is one of the most important topics in co...
In this work, we propose an approach to the spatiotemporal localisation (detection) and classificati...
Technological innovation in the field of video action recognition drives the development of video-ba...
In this project, our work can be divided into two parts: RGB-D based action recognition in trimmed v...
This thesis focuses on video understanding for human action and interaction recognition. We start by...
As imaging systems become ubiquitous, the ability to recognize human actions is becoming increasingl...
Temporal action detection in long, untrimmed videos is an important yet challenging task that requir...
Temporal action detection in long, untrimmed videos is an important yet challenging task that requir...
As imaging systems become ubiquitous, the ability to recognize human actions is becoming increasingl...
With an exponential growth in the number of video capturing devices and digital video content, autom...
Despite their great predictive capability, Convolutional Neural Networks (CNNs) are computational-ex...
In this thesis the problem of automatic human action recognition and localization in videos is studi...
Action recognition requires the accurate analysis of action elements in the form of a video clip and...
Despite outstanding performance in image recognition, convolutional neural networks (CNNs) do not ye...