The task of multimedia event detection (MED) aims at training a set of models that can automatically detect the most event-relevant videos from large datasets. In this paper, we attempt to build a robust spatial-temporal deep neural network for large-scale video event detection. In our setting, each video follows a multiple instance assumption, where its visual segments contain both spatial and temporal properties of events. Regarding these properties, we try to implement the MED system by a two-step training phase: unsupervised recurrent video reconstruction and supervised fine-tuning. We conduct extensive experiments on the challenging TRECVID MED14 dataset, which indicate that with the consideration of both spatial and temporal informati...
Detecting complex video events based on audio and visual modalities is still a largely unresolved is...
The amount of available surveillance video data is increasing rapidly and therefore makes manual ins...
Video event detection allows intelligent indexing of video content based on events. Traditional appr...
© 2015 IEEE. In this paper, we focus on complex event detection in internet videos while also provid...
<p> Abnormal event detection is extremely important, especially for video surveillance. Nowadays, m...
We present a novel hierarchical, distributed model for unsupervised learning of invariant spatio-tem...
In this paper, we propose a discriminative video rep-resentation for event detection over a large sc...
In this paper, we propose a discriminative video representation for event detection over a large sca...
The management of digital video has become a very challenging problem as the amount of video content...
We present a novel hierarchical and distributed model for learning invariant spatio-temporal feature...
In this work, we propose an approach to the spatiotemporal localisation (detection) and classificati...
We often come across events on our daily commute such as a traffic jam, a person running a red light...
This thesis has addressed the topic of event detection in videos, which is a challenging problem as ...
Within a large range of applications in computer vision, Human Action Recognition has become one of ...
With the proliferation of online services and mobile technologies, the world has stepped into a mult...
Detecting complex video events based on audio and visual modalities is still a largely unresolved is...
The amount of available surveillance video data is increasing rapidly and therefore makes manual ins...
Video event detection allows intelligent indexing of video content based on events. Traditional appr...
© 2015 IEEE. In this paper, we focus on complex event detection in internet videos while also provid...
<p> Abnormal event detection is extremely important, especially for video surveillance. Nowadays, m...
We present a novel hierarchical, distributed model for unsupervised learning of invariant spatio-tem...
In this paper, we propose a discriminative video rep-resentation for event detection over a large sc...
In this paper, we propose a discriminative video representation for event detection over a large sca...
The management of digital video has become a very challenging problem as the amount of video content...
We present a novel hierarchical and distributed model for learning invariant spatio-temporal feature...
In this work, we propose an approach to the spatiotemporal localisation (detection) and classificati...
We often come across events on our daily commute such as a traffic jam, a person running a red light...
This thesis has addressed the topic of event detection in videos, which is a challenging problem as ...
Within a large range of applications in computer vision, Human Action Recognition has become one of ...
With the proliferation of online services and mobile technologies, the world has stepped into a mult...
Detecting complex video events based on audio and visual modalities is still a largely unresolved is...
The amount of available surveillance video data is increasing rapidly and therefore makes manual ins...
Video event detection allows intelligent indexing of video content based on events. Traditional appr...