In this paper, we describe the Raytheon BBN Technologies (BBN) led VISER system for the TRECVID 2014 Multimedia Event Detection (MED) and Recounting (MER) tasks. We present a comprehensive analysis of the different modules: (1) Metadata Generator (MG) – a large suite of audio-visual low-level and sematic features; a set of deep convolutional neural network (DCNN) features trained on the ImageNet dataset; automatic speech recognition (ASR); videotext detection and recognition (OCR). For the low-level features, we used D-SIFT, Opponent SIFT, dense trajectories (HOG+HOF+MBH), MFCC and Fisher Vector (FV) representation. For the semantic concepts, we have trained 1,800 weakly supervised concepts from the Research Set videos and a set of YouTube...
This paper presents an overview and comparative analysis of our systems designed for the TRECVID 201...
Multimedia Event Detection (MED) aims to identify events—also called scenes—in videos, such as a flas...
In this paper, we propose a discriminative video rep-resentation for event detection over a large sc...
We describe the Raytheon BBN Technologies (BBN) led VISER system for the TRECVID 2013 Multimedia Eve...
We describe the Raytheon BBN Technologies (BBN) led VISER system for the TRECVID 2013 Multimedia Eve...
We describe the Raytheon BBN Technologies (BBN) led VISER system for the TRECVID 2012 Multimedia Eve...
We describe the Raytheon BBN (BBN) VISER system that is designed to detect events of interest in mul...
<p>In this paper, we present recent experiments on using Artificial Neural Networks (ANNs), a new “d...
The management of digital video has become a very challenging problem as the amount of video content...
We often come across events on our daily commute such as a traffic jam, a person running a red light...
We describe the TNO system and the evaluation results for TRECVID 2013 Multimedia Event Detection (M...
In this paper we summarize our TRECVID 2015 video recognition experiments. We participated in three ...
This paper strives for video event detection using a representation learned from deep convolutional ...
In this paper, we propose a discriminative video representation for event detection over a large sca...
This paper describes a system for multimedia event detection and recounting. The goal is to detect a...
This paper presents an overview and comparative analysis of our systems designed for the TRECVID 201...
Multimedia Event Detection (MED) aims to identify events—also called scenes—in videos, such as a flas...
In this paper, we propose a discriminative video rep-resentation for event detection over a large sc...
We describe the Raytheon BBN Technologies (BBN) led VISER system for the TRECVID 2013 Multimedia Eve...
We describe the Raytheon BBN Technologies (BBN) led VISER system for the TRECVID 2013 Multimedia Eve...
We describe the Raytheon BBN Technologies (BBN) led VISER system for the TRECVID 2012 Multimedia Eve...
We describe the Raytheon BBN (BBN) VISER system that is designed to detect events of interest in mul...
<p>In this paper, we present recent experiments on using Artificial Neural Networks (ANNs), a new “d...
The management of digital video has become a very challenging problem as the amount of video content...
We often come across events on our daily commute such as a traffic jam, a person running a red light...
We describe the TNO system and the evaluation results for TRECVID 2013 Multimedia Event Detection (M...
In this paper we summarize our TRECVID 2015 video recognition experiments. We participated in three ...
This paper strives for video event detection using a representation learned from deep convolutional ...
In this paper, we propose a discriminative video representation for event detection over a large sca...
This paper describes a system for multimedia event detection and recounting. The goal is to detect a...
This paper presents an overview and comparative analysis of our systems designed for the TRECVID 201...
Multimedia Event Detection (MED) aims to identify events—also called scenes—in videos, such as a flas...
In this paper, we propose a discriminative video rep-resentation for event detection over a large sc...