Audio-visual event detection aims to identify semantically defined events that reveal human activities. Most previous work focused on a restricted set of highlight events and depended on highly ad hoc detectors for them. This research emphasizes generalizable, robust modeling of single-microphone audio cues and/or single-camera visual cues for the detection of real-world events, requiring no expensive annotation other than the known timestamps of the training events. To model the audio cues for event detection, we leverage statistical models proven effective in speech recognition. First, a tandem connectionist-HMM approach combines the sequence modeling capabilities of the hidden Markov model (HMM) with the context-dependent discrim...
This paper investigates the use of unlabeled data to help labeled data for audio-visual event recogn...
Semantic-level content analysis is a crucial issue in achieving efficient content retrieval and man...
The management of digital video has become a very challenging problem as the amount of video content...
Non-speech acoustic event detection (AED) aims to recognize events that are relevant to human activi...
The detection of the acoustic events (AEs) that are naturally produced in a meeting room may help t...
Acoustic event detection (AED) aims at determining the identity of sounds and their ...
Acoustic Event Detection The detection of the Acoustic Events (AEs) naturally produced in a meeting ...
Copyright © 2011 Taras Butko et al. This is an open access article distributed under the Creative Co...
Multimedia event detection (MED) on user-generated content is the task of finding an event, e.g., a ...
In this paper, we present recent experiments on using Artificial Neural Networks (ANNs), a new “d...
Multimedia Event Detection (MED) aims to identify events—also called scenes—in videos, such as a flas...