Recently, newly invented features (e.g. Fisher vector, VLAD) have achieved state-of-the-art performance in large-scale video analysis systems that aims to understand the contents in videos, such as concept recognition and event detection. However, these features are in high-dimensional representations, which remarkably increases computation costs and correspondingly deteriorates the performance of subsequent learning tasks. Notably, the situation becomes even worse when dealing with large-scale video data where the number of class labels are limited. To address this problem, we propose a novel algorithm to compactly represent huge amounts of unconstrained video data. Specifically, redundant feature dimensions are removed by using our propos...
Explosive growth of multimedia data has brought challenge of how to efficiently browse, retrieve and...
In the last decade, we have witnessed exponential growth of visual content in internet social media ...
Recognition of complex events in consumer uploaded Internet videos, captured under realworld setting...
Recently, newly invented features (e.g. Fisher vector, VLAD) have achieved state-of-the-art performa...
Feature selection is essential for effective visual recognition. We propose an efficient joint class...
<p>With widespread availability of low-cost devices capable of photo shooting and high-volume video ...
In this thesis, we investigate different representations and models for large-scale video understand...
Abstract Video data are usually represented by high dimensional features. The performance of video s...
Complex video analysis is a challenging problem due to the long and sophisticated temporal structure...
© 2017 IEEE. Video semantic recognition usually suffers from the curse of dimensionality and the abs...
In this paper, we propose a novel semi-supervised feature analyzing framework for multimedia data un...
© 2014 IEEE. To improve both the efficiency and accuracy of video semantic recognition, we can perfo...
We consider the problem of extracting descriptors that represent visually salient portions of a vide...
Video analysis has been attracting increasing research due to the proliferation of internet videos. ...
Explosive growth of multimedia data has brought challenge of how to efficiently browse, retrieve and...
Explosive growth of multimedia data has brought challenge of how to efficiently browse, retrieve and...
In the last decade, we have witnessed exponential growth of visual content in internet social media ...
Recognition of complex events in consumer uploaded Internet videos, captured under realworld setting...
Recently, newly invented features (e.g. Fisher vector, VLAD) have achieved state-of-the-art performa...
Feature selection is essential for effective visual recognition. We propose an efficient joint class...
<p>With widespread availability of low-cost devices capable of photo shooting and high-volume video ...
In this thesis, we investigate different representations and models for large-scale video understand...
Abstract Video data are usually represented by high dimensional features. The performance of video s...
Complex video analysis is a challenging problem due to the long and sophisticated temporal structure...
© 2017 IEEE. Video semantic recognition usually suffers from the curse of dimensionality and the abs...
In this paper, we propose a novel semi-supervised feature analyzing framework for multimedia data un...
© 2014 IEEE. To improve both the efficiency and accuracy of video semantic recognition, we can perfo...
We consider the problem of extracting descriptors that represent visually salient portions of a vide...
Video analysis has been attracting increasing research due to the proliferation of internet videos. ...
Explosive growth of multimedia data has brought challenge of how to efficiently browse, retrieve and...
Explosive growth of multimedia data has brought challenge of how to efficiently browse, retrieve and...
In the last decade, we have witnessed exponential growth of visual content in internet social media ...
Recognition of complex events in consumer uploaded Internet videos, captured under realworld setting...