Accurate and efficient video classification demands the fusion of multimodal information and the use of interme-diate representations. Combining the two ideas into the one framework, we propose a probabilistic approach for video classification using intermediate semantic repre-sentations derived from multi-modal features. Based on a class of bipartite undirected graphical models named harmonium, our approach represents the video data as latent semantic topics derived by jointly modeling the transcript keywords and color-histogram features, and performs classification using these latent topics under a unified framework. We show satisfactory classification performance of our approach on a benchmark dataset as well as interesting insights into...
Multimedia documentalists need effective tools to organize and search into large video collections. ...
In this paper, we propose a novel Spatio-Temporal Video Retrieval model to extract spatio-temporal a...
This paper presents a novel unsupervised method for identifying the semantic structure in long semi-...
Research Doctorate - Doctor of Philosophy (PhD)This thesis investigates the problem of seeking multi...
Abstract—This paper presents a novel method for automati-cally classifying consumer video clips base...
In this paper, we propose a generative model-based approach for audio-visual event classification. T...
Digital video now plays an important role in medical education and healthcare, but our ability to au...
Abstract Video data are usually represented by high dimensional features. The performance of video s...
We address the problem of classifying complex videos based on their content. A typical approach to t...
We present an architecture for learning semantic multi-modal video representations to learn semantic...
Description of human activities in videos results not only in detection of actions and objects but a...
According to some current thinking, a very large number of semantic concepts could provide researche...
We focus on the problem of learning semantics from multimedia data associated with broad- cast video...
Abstract In this paper we describe a multi-strategy approach to improving semantic extraction from n...
We focus on the problem of learning semantics from multimedia data associated with broadcast video d...
Multimedia documentalists need effective tools to organize and search into large video collections. ...
In this paper, we propose a novel Spatio-Temporal Video Retrieval model to extract spatio-temporal a...
This paper presents a novel unsupervised method for identifying the semantic structure in long semi-...
Research Doctorate - Doctor of Philosophy (PhD)This thesis investigates the problem of seeking multi...
Abstract—This paper presents a novel method for automati-cally classifying consumer video clips base...
In this paper, we propose a generative model-based approach for audio-visual event classification. T...
Digital video now plays an important role in medical education and healthcare, but our ability to au...
Abstract Video data are usually represented by high dimensional features. The performance of video s...
We address the problem of classifying complex videos based on their content. A typical approach to t...
We present an architecture for learning semantic multi-modal video representations to learn semantic...
Description of human activities in videos results not only in detection of actions and objects but a...
According to some current thinking, a very large number of semantic concepts could provide researche...
We focus on the problem of learning semantics from multimedia data associated with broad- cast video...
Abstract In this paper we describe a multi-strategy approach to improving semantic extraction from n...
We focus on the problem of learning semantics from multimedia data associated with broadcast video d...
Multimedia documentalists need effective tools to organize and search into large video collections. ...
In this paper, we propose a novel Spatio-Temporal Video Retrieval model to extract spatio-temporal a...
This paper presents a novel unsupervised method for identifying the semantic structure in long semi-...