Video content characterization is a challenging problem in video databases. The aim of such characterization is to generate indices that can describe a video clip in terms of objects and their actions in the clip. Generally, such indices are extracted by performing image analysis on video clips. Many such indices can also be generated by analyzing the embedded audio information of video clips. Indices pertaining to context, scene emotion, and actors or characters present in a video clip appear especially suitable for generation via audio analysis techniques of keyword spotting, and speech and speaker recognition. In this paper, we examine the potential of speaker identification techniques for characterizing video clips in terms of actors pr...
Abstract: A video‘s soundtrack is usually highly correlated to its content. Hence, audio-based techn...
The organization of video data-bases according to semantic content of data, is a key point in multim...
International audienceWe present in this paper preliminary results using speaker recognition and spe...
Determining automatically what constitutes a scene in a video is a challenging task, particularly si...
Determining automatically what constitutes a scene in a video is a challenging task, particularly si...
A challenging problem to construct video databases is the organization of video information. The dev...
Determining automatically what constitutes a scene in a video is a challenging task, particularly si...
Determining automatically what constitutes a scene in a video is a challenging task, particularly si...
Disclosed herein is a mechanism for generating and providing captions based on speaker identificatio...
A novel technique, which uses a joint audio-visual analysis for scene identification and characteriz...
We describe a scheme to combine the results of audio and face identification for multimedia indexing...
Abstract: In this paper, we review and propose methods for audio speaker segmentation, video costume...
With the rapid growth of the multimedia data, especially for videos, the ability to better and time-...
The organization of video databases according to the semantic content of data is a key point in mult...
This paper describes a multi-modal person recognition sys-tem for video broadcast developed for part...
Abstract: A video‘s soundtrack is usually highly correlated to its content. Hence, audio-based techn...
The organization of video data-bases according to semantic content of data, is a key point in multim...
International audienceWe present in this paper preliminary results using speaker recognition and spe...
Determining automatically what constitutes a scene in a video is a challenging task, particularly si...
Determining automatically what constitutes a scene in a video is a challenging task, particularly si...
A challenging problem to construct video databases is the organization of video information. The dev...
Determining automatically what constitutes a scene in a video is a challenging task, particularly si...
Determining automatically what constitutes a scene in a video is a challenging task, particularly si...
Disclosed herein is a mechanism for generating and providing captions based on speaker identificatio...
A novel technique, which uses a joint audio-visual analysis for scene identification and characteriz...
We describe a scheme to combine the results of audio and face identification for multimedia indexing...
Abstract: In this paper, we review and propose methods for audio speaker segmentation, video costume...
With the rapid growth of the multimedia data, especially for videos, the ability to better and time-...
The organization of video databases according to the semantic content of data is a key point in mult...
This paper describes a multi-modal person recognition sys-tem for video broadcast developed for part...
Abstract: A video‘s soundtrack is usually highly correlated to its content. Hence, audio-based techn...
The organization of video data-bases according to semantic content of data, is a key point in multim...
International audienceWe present in this paper preliminary results using speaker recognition and spe...