In the context of audiovisual indexing, we propose and test a method that, given an audio speaker segmentation and a video costume segmentation of an audiovisual document, automatically associates each voice with the images containing the corresponding person. This association can serve as a preprocessing step for existing applications such as person identification systems. The first step fuses, without any a priori knowledge, the two indexes produced by the audio and video segmentations, in order to make the information carried by each of them more robust. Evaluation is carried out on a corpus of French TV broadcasts. If both audio and video streams are correctly segmented, this automatic association...
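As a rough illustration of what associating voices with images can look like, the sketch below pairs each speaker label with the costume label it co-occurs with for the longest total time. This is only a minimal sketch under stated assumptions: the `(start, end, label)` segment format, the helper names `overlap` and `associate`, and the maximum-overlap rule are all hypothetical and are not taken from the abstract, which does not specify how the two indexes are fused.

```python
from collections import defaultdict


def overlap(a_start, a_end, b_start, b_end):
    """Duration (in seconds) during which two time intervals overlap."""
    return max(0.0, min(a_end, b_end) - max(a_start, b_start))


def associate(audio_segments, video_segments):
    """Map each voice label to the costume label it co-occurs with longest.

    Both arguments are lists of (start, end, label) tuples.
    Assumption: this max-overlap rule is an illustrative stand-in,
    not the fusion method evaluated in the paper.
    """
    # cooc[voice][costume] accumulates the total shared duration.
    cooc = defaultdict(lambda: defaultdict(float))
    for a_start, a_end, voice in audio_segments:
        for v_start, v_end, costume in video_segments:
            shared = overlap(a_start, a_end, v_start, v_end)
            if shared > 0:
                cooc[voice][costume] += shared
    # Keep, for each voice, the costume with the largest shared duration.
    return {voice: max(row, key=row.get) for voice, row in cooc.items()}


# Hypothetical toy indexes: two voices (V1, V2) and two costumes (C1, C2).
audio = [(0.0, 10.0, "V1"), (10.0, 18.0, "V2"), (18.0, 30.0, "V1")]
video = [(0.0, 9.0, "C1"), (9.0, 19.0, "C2"), (19.0, 30.0, "C1")]
mapping = associate(audio, video)
```

On this toy input, `mapping` pairs `V1` with `C1` and `V2` with `C2`, even though the audio and video boundaries are offset by a second. A voice that never overlaps any costume segment is simply absent from the result.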
Audio-Visual People Diarization (AVPD) is an original framework that simultane...
Paper presented at: the 15th International Workshop on Content-Based Multimedia Indexing (CBMI...
We describe a scheme to combine the results of audio and face identification for multimedia indexing...
Abstract: In this paper, we review and propose methods for audio speaker segmentation, video costume...
In the audiovisual indexing context, we propose a system for automatic association of voices and ima...
This thesis proposes a method for the unsupervised characterization of persons within audi...
We investigate the problem of audio-visual (AV) person diarization in broadcast data. That is, autom...
With increasing internet use, the amount of multimedia content multiplies, making it necessary to de...
We investigate the problem of audiovisual (AV) person diarization in broadcas...
We propose a system that describes and structures audiovisual documents without training,...
Multimedia databases are growing rapidly in size in the digital age. To increase the value of these ...
The automatic analysis of video content for annotation purposes is a research area in...