Abstract: In this paper, we review and propose methods for audio speaker segmentation, video costume segmentation and the automatic association between each voice and the images containing the corresponding visual person. This association can be used as a preprocessing step for existing applications like person identification systems. The first step consists in fusing, without any a priori knowledge, the two indexes produced by audio and video segmentations, in order to make the information brought by each of them more robust. Evaluation is done on a corpus composed of French TV broadcasts. If both audio and video streams are correctly segmented, this automatic association yields excellent results. When the two streams are oversegmented, ou...
International audienceThis paper describes a multi-modal person recognition system for video broadca...
International audienceThis paper describes a multi-modal person recognition system for video broadca...
International audienceThis paper describes a multi-modal person recognition system for video broadca...
In the audiovisual indexing context, we propose and experi-ment a method that, from an audio speaker...
In the audiovisual indexing context, we propose a system for automatic association of voices and ima...
This thesis consists to propose a method for an unsupervised characterization of persons within audi...
This thesis consists to propose a method for an unsupervised characterization of persons within audi...
Multimedia databases are growing rapidly in size in the digital age. To increase the value of these ...
Comunicació presentada a: the 15th International Workshop on Content-Based Multimedia Indexing (CBMI...
Comunicació presentada a: the 15th International Workshop on Content-Based Multimedia Indexing (CBMI...
L'analyse automatique de contenu des vidéos en vue de leur annotation est un domaine de recherche en...
We investigate the problem of audio-visual (AV) person diarization in broadcast data. That is, autom...
With increasing internet use, the amount of multimedia content multiplies, making it necessary to de...
International audienceWe investigate the problem of audiovisual (AV) person di-arization in broadcas...
With increasing internet use, the amount of multimedia content multiplies, making it necessary to de...
International audienceThis paper describes a multi-modal person recognition system for video broadca...
International audienceThis paper describes a multi-modal person recognition system for video broadca...
International audienceThis paper describes a multi-modal person recognition system for video broadca...
In the audiovisual indexing context, we propose and experi-ment a method that, from an audio speaker...
In the audiovisual indexing context, we propose a system for automatic association of voices and ima...
This thesis consists to propose a method for an unsupervised characterization of persons within audi...
This thesis consists to propose a method for an unsupervised characterization of persons within audi...
Multimedia databases are growing rapidly in size in the digital age. To increase the value of these ...
Comunicació presentada a: the 15th International Workshop on Content-Based Multimedia Indexing (CBMI...
Comunicació presentada a: the 15th International Workshop on Content-Based Multimedia Indexing (CBMI...
L'analyse automatique de contenu des vidéos en vue de leur annotation est un domaine de recherche en...
We investigate the problem of audio-visual (AV) person diarization in broadcast data. That is, autom...
With increasing internet use, the amount of multimedia content multiplies, making it necessary to de...
International audienceWe investigate the problem of audiovisual (AV) person di-arization in broadcas...
With increasing internet use, the amount of multimedia content multiplies, making it necessary to de...
International audienceThis paper describes a multi-modal person recognition system for video broadca...
International audienceThis paper describes a multi-modal person recognition system for video broadca...
International audienceThis paper describes a multi-modal person recognition system for video broadca...