The objective of this work is person-clustering in videos – grouping characters according to their identity. Previous methods focus on the narrower task of face-clustering, and for the most part ignore other cues such as the person’s voice, their overall appearance (hair, clothes, posture), and the editing structure of the videos. Similarly, most current datasets evaluate only the task of face-clustering, rather than person-clustering. This limits their applicability to downstream applications such as story understanding which require person-level, rather than only face-level, reasoning.In this paper we make contributions to address both these deficiencies: first, we introduce a Multi-Modal High-Precision Clustering algorithm for person-clu...
International audienceOur goal is to automatically identify faces in TV broadcast without a pre-defi...
Accepted to ICCV 2019, code and data at https://github.com/makarandtapaswi/BallClustering_ICCV2019In...
International audienceThis paper describes our approach and results in the multi-modal person discov...
International audienceThe objective of this work is person-clustering in videos-grouping characters ...
International audienceThe objective of this work is person-clustering in videos-grouping characters ...
Multimedia databases are growing rapidly in size in the digital age. To increase the value of these ...
Human faces represent not only a challenging recognition problem for computer vision, but are also a...
The goal of this paper is unsupervised face clustering in edited video material – where face tracks ...
Clustering identities in a video is a useful task to aid in video search, annotation and retrieval, ...
Content-based people clustering is a crucial step for people indexing within video documents. In thi...
This thesis has investigated how to cluster a large number of faces within a multi-media corpus in t...
International audienceOur goal is to automatically identify faces in TV broadcast without a pre-defi...
International audienceOur goal is to automatically identify faces in TV broadcast without a pre-defi...
International audienceOur goal is to automatically identify faces in TV broadcast without a pre-defi...
International audienceOur goal is to automatically identify faces in TV broadcast without a pre-defi...
International audienceOur goal is to automatically identify faces in TV broadcast without a pre-defi...
Accepted to ICCV 2019, code and data at https://github.com/makarandtapaswi/BallClustering_ICCV2019In...
International audienceThis paper describes our approach and results in the multi-modal person discov...
International audienceThe objective of this work is person-clustering in videos-grouping characters ...
International audienceThe objective of this work is person-clustering in videos-grouping characters ...
Multimedia databases are growing rapidly in size in the digital age. To increase the value of these ...
Human faces represent not only a challenging recognition problem for computer vision, but are also a...
The goal of this paper is unsupervised face clustering in edited video material – where face tracks ...
Clustering identities in a video is a useful task to aid in video search, annotation and retrieval, ...
Content-based people clustering is a crucial step for people indexing within video documents. In thi...
This thesis has investigated how to cluster a large number of faces within a multi-media corpus in t...
International audienceOur goal is to automatically identify faces in TV broadcast without a pre-defi...
International audienceOur goal is to automatically identify faces in TV broadcast without a pre-defi...
International audienceOur goal is to automatically identify faces in TV broadcast without a pre-defi...
International audienceOur goal is to automatically identify faces in TV broadcast without a pre-defi...
International audienceOur goal is to automatically identify faces in TV broadcast without a pre-defi...
Accepted to ICCV 2019, code and data at https://github.com/makarandtapaswi/BallClustering_ICCV2019In...
International audienceThis paper describes our approach and results in the multi-modal person discov...