We address the problem of learning a joint model of actors and actions in movies using weak supervision pro-vided by scripts. Specifically, we extract actor/action pairs from the script and use them as constraints in a discrimi-native clustering framework. The corresponding optimiza-tion problem is formulated as a quadratic program under linear constraints. People in video are represented by au-tomatically extracted and tracked faces together with cor-responding motion features. First, we apply the proposed framework to the task of learning names of characters in the movie and demonstrate significant improvements over previous methods used for this task. Second, we explore the joint actor/action constraint and show its advantage for weakly ...
International audienceThis paper exploits the context of natural dynamic scenes for human action rec...
Videos often depict complex scenes including people, objects and interactions between these and the ...
This dissertation targets the recognition of human actions in realistic video data, such as movies. ...
International audienceWe address the problem of learning a joint model of actors and actions in movi...
This paper addresses the problem of automatic temporal annotation of realistic human actions in vide...
As web and personal content become ever more enriched by videos, there is increasing need for semant...
The aim of this paper is to address recognition of natural human actions in diverse and realistic vi...
We address recognition and localization of human actions in realistic scenarios. In contrast to the ...
International audienceThe aim of this paper is to address recognition of natural human actions in di...
We introduce a generative model for learning person and costume specific detectors from labeled exam...
International audienceWe propose a novel human-centric approach to detect and localize human actions...
This work addresses the problem of recognizing actions and interactions in realistic video settings ...
International audienceWe introduce a generative model for learning person and costume specific detec...
Human behavior understanding is a fundamental problem of computer vision. It is an important compone...
We address the character identification problem in movies and television videos: assigning names to ...
International audienceThis paper exploits the context of natural dynamic scenes for human action rec...
Videos often depict complex scenes including people, objects and interactions between these and the ...
This dissertation targets the recognition of human actions in realistic video data, such as movies. ...
International audienceWe address the problem of learning a joint model of actors and actions in movi...
This paper addresses the problem of automatic temporal annotation of realistic human actions in vide...
As web and personal content become ever more enriched by videos, there is increasing need for semant...
The aim of this paper is to address recognition of natural human actions in diverse and realistic vi...
We address recognition and localization of human actions in realistic scenarios. In contrast to the ...
International audienceThe aim of this paper is to address recognition of natural human actions in di...
We introduce a generative model for learning person and costume specific detectors from labeled exam...
International audienceWe propose a novel human-centric approach to detect and localize human actions...
This work addresses the problem of recognizing actions and interactions in realistic video settings ...
International audienceWe introduce a generative model for learning person and costume specific detec...
Human behavior understanding is a fundamental problem of computer vision. It is an important compone...
We address the character identification problem in movies and television videos: assigning names to ...
International audienceThis paper exploits the context of natural dynamic scenes for human action rec...
Videos often depict complex scenes including people, objects and interactions between these and the ...
This dissertation targets the recognition of human actions in realistic video data, such as movies. ...