In this work, we look into the problem of recognizing two-person interactions in videos. Our method integrates multiple visual features in a weakly supervised manner by utilizing an embedding-based multiple instance learning framework. In our proposed method, first, several visual features that capture the shape and motion of the interacting people are extracted from each detected person region in a video. Then, two-person visual descriptors are formed. Since the relative spatial locations of interacting people are likely to complement the visual descriptors, we propose to use spatial multiple instance embedding, which implicitly incorporates the distances between people into the multiple instance learning process. Experimental results on t...
International audienceWe introduce an approach for learning human actions as interactions between pe...
We introduce a novel spatio-temporal deformable part model for offline detection of fine-grained int...
This thesis investigates the classification of the behaviour of multiple persons when viewed from a ...
Abstract In this work, we look into the problem of recognizing two-person interactions in videos. Ou...
In this paper, we address the problem of recognizing human interaction of two persons from videos. W...
The literature in human activity recognition is very broad and many different approaches have been p...
The objective of this work is recognition and spatiotemporal localization of two-person interactions...
The objective of this work is recognition and spatiotemporal localization of two-person interactions...
In this paper we address the problem of recognising interactions between two people in realistic sce...
International audienceGroup activity recognition from videos is a very challenging problem that has ...
International audienceBag of Visual Words Model (BoVW) has achieved impressive performance on human ...
In many cases, human actions can be identified not only by the singular observation of the human bod...
We introduce a novel spatiotemporal deformable part model for the localization of fine-grained human...
A dyadic interaction is a behavioral exchange between two people. In this thesis a computer framewor...
A dyadic interaction is a behavioral exchange between two people. In this thesis a computer framewor...
International audienceWe introduce an approach for learning human actions as interactions between pe...
We introduce a novel spatio-temporal deformable part model for offline detection of fine-grained int...
This thesis investigates the classification of the behaviour of multiple persons when viewed from a ...
Abstract In this work, we look into the problem of recognizing two-person interactions in videos. Ou...
In this paper, we address the problem of recognizing human interaction of two persons from videos. W...
The literature in human activity recognition is very broad and many different approaches have been p...
The objective of this work is recognition and spatiotemporal localization of two-person interactions...
The objective of this work is recognition and spatiotemporal localization of two-person interactions...
In this paper we address the problem of recognising interactions between two people in realistic sce...
International audienceGroup activity recognition from videos is a very challenging problem that has ...
International audienceBag of Visual Words Model (BoVW) has achieved impressive performance on human ...
In many cases, human actions can be identified not only by the singular observation of the human bod...
We introduce a novel spatiotemporal deformable part model for the localization of fine-grained human...
A dyadic interaction is a behavioral exchange between two people. In this thesis a computer framewor...
A dyadic interaction is a behavioral exchange between two people. In this thesis a computer framewor...
International audienceWe introduce an approach for learning human actions as interactions between pe...
We introduce a novel spatio-temporal deformable part model for offline detection of fine-grained int...
This thesis investigates the classification of the behaviour of multiple persons when viewed from a ...