International audienceThe dynamic content of physical scenes is largely compositional, that is, the movements of the objects and of their parts are hierarchically organised and relate through composition along this hierarchy. This structure also prevails in the apparent 2D motion that a video captures. Accessing this visual motion hierarchy is important to get a better understanding of dynamic scenes and is useful for video manipulation. We propose to capture it through learned, tree-structured sparse coding of point trajectories. We leverage this new representation within an unsupervised clustering scheme to partition hierarchically the trajectories into meaningful groups. We show through experiments on motion capture data that our model i...
As early stage of video processing, we introduce an iter-ative trajectory merging algorithm that pro...
Recently, a video representation based on dense trajectories has been shown to outperform other huma...
This paper describes a method for building semantic scene models from video data using observed moti...
International audienceThe dynamic content of physical scenes is largely compositional, that is, the ...
Peer-reviewed paper accepted for presentation at the IEEE International Conference on Image Processi...
International audienceComplex activities, e.g., pole vaulting, are composed of a variable number of ...
AbstractThis paper presents a method to extract a part-based model of an observed scene from a video...
International audienceWe address the problem of recognizing complex activities, such as pole vaultin...
Extracting the 3D shape of deforming objects in monocular videos, a task known as non-rigid structur...
Point trajectories have emerged as a powerful means to obtain high quality and fully unsupervised se...
International audienceFeature trajectories have shown to be efficient for representing videos. Typic...
A system is described that tracks moving objects in a video dataset so as to extract a representatio...
Extracting 3D shape of deforming objects in monocular videos, a task known as non-rigid structure-fr...
We introduce a novel semi-supervised video segmentation approach based on an efficient video represe...
We introduce a semi-supervised video segmentation approach based on an efficient video representatio...
As early stage of video processing, we introduce an iter-ative trajectory merging algorithm that pro...
Recently, a video representation based on dense trajectories has been shown to outperform other huma...
This paper describes a method for building semantic scene models from video data using observed moti...
International audienceThe dynamic content of physical scenes is largely compositional, that is, the ...
Peer-reviewed paper accepted for presentation at the IEEE International Conference on Image Processi...
International audienceComplex activities, e.g., pole vaulting, are composed of a variable number of ...
AbstractThis paper presents a method to extract a part-based model of an observed scene from a video...
International audienceWe address the problem of recognizing complex activities, such as pole vaultin...
Extracting the 3D shape of deforming objects in monocular videos, a task known as non-rigid structur...
Point trajectories have emerged as a powerful means to obtain high quality and fully unsupervised se...
International audienceFeature trajectories have shown to be efficient for representing videos. Typic...
A system is described that tracks moving objects in a video dataset so as to extract a representatio...
Extracting 3D shape of deforming objects in monocular videos, a task known as non-rigid structure-fr...
We introduce a novel semi-supervised video segmentation approach based on an efficient video represe...
We introduce a semi-supervised video segmentation approach based on an efficient video representatio...
As early stage of video processing, we introduce an iter-ative trajectory merging algorithm that pro...
Recently, a video representation based on dense trajectories has been shown to outperform other huma...
This paper describes a method for building semantic scene models from video data using observed moti...