We introduce the first approach to solve the challenging problem of automatic 4D visual scene understanding for complex dynamic scenes with multiple interacting people from multi-view video. Our approach simultaneously estimates a detailed model that includes a per-pixel semantically and temporally coherent reconstruction, together with instance-level segmentation exploiting photo-consistency, semantic and motion information. We further leverage recent advances in 3D pose estimation to constrain the joint semantic instance segmentation and 4D temporally coherent reconstruction. This enables per person semantic instance segmentation of multiple interacting people in complex dynamic scenes. Extensive evaluation of the joint visual scene under...
International audienceIn this paper, we propose a new single shot method for multi-person 3D human p...
This paper introduces a general approach to dynamic scene reconstruction from multiple moving camera...
International audienceWe describe a method to obtain a pixel-wise segmentation and pose estimation o...
We introduce the first approach to solve the challenging problem of unsupervised 4D visual scene und...
Simultaneous semantically coherent object-based long-term 4D scene flow estimation, co-segmentation ...
This paper presents an approach for reconstruction of 4D temporally coherent models of complex dynam...
Existing techniques for dynamic scene reconstruction from multiple wide-baseline cameras primarily f...
In this paper we propose a framework for spatially and temporally coherent semantic co-segmentation ...
Abstract—We describe a method to obtain a pixel-wise segmentation and pose estimation of multiple pe...
The objective of this Thesis research is to develop algorithms for temporally consistent semantic se...
Abstract. Multiple human 3D pose estimation from multiple camera views is a challenging task in unco...
Abstract. Multiple human 3D pose estimation from multiple camera views is a challenging task in unco...
Abstract. We present an approach for joint inference of 3D scene struc-ture and semantic labeling fo...
International audienceIn this paper, we address the problem of segmenting consistently an evolving 3...
It is a challenging yet crucial task to have a comprehensive understanding of human activities and e...
International audienceIn this paper, we propose a new single shot method for multi-person 3D human p...
This paper introduces a general approach to dynamic scene reconstruction from multiple moving camera...
International audienceWe describe a method to obtain a pixel-wise segmentation and pose estimation o...
We introduce the first approach to solve the challenging problem of unsupervised 4D visual scene und...
Simultaneous semantically coherent object-based long-term 4D scene flow estimation, co-segmentation ...
This paper presents an approach for reconstruction of 4D temporally coherent models of complex dynam...
Existing techniques for dynamic scene reconstruction from multiple wide-baseline cameras primarily f...
In this paper we propose a framework for spatially and temporally coherent semantic co-segmentation ...
Abstract—We describe a method to obtain a pixel-wise segmentation and pose estimation of multiple pe...
The objective of this Thesis research is to develop algorithms for temporally consistent semantic se...
Abstract. Multiple human 3D pose estimation from multiple camera views is a challenging task in unco...
Abstract. Multiple human 3D pose estimation from multiple camera views is a challenging task in unco...
Abstract. We present an approach for joint inference of 3D scene struc-ture and semantic labeling fo...
International audienceIn this paper, we address the problem of segmenting consistently an evolving 3...
It is a challenging yet crucial task to have a comprehensive understanding of human activities and e...
International audienceIn this paper, we propose a new single shot method for multi-person 3D human p...
This paper introduces a general approach to dynamic scene reconstruction from multiple moving camera...
International audienceWe describe a method to obtain a pixel-wise segmentation and pose estimation o...