Current approaches to semantic image and scene understanding typically employ rather simple object representations such as 2D or 3D bounding boxes. While such coarse models are robust and allow for reliable object detection, they discard much of the information about objects' 3D shape and pose, and thus do not lend themselves well to higher-level reasoning. Here, we propose to base scene understanding on a high-resolution object representation. An object class—in our case cars—is modeled as a deformable 3D wireframe, which enables fine-grained modeling at the level of individual vertices and faces. We augment that model to explicitly include vertex-level occlusion, and embed all instances in a common coordinate frame, in order to infer and ...
Humans possess a remarkable ability to extract general object representations from a single image, c...
We present a method that estimates in real-time and un-der challenging conditions the 3D pose of a k...
Improvement in acquisition systems, has resulted in the ability to capture more realistic 3D models ...
Despite the success of current state-of-the-art object class detectors, severe occlusion remains a m...
In this thesis, we propose novel 2D and 3D models for object and scene understanding from images. Th...
Current object class recognition systems typically target 2D bounding box localization, encouraged b...
In this thesis, we describe a data-driven approach to leverage repositories of 3D models for scene u...
Geometric 3D reasoning has received renewed attention recently, in the context of visual scene under...
Current object class recognition systems typically target 2D bounding box localization, encouraged b...
This thesis contributes to the emerging field of 3D scene understanding. That is, given a 3D scene r...
In this paper, we propose a data-driven approach to leverage repositories of 3D models for scene und...
Following recent advances in detection, context modeling, and tracking, scene understanding has been...
This paper addresses the problem of category-level 3D object detection. Given a monocular image, our...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
When humans look at an image, they see not just a pattern of color and texture, but the world behind...
Humans possess a remarkable ability to extract general object representations from a single image, c...
We present a method that estimates in real-time and un-der challenging conditions the 3D pose of a k...
Improvement in acquisition systems, has resulted in the ability to capture more realistic 3D models ...
Despite the success of current state-of-the-art object class detectors, severe occlusion remains a m...
In this thesis, we propose novel 2D and 3D models for object and scene understanding from images. Th...
Current object class recognition systems typically target 2D bounding box localization, encouraged b...
In this thesis, we describe a data-driven approach to leverage repositories of 3D models for scene u...
Geometric 3D reasoning has received renewed attention recently, in the context of visual scene under...
Current object class recognition systems typically target 2D bounding box localization, encouraged b...
This thesis contributes to the emerging field of 3D scene understanding. That is, given a 3D scene r...
In this paper, we propose a data-driven approach to leverage repositories of 3D models for scene und...
Following recent advances in detection, context modeling, and tracking, scene understanding has been...
This paper addresses the problem of category-level 3D object detection. Given a monocular image, our...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
When humans look at an image, they see not just a pattern of color and texture, but the world behind...
Humans possess a remarkable ability to extract general object representations from a single image, c...
We present a method that estimates in real-time and un-der challenging conditions the 3D pose of a k...
Improvement in acquisition systems, has resulted in the ability to capture more realistic 3D models ...