The ability of a perception system to discern what is important in a scene and what is not is an invaluable asset, with multiple applications in object recognition, people detection and SLAM, among others. In this paper, we aim to analyze all sensory data available to separate a scene into a few physically meaningful parts, which we term structure, while discarding background clutter. In particular, we consider the combination of image and range data, and base our decision in both appearance and 3D shape. Our main contribution is the development of a framework to perform scene segmentation that preserves physical objects using multi-modal data. We combine image and range data using a novel mid-level fusion technique based on the concept of ...
Object detection and multi-class image segmentation are two closely related tasks that can be greatl...
Whether for purposes of compression efficiency, image editing, interactive multimedia authoring, or ...
Given a 3-D range image of a scene containing multiple arbi-trarily shaped objects, we segment the s...
High-level, or holistic, scene understanding involves reasoning about objects, regions, and the 3D r...
How does the visual system encode natural scenes? What are the basic structures of natural scenes? I...
The problem of structure detection in images involves the identification of local groups of pixels t...
Multi-scale window scanning has been popular in object detection but it generalizes poorly to comple...
textIn this thesis, I explore region detection and consider its impact on image matching for exempla...
Scene recognition is a fundamental and open problem in computer vision. It is an essential component...
A feature detector and a feature descriptor are presented, which are applicable to 3D range data. Th...
Techniques that are fast for matching in computer vision, such as the bag-of-features model, general...
Humans can grasp the gist of complex natural scenes very quickly and can remember extraordinarily ri...
In this paper, we introduce a multi-modal graphical model to address the problems of semantic segmen...
In this paper, we propose a strategy to detect objects from still images that relies on combining tw...
This thesis addresses the problem of learning object classification using multi-view range data. Cla...
Object detection and multi-class image segmentation are two closely related tasks that can be greatl...
Whether for purposes of compression efficiency, image editing, interactive multimedia authoring, or ...
Given a 3-D range image of a scene containing multiple arbi-trarily shaped objects, we segment the s...
High-level, or holistic, scene understanding involves reasoning about objects, regions, and the 3D r...
How does the visual system encode natural scenes? What are the basic structures of natural scenes? I...
The problem of structure detection in images involves the identification of local groups of pixels t...
Multi-scale window scanning has been popular in object detection but it generalizes poorly to comple...
textIn this thesis, I explore region detection and consider its impact on image matching for exempla...
Scene recognition is a fundamental and open problem in computer vision. It is an essential component...
A feature detector and a feature descriptor are presented, which are applicable to 3D range data. Th...
Techniques that are fast for matching in computer vision, such as the bag-of-features model, general...
Humans can grasp the gist of complex natural scenes very quickly and can remember extraordinarily ri...
In this paper, we introduce a multi-modal graphical model to address the problems of semantic segmen...
In this paper, we propose a strategy to detect objects from still images that relies on combining tw...
This thesis addresses the problem of learning object classification using multi-view range data. Cla...
Object detection and multi-class image segmentation are two closely related tasks that can be greatl...
Whether for purposes of compression efficiency, image editing, interactive multimedia authoring, or ...
Given a 3-D range image of a scene containing multiple arbi-trarily shaped objects, we segment the s...