Visual perception is a fundamental task of computer vision. Subtasks within perception can be decomposed into two types: reasoning about the generative process of images or phenomena themselves (i.e., a prior on the sensory input we expect to perceive) as well as discriminating high-level structural information contained within them. In this dissertation, we first explore how this decomposition is not as decoupled as it might appear. In particular, we show how the act of discrimination is sufficient to produce a model of generative ability. Furthermore, while vision algorithms are required to discriminate about visible aspects of a scene, it is often useful to reason about what cannot be seen within a 2D observation. We explore a practical ...
The goal of scene understanding is to capture the full content of an image in a human-interpretable ...
Unsupervised algorithms which do not make use of labels are commonly found in computer vision and ar...
In this paper we present a Bayesian framework for parsing images into their constituent visual patte...
Visual perception is a fundamental task of computer vision. Subtasks within perception can be decomp...
A successful vision system must solve the problem of deriving geometrical information about three-di...
This thesis concerns itself with the use and examination of convolutional neural networks in the con...
<p>Scene perception is a fundamental aspect of vision. Humans are capable of analyzing behaviorally-...
Scene understanding is one of the holy grails of computer vision. Despite decades of research on sce...
We introduce a framework to learn object segmentation from a collection of images without any manual...
Although humans can effortlessly recognise a scene in its totality, it is an extremely challenging p...
Natural images arise from complicated processes involving many factors of variation. They reflect th...
Visual pattern detection and discrimination are essential first steps for scene analysis. Numerous h...
We present a generative model of images that explicitly reasons over the set of objects they show. O...
Visual object recognition is one of the key human capabilities that we would like machines to have. ...
An instance with a bad mask might make a composite image that uses it look fake. This encourages us ...
The goal of scene understanding is to capture the full content of an image in a human-interpretable ...
Unsupervised algorithms which do not make use of labels are commonly found in computer vision and ar...
In this paper we present a Bayesian framework for parsing images into their constituent visual patte...
Visual perception is a fundamental task of computer vision. Subtasks within perception can be decomp...
A successful vision system must solve the problem of deriving geometrical information about three-di...
This thesis concerns itself with the use and examination of convolutional neural networks in the con...
<p>Scene perception is a fundamental aspect of vision. Humans are capable of analyzing behaviorally-...
Scene understanding is one of the holy grails of computer vision. Despite decades of research on sce...
We introduce a framework to learn object segmentation from a collection of images without any manual...
Although humans can effortlessly recognise a scene in its totality, it is an extremely challenging p...
Natural images arise from complicated processes involving many factors of variation. They reflect th...
Visual pattern detection and discrimination are essential first steps for scene analysis. Numerous h...
We present a generative model of images that explicitly reasons over the set of objects they show. O...
Visual object recognition is one of the key human capabilities that we would like machines to have. ...
An instance with a bad mask might make a composite image that uses it look fake. This encourages us ...
The goal of scene understanding is to capture the full content of an image in a human-interpretable ...
Unsupervised algorithms which do not make use of labels are commonly found in computer vision and ar...
In this paper we present a Bayesian framework for parsing images into their constituent visual patte...