Institute for Adaptive and Neural ComputationDeveloping computer vision algorithms able to learn from unsegmented images containing multiple objects is important since this is how humans constantly learn from visual experiences. In this thesis we consider images containing views of multiple objects and our task is to learn about each of the objects present in the images. This task can be approached as a factorial learning problem, where each image is explained by instantiating a model for each of the objects present with the correct instantiation parameters. A major problem with learning a factorial model is that as the number of objects increases, there is a combinatorial explosion of the number of configurations that need to be consi...
Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer...
Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Compute...
Single view imaging data has been used in most previous research in computer vision and image under...
Developing computer vision algorithms able to learn from unsegmented images containing multiple obje...
We consider data that are images containing views of multiple objects. Our task is to learn about ea...
We consider data which are images containing views of multiple objects. Our task is to learn about e...
We propose a method to learn heterogeneous models of object classes for visual recognition. The tra...
With the widespread deployment of sensors and the Internet-of-Things, multi-view data have become mo...
This dissertation addresses the task of detecting instances of object categories in photographs. We ...
We present a method of recognizing three-dimensional objects in intensity images of cluttered scene...
We study unsupervised learning of occluding objects in images of visual scenes. The derived learning...
The fate of contemporary scientific research in biology and medicine is bound to the advancements in...
Learning visual models of object categories notoriously requires hundreds or thousands of training e...
The task of scene understanding involves recognizing the different objects present in the scene, seg...
A key problem in learning representations of multiple objects from unlabeled images is that it is a ...
Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer...
Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Compute...
Single view imaging data has been used in most previous research in computer vision and image under...
Developing computer vision algorithms able to learn from unsegmented images containing multiple obje...
We consider data that are images containing views of multiple objects. Our task is to learn about ea...
We consider data which are images containing views of multiple objects. Our task is to learn about e...
We propose a method to learn heterogeneous models of object classes for visual recognition. The tra...
With the widespread deployment of sensors and the Internet-of-Things, multi-view data have become mo...
This dissertation addresses the task of detecting instances of object categories in photographs. We ...
We present a method of recognizing three-dimensional objects in intensity images of cluttered scene...
We study unsupervised learning of occluding objects in images of visual scenes. The derived learning...
The fate of contemporary scientific research in biology and medicine is bound to the advancements in...
Learning visual models of object categories notoriously requires hundreds or thousands of training e...
The task of scene understanding involves recognizing the different objects present in the scene, seg...
A key problem in learning representations of multiple objects from unlabeled images is that it is a ...
Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer...
Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Compute...
Single view imaging data has been used in most previous research in computer vision and image under...