Scene understanding, including object recognition, is perhaps the most challenging task in computer vision. Deep convolutional neural networks (CNNs) have received a flurry of interest in the past few years due to their superior performance. However, deep networks are computationally expensive and without efficient implementation on high performance computing systems not as practical as older methods. Furthermore, CNNs do not benefit from the human's visual selective attention and top-down contextual feedback connections. The human visual system makes extensive use of contextual information to facilitate and refine object detections; object detection and recognition based only on intrinsic features of target objects is not usually sufficien...
© . This manuscript version is made available under the CC-BY-NC-ND 4.0 license http://creativecommo...
Feedforward deep convolutional neural networks (DCNNs) are, under specific conditions, matching and ...
The human visual system can recognize several thousand object categories irrespective of their posit...
There have been significant improvements in the accuracy of scene understanding due to a shift from ...
This thesis concerns itself with the use and examination of convolutional neural networks in the con...
Knowing where to look in an image can significantly improve performance in computer vision tasks by ...
Deep convolutional neural networks (CNNs) have shown a strong ability in mining discriminative objec...
Understanding and interacting with one’s environment requires parsing the image of the environment ...
Understanding objects in complex scenes is a fundamental and challenging problem in computer vision....
Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer...
A fundamental problem in computer vision is knowing what is in the image and where it is. We develop...
David Marr famously defined vision as "knowing what is where by seeing". In the framework described ...
Object detection has improved very rapidly in the last decades, but because they are very essential ...
Abstract. Scene understanding is an important problem in intelligent robotics. Since visual informat...
Visual saliency computation is about detecting and understanding salient regions and elements in a v...
© . This manuscript version is made available under the CC-BY-NC-ND 4.0 license http://creativecommo...
Feedforward deep convolutional neural networks (DCNNs) are, under specific conditions, matching and ...
The human visual system can recognize several thousand object categories irrespective of their posit...
There have been significant improvements in the accuracy of scene understanding due to a shift from ...
This thesis concerns itself with the use and examination of convolutional neural networks in the con...
Knowing where to look in an image can significantly improve performance in computer vision tasks by ...
Deep convolutional neural networks (CNNs) have shown a strong ability in mining discriminative objec...
Understanding and interacting with one’s environment requires parsing the image of the environment ...
Understanding objects in complex scenes is a fundamental and challenging problem in computer vision....
Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer...
A fundamental problem in computer vision is knowing what is in the image and where it is. We develop...
David Marr famously defined vision as "knowing what is where by seeing". In the framework described ...
Object detection has improved very rapidly in the last decades, but because they are very essential ...
Abstract. Scene understanding is an important problem in intelligent robotics. Since visual informat...
Visual saliency computation is about detecting and understanding salient regions and elements in a v...
© . This manuscript version is made available under the CC-BY-NC-ND 4.0 license http://creativecommo...
Feedforward deep convolutional neural networks (DCNNs) are, under specific conditions, matching and ...
The human visual system can recognize several thousand object categories irrespective of their posit...