Computer vision and image understanding is the problem of interpreting images by locating, recognizing objects, attributes and other higher level features in an image. In this thesis, I seek to tackle this broad problem using deep learning techniques. More specifically, I build deep neural network based models to solve two specific problems to understand images in a high level: album wise image understanding with event-specific image importance score, and description generation for an image. I first focus on the understanding of a collection of images in an event album. In an event album, some images are more important or interesting to save or present than others, and I show that with an event-specific image importance property, we can ...
Our answer is, if used for challenging computer vision tasks, attributes are useful privileged data....
This Paper will involve developing a model that generates suitable captions for images. This will he...
Most of the approaches for discovering visual attributes in images demand significant supervision, w...
Computer vision and image understanding is the problem of interpreting images by locating, recognizi...
As Deep learning emerges from Machine learning to become a leading technology in today’s day and age...
In 2018, the number of mobile phone users will reach about 4.9 billion. Assuming an aver...
Classifying images helps computers identify more things that humans can see. to identify the picture...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
Deep Learning has attained state-of-the-art performance in the recent years, but it is still hard to...
I present my work towards learning a better computer vision system that learns and generalizes objec...
Efficient Image data storage and tagged Image Archive are quintessential in organizations due to the...
Deep Convolutional Neural Networks (DCNNs) have achieved superior performance in many computer visio...
Describing the contents of images is a challenging task for machines to achieve. It requires not onl...
In this paper two new learning-based eXplainable AI (XAI) methods for deep convolutional neural netw...
Event recognition from still images is one of the most im-portant problems for image understanding. ...
Our answer is, if used for challenging computer vision tasks, attributes are useful privileged data....
This Paper will involve developing a model that generates suitable captions for images. This will he...
Most of the approaches for discovering visual attributes in images demand significant supervision, w...
Computer vision and image understanding is the problem of interpreting images by locating, recognizi...
As Deep learning emerges from Machine learning to become a leading technology in today’s day and age...
In 2018, the number of mobile phone users will reach about 4.9 billion. Assuming an aver...
Classifying images helps computers identify more things that humans can see. to identify the picture...
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Comp...
Deep Learning has attained state-of-the-art performance in the recent years, but it is still hard to...
I present my work towards learning a better computer vision system that learns and generalizes objec...
Efficient Image data storage and tagged Image Archive are quintessential in organizations due to the...
Deep Convolutional Neural Networks (DCNNs) have achieved superior performance in many computer visio...
Describing the contents of images is a challenging task for machines to achieve. It requires not onl...
In this paper two new learning-based eXplainable AI (XAI) methods for deep convolutional neural netw...
Event recognition from still images is one of the most im-portant problems for image understanding. ...
Our answer is, if used for challenging computer vision tasks, attributes are useful privileged data....
This Paper will involve developing a model that generates suitable captions for images. This will he...
Most of the approaches for discovering visual attributes in images demand significant supervision, w...