An active object recognition system has the advantage of acting in the environment to capture images that are more suited for training and lead to better performance at test time. In this paper, we utilize deep convolutional neural networks for active object recognition by simultaneously predicting the object label and the next action to be performed on the object with the aim of improving recognition performance. We treat active object recognition as a reinforcement learning problem and derive the cost function to train the network for joint prediction of the object label and the action. A generative model of object similarities based on the Dirichlet distribution is proposed and embedded in the network for encoding the state of the system...
Deep learning has been the most popular feature learning method used for a variety of computer visio...
Machine learning models of visual action recognition are typically trained and tested on data from s...
Sufficient supervised information is crucial for any machine learning models to boost performance. H...
In this work, we examine the literature of active object recognition in the past and present. We not...
In recent years, great success has been achieved in visual object detection, which is one of the fun...
The topic of this thesis is the combination of active learning strategies used in conjunction with ...
In recent years, great success has been achieved in visual object detection, which is one of the fun...
Visual object detection is one of the fundamental tasks in computer vision and robotics. Small scale...
While there have been extensive applications deploying object detection, one of its limitations is t...
Active learning is a label-efficient machine learning method that actively selects the most valuable...
A novel deep neural network training paradigm that exploits the conjoint information in multiple het...
Current state-of-the-art deep learning systems for visual object recognition and detection use purel...
A mobile agent with the task to classify its sensor pattern has to cope with ambiguous information. ...
We propose drl-RPN, a deep reinforcement learning-based visual recognition model consisting of a seq...
Deep learning has been proven to be effective for classification problems. However, the majority of ...
Deep learning has been the most popular feature learning method used for a variety of computer visio...
Machine learning models of visual action recognition are typically trained and tested on data from s...
Sufficient supervised information is crucial for any machine learning models to boost performance. H...
In this work, we examine the literature of active object recognition in the past and present. We not...
In recent years, great success has been achieved in visual object detection, which is one of the fun...
The topic of this thesis is the combination of active learning strategies used in conjunction with ...
In recent years, great success has been achieved in visual object detection, which is one of the fun...
Visual object detection is one of the fundamental tasks in computer vision and robotics. Small scale...
While there have been extensive applications deploying object detection, one of its limitations is t...
Active learning is a label-efficient machine learning method that actively selects the most valuable...
A novel deep neural network training paradigm that exploits the conjoint information in multiple het...
Current state-of-the-art deep learning systems for visual object recognition and detection use purel...
A mobile agent with the task to classify its sensor pattern has to cope with ambiguous information. ...
We propose drl-RPN, a deep reinforcement learning-based visual recognition model consisting of a seq...
Deep learning has been proven to be effective for classification problems. However, the majority of ...
Deep learning has been the most popular feature learning method used for a variety of computer visio...
Machine learning models of visual action recognition are typically trained and tested on data from s...
Sufficient supervised information is crucial for any machine learning models to boost performance. H...