Humans are capable of perceiving a scene at a glance, and obtain deeper understanding with additional time. Computer visual recognition should be similarly robust to varying computational budgets --- a property we call Anytime recognition. We present a general method for learning dynamic policies to optimize Anytime performance in visual recognition. We approach this problem from the perspective of Markov Decision Processes, and use reinforcement learning techniques. Crucially, decisions are made at test time and depend on observed data and intermediate results. Our method is applicable to a wide variety of existing detectors and classifiers, as it learns from execution traces and requires no special knowledge of their implementation.We fir...
Humans learn robust and efficient strategies for visual tasks through interaction with their environ...
Attention is a highly important phenomenon emerging in infant development [1]. In human perception, ...
We propose drl-RPN, a deep reinforcement learning-based visual recognition model consisting of a seq...
Humans are capable of perceiving a scene at a glance, and obtain deeper understanding with additiona...
Structured prediction plays a central role in machine learning appli-cations from computational biol...
In most real-world information processing problems, data is not a free resource. Its acquisition is ...
Humans can quickly recognize objects in a dynamically changing world. This ability is showcased by t...
Abstract. In most real-world information processing problems, data is not a free resource; its acqui...
In many practical applications of machine learning data arrives sequentially over time in large chun...
This work proposes to learn visual encodings of attention patterns that enables sequential attention...
This dissertation is a computational investigation of the task of locating and recognizing objects i...
Object recognition in the real world is a big challenge in the field of computer vision. Given the p...
We consider an attention-based model that recognizes objects via a sequence of glimpses, and analyze...
Applying convolutional neural networks to large images is computationally ex-pensive because the amo...
Abstract. The innovation of this work is the provision of a system that learns visual encodings of a...
Humans learn robust and efficient strategies for visual tasks through interaction with their environ...
Attention is a highly important phenomenon emerging in infant development [1]. In human perception, ...
We propose drl-RPN, a deep reinforcement learning-based visual recognition model consisting of a seq...
Humans are capable of perceiving a scene at a glance, and obtain deeper understanding with additiona...
Structured prediction plays a central role in machine learning appli-cations from computational biol...
In most real-world information processing problems, data is not a free resource. Its acquisition is ...
Humans can quickly recognize objects in a dynamically changing world. This ability is showcased by t...
Abstract. In most real-world information processing problems, data is not a free resource; its acqui...
In many practical applications of machine learning data arrives sequentially over time in large chun...
This work proposes to learn visual encodings of attention patterns that enables sequential attention...
This dissertation is a computational investigation of the task of locating and recognizing objects i...
Object recognition in the real world is a big challenge in the field of computer vision. Given the p...
We consider an attention-based model that recognizes objects via a sequence of glimpses, and analyze...
Applying convolutional neural networks to large images is computationally ex-pensive because the amo...
Abstract. The innovation of this work is the provision of a system that learns visual encodings of a...
Humans learn robust and efficient strategies for visual tasks through interaction with their environ...
Attention is a highly important phenomenon emerging in infant development [1]. In human perception, ...
We propose drl-RPN, a deep reinforcement learning-based visual recognition model consisting of a seq...