Scalable and effective exploration remains a key challenge in reinforcement learning (RL). While there are methods with optimality guarantees in the setting of discrete state and action spaces, these methods cannot be applied in high-dimensional deep RL scenarios. As such, most contemporary RL relies on simple heuristics such as epsilon-greedy exploration or adding Gaussian noise to the controls. This paper introduces Variational Information Maximizing Exploration (VIME), an exploration strategy based on maximization of information gain about the agent's belief of environment dynamics. We propose a practical implementation, using variational inference in Bayesian neural networks which efficiently handles continuous state and action spaces. ...
Bayesian reinforcement learning (RL) offers a principled and elegant approach for sequential decisio...
Many real-world sequential decision making problems are partially observable by nature, and the envi...
A fundamental concern in the deployment of artificial agents in real-life is their capacity to quick...
Scalable and effective exploration remains a key challenge in reinforcement learning (RL). While the...
Reinforcement learning is a powerful approach for learning control policies that solve sequential de...
This thesis explores Bayesian and variational inference in the context of solving the reinforcement ...
The impetus for exploration in reinforcement learning (RL) is decreasing uncertainty about the envir...
Reinforcement learning (RL) agents need to explore their environments in order to learn optimal poli...
This paper presents a model allowing to tune continual exploration in an optimal way by integrating ...
This paper presents a framework allowing to tune continual exploration in an optimal way. It first q...
Reinforcement learning has shown great potential in generalizing over raw sensory data using only a ...
Abstract. This paper presents a framework allowing to tune continual explo-ration in an optimal way....
Sparse reward is one of the biggest challenges in reinforcement learning (RL). In this paper, we pro...
A fundamental challenge for reinforcement learning (RL) is how to achieve effcient exploration in in...
Trading off exploration and exploitation in an unknown environment is key to maximising expected onl...
Bayesian reinforcement learning (RL) offers a principled and elegant approach for sequential decisio...
Many real-world sequential decision making problems are partially observable by nature, and the envi...
A fundamental concern in the deployment of artificial agents in real-life is their capacity to quick...
Scalable and effective exploration remains a key challenge in reinforcement learning (RL). While the...
Reinforcement learning is a powerful approach for learning control policies that solve sequential de...
This thesis explores Bayesian and variational inference in the context of solving the reinforcement ...
The impetus for exploration in reinforcement learning (RL) is decreasing uncertainty about the envir...
Reinforcement learning (RL) agents need to explore their environments in order to learn optimal poli...
This paper presents a model allowing to tune continual exploration in an optimal way by integrating ...
This paper presents a framework allowing to tune continual exploration in an optimal way. It first q...
Reinforcement learning has shown great potential in generalizing over raw sensory data using only a ...
Abstract. This paper presents a framework allowing to tune continual explo-ration in an optimal way....
Sparse reward is one of the biggest challenges in reinforcement learning (RL). In this paper, we pro...
A fundamental challenge for reinforcement learning (RL) is how to achieve effcient exploration in in...
Trading off exploration and exploitation in an unknown environment is key to maximising expected onl...
Bayesian reinforcement learning (RL) offers a principled and elegant approach for sequential decisio...
Many real-world sequential decision making problems are partially observable by nature, and the envi...
A fundamental concern in the deployment of artificial agents in real-life is their capacity to quick...