An important problem in reinforcement learning is the exploration-exploitation dilemma. Especially for environments with sparse or misleading rewards it has proven difficult to construct a good exploration strategy. For discrete domains good exploration strategies have been devised, but are often nontrivial to implement on more complex domains with continuous states and/or actions.In this work, a novel persistent and directed exploration framework is developed, called Smart Start. Usually, a reinforcement learning agent executes its learned policy with some exploration strategy from the start until the end of an episode, which we call ``normal'' learning. The idea of Smart Start is to split a reinforcement learning episode in two parts, the...
This thesis describes reinforcement learning (RL) methods which can solve sequential decision makin...
Reinforcement learning (RL) aims to learn optimal behaviors for agents to maximize cumulative reward...
Applying conventional reinforcement to complex domains requires the use of an overly simplified task...
Exploration plays a fundamental role in any active learning system. This study evaluates the role of...
Reinforcement learning with sparse rewards is still an open challenge. Classic methods rely on getti...
Over the course of the last decade, the framework of reinforcement learning has developed into a pro...
This paper presents a model allowing to tune continual exploration in an optimal way by integrating ...
* This research was partially supported by the Latvian Science Foundation under grant No.02-86d.Effi...
Recent Reinforcement Learning (RL) algorithms, such as R-MAX, make (with high probability) only a sm...
International audienceRealistic environments often provide agents with very limited feedback. When t...
Abstract. Reinforcement learning (RL) involves sequential decision making in uncertain environments....
In this thesis, we discuss various techniques for improving exploration for deep reinforcement learn...
Typically, a reinforcement learning agent interacts with the environment and learns how to select an...
International audienceReinforcement learning (RL) is a paradigm for learning sequential decision mak...
Sparse reward is one of the biggest challenges in reinforcement learning (RL). In this paper, we pro...
This thesis describes reinforcement learning (RL) methods which can solve sequential decision makin...
Reinforcement learning (RL) aims to learn optimal behaviors for agents to maximize cumulative reward...
Applying conventional reinforcement to complex domains requires the use of an overly simplified task...
Exploration plays a fundamental role in any active learning system. This study evaluates the role of...
Reinforcement learning with sparse rewards is still an open challenge. Classic methods rely on getti...
Over the course of the last decade, the framework of reinforcement learning has developed into a pro...
This paper presents a model allowing to tune continual exploration in an optimal way by integrating ...
* This research was partially supported by the Latvian Science Foundation under grant No.02-86d.Effi...
Recent Reinforcement Learning (RL) algorithms, such as R-MAX, make (with high probability) only a sm...
International audienceRealistic environments often provide agents with very limited feedback. When t...
Abstract. Reinforcement learning (RL) involves sequential decision making in uncertain environments....
In this thesis, we discuss various techniques for improving exploration for deep reinforcement learn...
Typically, a reinforcement learning agent interacts with the environment and learns how to select an...
International audienceReinforcement learning (RL) is a paradigm for learning sequential decision mak...
Sparse reward is one of the biggest challenges in reinforcement learning (RL). In this paper, we pro...
This thesis describes reinforcement learning (RL) methods which can solve sequential decision makin...
Reinforcement learning (RL) aims to learn optimal behaviors for agents to maximize cumulative reward...
Applying conventional reinforcement to complex domains requires the use of an overly simplified task...