Safe exploration is a common problem in reinforcement learning (RL) that aims to prevent agents from making disastrous decisions while exploring their environment. A family of approaches to this problem assume domain knowledge in the form of a (partial) model of this environment to decide upon the safety of an action. A so-called shield forces the RL agent to select only safe actions. However, for adoption in various applications, one must look beyond enforcing safety and also ensure the applicability of RL with good performance. We extend the applicability of shields via tight integration with state-of-the-art deep RL, and provide an extensive, empirical study in challenging, sparse-reward environments under partial observability. We show ...
Reinforcement learning (RL) agents need to explore their environments in order to learn optimal poli...
In safe Reinforcement Learning (RL), the agent attempts to find policies which maximize the expectat...
Reinforcement Learning (RL) agents can solve general problems based on little to no knowledge of the...
This paper concerns the efficient construction of a safety shield for reinforcement learning. We spe...
Reinforcement learning algorithms discover policies that maximize reward, but do not necessarily gua...
Reinforcement learning (RL) has shown great potential for solving complex tasks in a variety of doma...
Despite the successes of deep reinforcement learning (RL), it is still challenging to obtain safe po...
Deploying reinforcement learning (RL) involves major concerns around safety. Engineering a reward si...
This paper presents the concept of an adaptive safe padding that forces Reinforcement Learning (RL) ...
Reinforcement learning is an increasingly popular framework that enables robots to learn to perform ...
Safety comes first in many real-world applications involving autonomous agents. Despite a large numb...
Reinforcement learning (RL) is a general method for agents to learn optimal control policies through...
In safe Reinforcement Learning (RL), the agent attempts to find policies which maximize the expectat...
In the past few years, there has been much research in the field of Autonomous Vehicles (AV). If AVs...
In the past few years, there has been much research in the field of Autonomous Vehicles (AV). If AVs...
Reinforcement learning (RL) agents need to explore their environments in order to learn optimal poli...
In safe Reinforcement Learning (RL), the agent attempts to find policies which maximize the expectat...
Reinforcement Learning (RL) agents can solve general problems based on little to no knowledge of the...
This paper concerns the efficient construction of a safety shield for reinforcement learning. We spe...
Reinforcement learning algorithms discover policies that maximize reward, but do not necessarily gua...
Reinforcement learning (RL) has shown great potential for solving complex tasks in a variety of doma...
Despite the successes of deep reinforcement learning (RL), it is still challenging to obtain safe po...
Deploying reinforcement learning (RL) involves major concerns around safety. Engineering a reward si...
This paper presents the concept of an adaptive safe padding that forces Reinforcement Learning (RL) ...
Reinforcement learning is an increasingly popular framework that enables robots to learn to perform ...
Safety comes first in many real-world applications involving autonomous agents. Despite a large numb...
Reinforcement learning (RL) is a general method for agents to learn optimal control policies through...
In safe Reinforcement Learning (RL), the agent attempts to find policies which maximize the expectat...
In the past few years, there has been much research in the field of Autonomous Vehicles (AV). If AVs...
In the past few years, there has been much research in the field of Autonomous Vehicles (AV). If AVs...
Reinforcement learning (RL) agents need to explore their environments in order to learn optimal poli...
In safe Reinforcement Learning (RL), the agent attempts to find policies which maximize the expectat...
Reinforcement Learning (RL) agents can solve general problems based on little to no knowledge of the...