Safety is a critical hurdle that limits the application of deep reinforcement learning to real-world control tasks. To this end, constrained reinforcement learning leverages cost functions to improve safety in constrained Markov decision process. However, constrained methods fail to achieve zero violation even when the cost limit is zero. This paper analyzes the reason for such failure, which suggests that a proper cost function plays an important role in constrained RL. Inspired by the analysis, we propose AutoCost, a simple yet effective framework that automatically searches for cost functions that help constrained RL to achieve zero-violation performance. We validate the proposed method and the searched cost function on the safety benchm...
Reinforcement learning (RL) for robotics is challenging due to the difficulty in hand-engineering a ...
We consider the safe reinforcement learning (RL) problem of maximizing utility with extremely low co...
The general assumption in reinforcement learning(RL) that agents are free to explore for searching o...
Safety is a critical hurdle that limits the application of deep reinforcement learning (RL) to real-...
Reinforcement learning (RL) agents need to explore their environments in order to learn optimal poli...
One approach to guaranteeing safety in Reinforcement Learning is through cost constraints that are d...
Deploying reinforcement learning (RL) involves major concerns around safety. Engineering a reward si...
Safety comes first in many real-world applications involving autonomous agents. Despite a large numb...
Many physical systems have underlying safety considerations that require that the policy employed en...
We study the problem of safe offline reinforcement learning (RL), the goal is to learn a policy that...
In reinforcement learning (RL), an agent must explore an initially unknown environment in order to l...
In safe Reinforcement Learning (RL), the agent attempts to find policies which maximize the expectat...
In safe Reinforcement Learning (RL), the agent attempts to find policies which maximize the expectat...
Reinforcement learning (RL) has been widely used, for example, in robotics, recommendation systems, ...
Reinforcement learning (RL) has been widely used, for example, in robotics, recommendation systems, ...
Reinforcement learning (RL) for robotics is challenging due to the difficulty in hand-engineering a ...
We consider the safe reinforcement learning (RL) problem of maximizing utility with extremely low co...
The general assumption in reinforcement learning(RL) that agents are free to explore for searching o...
Safety is a critical hurdle that limits the application of deep reinforcement learning (RL) to real-...
Reinforcement learning (RL) agents need to explore their environments in order to learn optimal poli...
One approach to guaranteeing safety in Reinforcement Learning is through cost constraints that are d...
Deploying reinforcement learning (RL) involves major concerns around safety. Engineering a reward si...
Safety comes first in many real-world applications involving autonomous agents. Despite a large numb...
Many physical systems have underlying safety considerations that require that the policy employed en...
We study the problem of safe offline reinforcement learning (RL), the goal is to learn a policy that...
In reinforcement learning (RL), an agent must explore an initially unknown environment in order to l...
In safe Reinforcement Learning (RL), the agent attempts to find policies which maximize the expectat...
In safe Reinforcement Learning (RL), the agent attempts to find policies which maximize the expectat...
Reinforcement learning (RL) has been widely used, for example, in robotics, recommendation systems, ...
Reinforcement learning (RL) has been widely used, for example, in robotics, recommendation systems, ...
Reinforcement learning (RL) for robotics is challenging due to the difficulty in hand-engineering a ...
We consider the safe reinforcement learning (RL) problem of maximizing utility with extremely low co...
The general assumption in reinforcement learning(RL) that agents are free to explore for searching o...