As safety violations can lead to severe consequences in real-world robotic applications, the increasing deployment of Reinforcement Learning (RL) in robotic domains has propelled the study of safe exploration for reinforcement learning (safe RL). In this work, we propose a risk preventive training method for safe RL, which learns a statistical contrastive classifier to predict the probability of a state-action pair leading to unsafe states. Based on the predicted risk probabilities, we can collect risk preventive trajectories and reshape the reward function with risk penalties to induce safe RL policies. We conduct experiments in robotic simulation environments. The results show the proposed approach has comparable performance with the stat...
Task learning in robotics requires repeatedly executing the same actions in different states to lear...
Deep Reinforcement Learning (RL) has shown promise in addressing complex robotic challenges. In real...
Safe reinforcement learning (RL) with assured satisfaction of hard state constraints during training...
Reinforcement learning is an increasingly popular framework that enables robots to learn to perform ...
Reinforcement learning (RL) algorithms can achieve state-of-the-art performance in decision-making a...
This paper concerns the efficient construction of a safety shield for reinforcement learning. We spe...
Reinforcement learning (RL) algorithms can achieve state-of-the-art performance in decision-making a...
This work introducesPolicy Reuse for Safe Reinforcement Learning, an algorithm that combines Probabi...
This work introducesPolicy Reuse for Safe Reinforcement Learning, an algorithm that combines Probabi...
This work introducesPolicy Reuse for Safe Reinforcement Learning, an algorithm that combines Probabi...
During training, reinforcement learning systems interact with the world without considering the safe...
Safety comes first in many real-world applications involving autonomous agents. Despite a large numb...
Deploying reinforcement learning (RL) involves major concerns around safety. Engineering a reward si...
Task learning in robotics requires repeatedly executing the same actions in different states to lear...
This dissertation proposes and presents solutions to two new problems that fall within the broad sco...
Task learning in robotics requires repeatedly executing the same actions in different states to lear...
Deep Reinforcement Learning (RL) has shown promise in addressing complex robotic challenges. In real...
Safe reinforcement learning (RL) with assured satisfaction of hard state constraints during training...
Reinforcement learning is an increasingly popular framework that enables robots to learn to perform ...
Reinforcement learning (RL) algorithms can achieve state-of-the-art performance in decision-making a...
This paper concerns the efficient construction of a safety shield for reinforcement learning. We spe...
Reinforcement learning (RL) algorithms can achieve state-of-the-art performance in decision-making a...
This work introducesPolicy Reuse for Safe Reinforcement Learning, an algorithm that combines Probabi...
This work introducesPolicy Reuse for Safe Reinforcement Learning, an algorithm that combines Probabi...
This work introducesPolicy Reuse for Safe Reinforcement Learning, an algorithm that combines Probabi...
During training, reinforcement learning systems interact with the world without considering the safe...
Safety comes first in many real-world applications involving autonomous agents. Despite a large numb...
Deploying reinforcement learning (RL) involves major concerns around safety. Engineering a reward si...
Task learning in robotics requires repeatedly executing the same actions in different states to lear...
This dissertation proposes and presents solutions to two new problems that fall within the broad sco...
Task learning in robotics requires repeatedly executing the same actions in different states to lear...
Deep Reinforcement Learning (RL) has shown promise in addressing complex robotic challenges. In real...
Safe reinforcement learning (RL) with assured satisfaction of hard state constraints during training...