Many physical systems have underlying safety considerations that require that the policy employed ensures the satisfaction of a set of constraints. The analytical formulation usually takes the form of a Constrained Markov Decision Process (CMDP). We focus on the case where the CMDP is unknown, and RL algorithms obtain samples to discover the model and compute an optimal constrained policy. Our goal is to characterize the relationship between safety constraints and the number of samples needed to ensure a desired level of accuracy---both objective maximization and constraint satisfaction---in a PAC sense. We explore two classes of RL algorithms, namely, (i) a generative model based approach, wherein samples are taken initially to estimate...
Reinforcement Learning (RL) in finite state and action Markov Decision Processes is studied with an ...
The Robust Markov Decision Process (RMDP) framework focuses on designing control policies that are r...
Reinforcement Learning (RL) is a widely employed machine learning architecture that has been applied...
The general assumption in reinforcement learning(RL) that agents are free to explore for searching o...
In reinforcement learning (RL), an agent must explore an initially unknown environment in order to l...
This paper concerns the efficient construction of a safety shield for reinforcement learning. We spe...
In areas where prior data is not available, data scientists use reinforcement learning (RL) on onlin...
Deploying reinforcement learning (RL) involves major concerns around safety. Engineering a reward si...
www.cs.tu-berlin.de\∼geibel Abstract. In this article, I will consider Markov Decision Processes wit...
In this paper, we consider Markov Decision Processes (MDPs) with error states. Error states are thos...
We present a reinforcement learning approach to explore and optimize a safety-constrained Markov Dec...
In reinforcement learning (RL), an agent must explore an initially unknown environment in order to l...
We consider the challenge of policy simplification and verification in the context of policies learn...
In this work we address the problem of finding feasible policies for Constrained Markov Decision Pro...
Abstract. Reinforcement learning means finding the optimal course of action in Markovian environment...
Reinforcement Learning (RL) in finite state and action Markov Decision Processes is studied with an ...
The Robust Markov Decision Process (RMDP) framework focuses on designing control policies that are r...
Reinforcement Learning (RL) is a widely employed machine learning architecture that has been applied...
The general assumption in reinforcement learning(RL) that agents are free to explore for searching o...
In reinforcement learning (RL), an agent must explore an initially unknown environment in order to l...
This paper concerns the efficient construction of a safety shield for reinforcement learning. We spe...
In areas where prior data is not available, data scientists use reinforcement learning (RL) on onlin...
Deploying reinforcement learning (RL) involves major concerns around safety. Engineering a reward si...
www.cs.tu-berlin.de\∼geibel Abstract. In this article, I will consider Markov Decision Processes wit...
In this paper, we consider Markov Decision Processes (MDPs) with error states. Error states are thos...
We present a reinforcement learning approach to explore and optimize a safety-constrained Markov Dec...
In reinforcement learning (RL), an agent must explore an initially unknown environment in order to l...
We consider the challenge of policy simplification and verification in the context of policies learn...
In this work we address the problem of finding feasible policies for Constrained Markov Decision Pro...
Abstract. Reinforcement learning means finding the optimal course of action in Markovian environment...
Reinforcement Learning (RL) in finite state and action Markov Decision Processes is studied with an ...
The Robust Markov Decision Process (RMDP) framework focuses on designing control policies that are r...
Reinforcement Learning (RL) is a widely employed machine learning architecture that has been applied...