One of the most fundamental problems in Markov decision processes is analysis and control synthesis for safety and reachability specifications. We consider the stochastic reach-avoid problem, in which the objective is to synthesize a control policy to maximize the probability of reaching a target set at a given time, while staying in a safe set at all prior times. We characterize the solution to this problem through an infinite dimensional linear program. We then develop a tractable approximation to the infinite dimensional linear program through finite dimensional approximations of the decision space and constraints. For a large class of Markov decision processes modeled by Gaussian mixtures kernels we show that through a proper selection ...
We consider the infinite dimensional linear programming (inf-LP) approach for solving stochastic con...
Linear Programming is known to be an important and useful tool for solving Markov Decision Processes...
We derive a new expectation maximization algorithm for policy optimization in linear Gaussian Markov...
This article deals with stochastic processes endowed with the Markov (memoryless) property and evolv...
We consider discrete-time Markov decision processes in which the decision maker is interested in lon...
We consider the problem of controlling a Markov decision process (MDP) with a large state space, so ...
We present a dynamic programming based solution to a stochastic reachability problem for a controlle...
In this article we approach a class of stochastic reachability problems with state constraints from ...
We study the problem of learning controllers for discrete-time non-linear stochastic dynamical syste...
We study the problem of learning controllers for discrete-time non-linear stochastic dynamical syste...
Abstract We consider the problem of controlling a Markov decision process (MDP) with a large state s...
Abstract. We develop a novel framework for formulating a class of stochastic reachability problems w...
Abstract. We propose a novel Galerkin discretization scheme for stochastic optimal control problems ...
We introduce a class of Markov decision problems (MDPs) which greatly simplify Reinforcement Learnin...
AbstractVerification of reachability properties for probabilistic systems is usually based on varian...
We consider the infinite dimensional linear programming (inf-LP) approach for solving stochastic con...
Linear Programming is known to be an important and useful tool for solving Markov Decision Processes...
We derive a new expectation maximization algorithm for policy optimization in linear Gaussian Markov...
This article deals with stochastic processes endowed with the Markov (memoryless) property and evolv...
We consider discrete-time Markov decision processes in which the decision maker is interested in lon...
We consider the problem of controlling a Markov decision process (MDP) with a large state space, so ...
We present a dynamic programming based solution to a stochastic reachability problem for a controlle...
In this article we approach a class of stochastic reachability problems with state constraints from ...
We study the problem of learning controllers for discrete-time non-linear stochastic dynamical syste...
We study the problem of learning controllers for discrete-time non-linear stochastic dynamical syste...
Abstract We consider the problem of controlling a Markov decision process (MDP) with a large state s...
Abstract. We develop a novel framework for formulating a class of stochastic reachability problems w...
Abstract. We propose a novel Galerkin discretization scheme for stochastic optimal control problems ...
We introduce a class of Markov decision problems (MDPs) which greatly simplify Reinforcement Learnin...
AbstractVerification of reachability properties for probabilistic systems is usually based on varian...
We consider the infinite dimensional linear programming (inf-LP) approach for solving stochastic con...
Linear Programming is known to be an important and useful tool for solving Markov Decision Processes...
We derive a new expectation maximization algorithm for policy optimization in linear Gaussian Markov...