We introduce a class of MPDs which greatly simplify Reinforcement Learning. They have discrete state spaces and continuous control spaces. The controls have the effect of rescaling the transition probabilities of an underlying Markov chain. A control cost penalizing KL divergence between controlled and uncontrolled transition probabilities makes the minimization problem convex, and allows analytical computation of the optimal controls given the optimal value function. An exponential transformation of the optimal value function makes the minimized Bellman equation linear. Apart from their theoretical signi cance, the new MDPs enable ef cient approximations to traditional MDPs. Shortest path problems are approximated to arbitrary precision wi...
Address email We present an approximation scheme for solving Markov Decision Processes (MDPs) in whi...
We present a framework to address a class of sequential decision making problems. Our framework feat...
The problem of making optimal decisions in uncertain conditions is central to Artificial Intelligenc...
We introduce a class of MPDs which greatly simplify Reinforcement Learning. They have discrete state...
University of Minnesota M.S. thesis. June 2012. Major: Computer science. Advisor: Prof. Paul Schrate...
Increasing attention has been paid to reinforcement learning algorithms in recent years, partly due ...
We present a hierarchical reinforcement learning framework that formulates each task in the hierarch...
www.cs.tu-berlin.de\∼geibel Abstract. In this article, I will consider Markov Decision Processes wit...
In this paper, we study the problem of transferring the available Markov Decision Process (MDP) mode...
Optimal control provides an appealing machinery to complete complicated control tasks with limited p...
We study reinforcement learning (RL) with linear function approximation. For episodic time-inhomogen...
Semi-Markov Decision Problems are continuous time generalizations of discrete time Markov Decision P...
This paper provides new techniques for abstracting the state space of a Markov Decision Process (MD...
We propose a novel approach for solving continuous and hybrid Markov Decision Processes (MDPs) based...
Solving Markov decision processes (MDPs) efficiently is challenging in many cases, for example, when...
Address email We present an approximation scheme for solving Markov Decision Processes (MDPs) in whi...
We present a framework to address a class of sequential decision making problems. Our framework feat...
The problem of making optimal decisions in uncertain conditions is central to Artificial Intelligenc...
We introduce a class of MPDs which greatly simplify Reinforcement Learning. They have discrete state...
University of Minnesota M.S. thesis. June 2012. Major: Computer science. Advisor: Prof. Paul Schrate...
Increasing attention has been paid to reinforcement learning algorithms in recent years, partly due ...
We present a hierarchical reinforcement learning framework that formulates each task in the hierarch...
www.cs.tu-berlin.de\∼geibel Abstract. In this article, I will consider Markov Decision Processes wit...
In this paper, we study the problem of transferring the available Markov Decision Process (MDP) mode...
Optimal control provides an appealing machinery to complete complicated control tasks with limited p...
We study reinforcement learning (RL) with linear function approximation. For episodic time-inhomogen...
Semi-Markov Decision Problems are continuous time generalizations of discrete time Markov Decision P...
This paper provides new techniques for abstracting the state space of a Markov Decision Process (MD...
We propose a novel approach for solving continuous and hybrid Markov Decision Processes (MDPs) based...
Solving Markov decision processes (MDPs) efficiently is challenging in many cases, for example, when...
Address email We present an approximation scheme for solving Markov Decision Processes (MDPs) in whi...
We present a framework to address a class of sequential decision making problems. Our framework feat...
The problem of making optimal decisions in uncertain conditions is central to Artificial Intelligenc...