We introduce a class of Markov decision problems (MDPs) which greatly simplify Reinforcement Learning. These MDPs have discrete state spaces and continuous control spaces. The controls have the effect of scaling the transition probabilities of an underlying Markov chain. A control cost penalizing KL divergence between controlled and uncontrolled transition probabilities makes the minimization problem convex and allows analytical computation of the optimal controls given the optimal value function. An exponential transformation of the optimal value function makes the minimized Bellman equation linear. Apart from their theoretical significance, the new MDPs enable efficient approximations to traditional MDPs. Shortest path problems are approxim...
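The abstract's key construction can be sketched numerically. Under the exponential transformation z(i) = exp(-v(i)), the minimized Bellman equation becomes linear in z, and the optimal controlled transitions are obtained analytically by reweighting the passive dynamics by z. The toy chain below is an illustrative assumption, not an example from the paper; `q` is the state cost and `P` the uncontrolled dynamics:

```python
import numpy as np

# Illustrative sketch: a linearly-solvable MDP on a 4-state chain with
# terminal state 3. The desirability z(i) = exp(-v(i)) satisfies the
# linear Bellman equation  z = exp(-q) * (P @ z)  at non-terminal states.

q = np.array([1.0, 1.0, 1.0, 0.0])   # state costs; terminal cost 0
P = np.array([                        # passive (uncontrolled) dynamics
    [0.5, 0.5, 0.0, 0.0],
    [0.5, 0.0, 0.5, 0.0],
    [0.0, 0.5, 0.0, 0.5],
    [0.0, 0.0, 0.0, 1.0],             # terminal state is absorbing
])

z = np.ones(4)
for _ in range(200):                  # fixed-point iteration on the linear map
    z = np.exp(-q) * (P @ z)
    z[3] = 1.0                        # boundary condition: z = exp(-0) = 1

v = -np.log(z)                        # recover the optimal value function

# optimal controlled transitions: u*(j|i) proportional to p(j|i) z(j)
u = P * z[None, :]
u /= u.sum(axis=1, keepdims=True)
print(np.round(v, 3))
```

Note that no iteration over controls is needed: the controls are computed in closed form from z, which is what makes this class of MDPs cheap to solve.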
Markov decision problems (MDPs) provide the foundations for a number of problems of interest to AI r...
This study is concerned with finite Markov decision processes (MDPs) whose states are exactly observa...
We present an approximation scheme for solving Markov Decision Processes (MDPs) in whi...
Increasing attention has been paid to reinforcement learning algorithms in recent years, partly due ...
University of Minnesota M.S. thesis. June 2012. Major: Computer science. Advisor: Prof. Paul Schrate...
We present a hierarchical reinforcement learning framework that formulates each task in the hierarch...
Semi-Markov Decision Problems are continuous time generalizations of discrete time Markov Decision P...
We propose a novel approach for solving continuous and hybrid Markov Decision Processes (MDPs) based...
Solving Markov decision processes (MDPs) efficiently is challenging in many cases, for example, when...
A large class of problems of sequential decision making under uncertainty, of which the underlying p...
www.cs.tu-berlin.de/~geibel Abstract. In this article, I will consider Markov Decision Processes wit...
We present a framework to address a class of sequential decision making problems. Our framework feat...
In this paper, we study the problem of transferring the available Markov Decision Process (MDP) mode...