Gaussian Process Dynamic Programming

Deisenroth, M.
Rasmussen, C.
Peters, J.

Open link

Publication date

March 2009

DOI

10.1016/j.neucom.2008.12.019

Publisher

Elsevier BV

Abstract

Reinforcement learning (RL) and optimal control of systems with contin- uous states and actions require approximation techniques in most interesting cases. In this article, we introduce Gaussian process dynamic programming (GPDP), an approximate value-function based RL algorithm. We consider both a classic optimal control problem, where problem-specific prior knowl- edge is available, and a classic RL problem, where only very general priors can be used. For the classic optimal control problem, GPDP models the unknown value functions with Gaussian processes and generalizes dynamic programming to continuous-valued states and actions. For the RL problem, GPDP starts from a given initial state and explores the state space using Bayesian active ...

Extracted data

We use cookies to provide a better user experience.

Data Protection

Gaussian Process Dynamic Programming

Abstract

Extracted data

Gaussian Process Dynamic Programming

Abstract

Extracted data

Related items

Related items