Efficient Reinforcement Learning with Bayesian Optimization

Ganjali, Danyan

Publication date

January 2016

Publisher

eScholarship, University of California

Abstract

A probabilistic reinforcement learning algorithm is presented for finding control policies in continuous state and action spaces without a prior knowledge of the dynamics. The objective of this algorithm is to learn from minimal amount of interaction with the environment in order to maximize a notion of reward, i.e. a numerical measure of the quality of the resulting state trajectories. Experience from the interactions are used to construct a set of probabilistic Gaussian process (GP) models that predict the resulting state trajectories and the reward from executing a policy on the system. These predictions are used with a technique known as Bayesian optimization to search for policies that promise higher rewards. As more experience is gath...

Extracted data

We use cookies to provide a better user experience.

Data Protection

Efficient Reinforcement Learning with Bayesian Optimization

Abstract

Extracted data

Efficient Reinforcement Learning with Bayesian Optimization

Abstract

Extracted data

Related items

Related items