Training and Evaluation of Deep Policies Using Reinforcement Learning and Generative Models

Ghadirzadeh, Ali
Poklukar, Petra
Arndt, Karol
Finn, Chelsea
Kyrki, Ville
Kragic, Danica
Björkman, Mårten

Publication date

August 2022

Publisher

MICROTOME PUBL

Journal

Journal of Machine Learning Research

Abstract

We present a data-efficient framework for solving sequential decision-making problems which exploits the combination of reinforcement learning (RL) and latent variable generative models. The framework, called GenRL, trains deep policies by introducing an action latent variable such that the feed-forward policy search can be divided into two parts: (i) training a sub-policy that outputs a distribution over the action latent variable given a state of the system, and (ii) unsupervised training of a generative model that outputs a sequence of motor actions conditioned on the latent action variable. GenRL enables safe exploration and alleviates the data-inefficiency problem as it exploits prior knowledge about valid sequences of motor actions. M...
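The two-part split described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the dimensions, the linear Gaussian sub-policy, the linear-plus-tanh decoder, and all names (`sub_policy`, `decode`, `act`) are hypothetical, and untrained random weights stand in for the learned models.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes, chosen only for illustration.
STATE_DIM, LATENT_DIM, HORIZON, ACTION_DIM = 4, 2, 5, 3

# (i) Sub-policy: maps a state to a Gaussian over the action latent variable.
# Random weights stand in for parameters that RL would train.
W_mu = rng.normal(scale=0.1, size=(LATENT_DIM, STATE_DIM))
log_std = np.zeros(LATENT_DIM)

def sub_policy(state):
    """Return the mean and standard deviation of the latent-action distribution."""
    return W_mu @ state, np.exp(log_std)

# (ii) Generative model (decoder): maps a latent action to a sequence of
# motor actions. Random weights stand in for an unsupervised-trained model.
W_dec = rng.normal(scale=0.1, size=(HORIZON * ACTION_DIM, LATENT_DIM))

def decode(latent):
    """Return a (HORIZON, ACTION_DIM) action sequence bounded to [-1, 1]."""
    return np.tanh(W_dec @ latent).reshape(HORIZON, ACTION_DIM)

def act(state):
    """Sample a latent action from the sub-policy, then decode it."""
    mu, std = sub_policy(state)
    latent = mu + std * rng.standard_normal(LATENT_DIM)
    return decode(latent)

actions = act(np.ones(STATE_DIM))
print(actions.shape)
```

In this sketch the policy only ever chooses a point in the latent space, so every executed sequence is one the decoder can produce; that is the sense in which a generative model trained on valid motor sequences can constrain exploration.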
