Incremental Stochastic Factorization for Online Reinforcement Learning

Barreto, Andre
Beirigo, Rafael
Pineau, Joelle
Precup, Doina

Publication date

February 2016

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Abstract

A construct that has been receiving attention recently in reinforcement learning is stochastic factorization (SF), a particular case of non-negative factorization (NMF) in which the matrices involved are stochastic. The idea is to use SF to approximate the transition matrices of a Markov decision process (MDP). This is useful for two reasons. First, learning the factors of the SF instead of the transition matrices can reduce significantly the number of parameters to be estimated. Second, it has been shown that SF can be used to reduce the number of operations needed to compute an MDP's value function. Recently, an algorithm called expectation-maximization SF (EMSF) has been proposed to compute a SF directly from transitions sampled from an ...

Extracted data

We use cookies to provide a better user experience.

Data Protection

Incremental Stochastic Factorization for Online Reinforcement Learning

Abstract

Extracted data

Incremental Stochastic Factorization for Online Reinforcement Learning

Abstract

Extracted data

Related items

Related items