Sparse Bayesian reinforcement learning

Lee, Minwoo

Publication date

January 2017

Publisher

Colorado State University. Libraries

Abstract

2017 Summer.Includes bibliographical references.This dissertation presents knowledge acquisition and retention methods for efficient and robust learning. We propose a framework for learning and memorizing, and we examine how we can use the memory for efficient machine learning. Temporal difference (TD) learning is a core part of reinforcement learning, and it requires function approximation. However, with function approximation, the most popular TD methods such as TD(λ), SARSA, and Q-learning lose stability and diverge especially when the complexity of the problem grows and the sampling distribution is biased. The biased samples cause function approximators such as neural networks to respond quickly to the new data by losing what was previo...

Extracted data

We use cookies to provide a better user experience.

Data Protection

Sparse Bayesian reinforcement learning

Abstract

Extracted data

Sparse Bayesian reinforcement learning

Abstract

Extracted data

Related items

Related items