We consider the problem of finding the best features for value function approximation in reinforcement learning and develop an online algorithm to optimize the mean squared Bellman error objective. For any given set of features, our algorithm performs a gradient search in the parameter space via a residual gradient scheme and, on a slower timescale, also performs a gradient search in the Grassmann manifold of features. We present a proof of convergence of our algorithm. We also show empirical results using our algorithm, as well as a similar algorithm that uses temporal difference learning in place of the residual gradient scheme for the faster-timescale updates.
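To make the two-timescale structure concrete, here is a minimal sketch, assuming a linear architecture V(s) = φ(s)ᵀθ with an orthonormal feature matrix Φ whose column span represents a point on the Grassmann manifold. The names (theta, Phi, alpha, beta) and the QR retraction are illustrative choices for the purpose of this sketch, not the paper's exact construction.

```python
import numpy as np

# Minimal sketch of the two-timescale scheme described in the abstract,
# under assumed notation: linear values V(s) = phi(s)^T theta, with an
# orthonormal feature matrix Phi (columns = features) standing in for a
# point on the Grassmann manifold. All names here are illustrative.

def residual_gradient_step(theta, phi_s, phi_next, r, gamma, alpha):
    """Fast timescale: exact gradient descent on the squared Bellman error
    delta^2, where delta = r + gamma * phi(s')^T theta - phi(s)^T theta."""
    delta = r + gamma * phi_next @ theta - phi_s @ theta
    grad = delta * (gamma * phi_next - phi_s)  # gradient of 0.5 * delta^2 w.r.t. theta
    return theta - alpha * grad

def grassmann_step(Phi, euclid_grad, beta):
    """Slow timescale: project the Euclidean gradient of the objective
    w.r.t. Phi onto the tangent space at span(Phi), take a step, and
    retract back to an orthonormal representative via QR."""
    tangent = euclid_grad - Phi @ (Phi.T @ euclid_grad)  # (I - Phi Phi^T) G
    Q, _ = np.linalg.qr(Phi - beta * tangent)
    return Q
```

The timescale separation (beta much smaller than alpha) is what lets the parameter iterates effectively equilibrate for the current features before the features themselves move, matching the fast/slow structure described in the abstract.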
This work presents the restricted gradient-descent (RGD) algorithm, a training method for lo...
We introduce and empirically evaluate two novel online gradient-based reinforcement learning algorit...
Reinforcement learning is often done using parameterized function approximators to store value funct...
This paper addresses the problem of automatic generation of features for value function approximatio...
A number of reinforcement learning algorithms have been developed that are guaranteed to converge to...
We establish connections from optimizing the Bellman Residual and Temporal Difference Loss to worst-case ...
Reinforcement learning deals with the problem of sequential decision making in uncertain stochastic ...
A common solution approach to reinforcement learning problems with large state spaces (where value f...
We establish a connection between optimizing the Bellman Residual and worst case long-term predictiv...
In reinforcement learning it is frequently necessary to resort to an approximation to the true optim...
This paper explores a new framework for reinforcement learning based on online convex optimization, ...
Most successful examples of Reinforcement Learning (RL) report the use of carefully designe...
A simple learning rule is derived, the VAPS algorithm, which can be instantiated to generate a wide ...
Graduation date: 2007. The thesis focuses on model-based approximation methods for reinforcement le...