Exploration in Metric State Spaces Sham Kakade

Michael Kearns
John Langford

Publication date

February 2008

Abstract

We present metric- � , a provably near-optimal algorithm for reinforcement learning in Markov decision processes in which there is a natural metric on the state space that allows the construction of accurate local models. The algorithm is a generalization of the � algorithm of Kearns and Singh, and assumes a black box for approximate planning. Unlike the original � , metric-� finds a near optimal policy in an amount of time that does not directly depend on the size of the state space, but instead depends on the covering number of the state space. Informally, the covering number is the number of neighborhoods required for accurate local modeling. 1 Introduction, Motivatio

Extracted data

We use cookies to provide a better user experience.

Data Protection

Exploration in Metric State Spaces Sham Kakade

Abstract

Extracted data

Exploration in Metric State Spaces Sham Kakade

Abstract

Extracted data

Related items

Related items