Scaling Up Reinforcement Learning through Targeted Exploration

Mann, Timothy
Choe, Yoonsuck

Publication date

August 2011

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Abstract

Recent Reinforcement Learning (RL) algorithms, such as R-MAX, make (with high probability) only a small number of poor decisions. In practice, these algorithms do not scale well as the number of states grows because the algorithms spend too much effort exploring. We introduce an RL algorithm State TArgeted R-MAX (STAR-MAX) that explores a subset of the state space, called the exploration envelope ξ. When ξ equals the total state space, STAR-MAX behaves identically to R-MAX. When ξ is a subset of the state space, to keep exploration within ξ, a recovery rule β is needed. We compared existing algorithms with our algorithm employing various exploration envelopes. With an appropriate choice of ξ, STAR-MAX scales far better t...

Extracted data

We use cookies to provide a better user experience.

Data Protection

Scaling Up Reinforcement Learning through Targeted Exploration

Abstract

Extracted data

Scaling Up Reinforcement Learning through Targeted Exploration

Abstract

Extracted data

Related items

Related items