Generalization in Reinforcement Learning with a Task- Related World Description using Rules

Rri I
Alejandro Agostini
Enric Celaya
Ro Agostini
Enric Celaya

Publication date

January 2006

Abstract

Abstract. A Reinforcement Learning problem is formulated as trying to find the action policy that maximizes the accumulated reward received by the agent through time. One of the most popular algorithms used in RL is Q-Learning which uses an action-value function q(s,a) to evaluate the expectation of the maximum future cumulative reward that will be obtained from executing action a in situation s. Q-Learning, as well as conventional RL techniques, is defined for discrete environments with a finite set of states and actions. The action-value function is explicitly represented by storing values for each state-action (s,a) pair. In order to reach a good approximation of the value function all the (s,a) pairs must be experienced many times but i...

Extracted data

We use cookies to provide a better user experience.

Data Protection

Generalization in Reinforcement Learning with a Task- Related World Description using Rules

Abstract

Extracted data

Generalization in Reinforcement Learning with a Task- Related World Description using Rules

Abstract

Extracted data

Related items

Related items