Reinforcement learning through global stochastic search in N-MDPs

Matteo, Leonetti
IOCCHI, Luca
Ramamoorthy, Subramanian

Open link

Publication date

January 2011

DOI

10.1007/978-3-642-23783-6_21

Publisher

Springer Science and Business Media LLC

Abstract

Reinforcement Learning (RL) in either fully or partially observable domains usually poses a requirement on the knowledge representation in order to be sound: the underlying stochastic process must be Markovian. In many applications, including those involving interactions between multiple agents (e.g., humans and robots), sources of uncertainty affect rewards and transition dynamics in such a way that a Markovian representation would be computationally very expensive. An alternative formulation of the decision problem involves partially specified behaviors with choice points. While this reduces the complexity of the policy space that must be explored - something that is crucial for realistic autonomous agents that must bound search time - it...

Extracted data

We use cookies to provide a better user experience.

Data Protection

Reinforcement learning through global stochastic search in N-MDPs

Abstract

Extracted data

Reinforcement learning through global stochastic search in N-MDPs

Abstract

Extracted data

Related items

Related items