Planning with an adaptive world model

Sebastian B. Thrun
Knut Möller
Alexander Linden

Publication date

January 1991

Abstract

We present a new connectionist planning method [TML90]. Through interaction with an unknown environment, a world model is progressively constructed using gradient descent. To derive optimal actions with respect to future reinforcement, planning is applied in two steps: an experience network proposes a plan, which is subsequently optimized by gradient descent using a chain of world models, so that an optimal reinforcement may be obtained when the plan is actually run. The appropriateness of this method is demonstrated by a robotics application and a pole balancing task.
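
The second planning step can be illustrated with a minimal sketch. It is not the authors' implementation: the linear world model, its coefficients W_S and W_A, the squared-error objective, the all-zero initial plan (standing in for the experience network's proposal), and all function names are assumptions chosen only to show how a plan can be refined by backpropagating an error through a chain of world-model copies.

```python
# Hypothetical stand-in for a trained world model (assumed, not from the paper):
# next_state = W_S * state + W_A * action
W_S, W_A = 0.9, 0.5

def rollout(state, plan):
    """Feed the plan through a chain of identical world-model copies."""
    states = [state]
    for action in plan:
        states.append(W_S * states[-1] + W_A * action)
    return states

def plan_loss(state, plan, goal):
    """Squared distance between the predicted final state and the goal."""
    return (rollout(state, plan)[-1] - goal) ** 2

def plan_gradient(state, plan, goal):
    """Backpropagate the final-state error through the chain to each action."""
    final = rollout(state, plan)[-1]
    grad_state = 2.0 * (final - goal)  # derivative of the loss w.r.t. the final state
    grads = [0.0] * len(plan)
    for t in reversed(range(len(plan))):
        grads[t] = grad_state * W_A    # effect of action t on the state it produces
        grad_state *= W_S              # carry the error one model copy further back
    return grads

def optimize_plan(state, plan, goal, lr=0.1, steps=200):
    """Refine the proposed plan by plain gradient descent on the predicted loss."""
    plan = list(plan)
    for _ in range(steps):
        grads = plan_gradient(state, plan, goal)
        plan = [a - lr * g for a, g in zip(plan, grads)]
    return plan

# The all-zero plan stands in for the experience network's proposal (assumption).
initial_plan = [0.0, 0.0, 0.0]
optimized = optimize_plan(0.0, initial_plan, goal=1.0)
```

In this toy setting the world model is fixed and exact; in the method described above it is itself learned from interaction, so the optimized plan is only as good as the model's predictions.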
