Learning Time Allocation using Neural Networks

Kocsis, L.
Uiterwijk, J.
van den Herik, Hendrik

Publication date

January 2001

DOI

10.1007/3-540-45579-5_11

Publisher

Springer

Abstract

© Springer-Verlag Berlin Heidelberg 2001. The strength of a game-playing program depends mainly on the adequacy of its evaluation function and the efficacy of its search algorithm. This paper investigates how temporal difference (TD) learning and genetic algorithms can be used to improve various decisions made during game-tree search. Existing TD algorithms are not directly suitable for learning search decisions. Therefore, we propose a modified update rule that uses the TD error of the evaluation function to shorten the lag between two rewards. Genetic algorithms, by contrast, can be applied directly to learn search decisions. For our experiments, we selected the problem of time allocation from the set of search decisions. On each move the player can ...
