© Springer-Verlag Berlin Heidelberg 2001. The strength of a game-playing program is mainly based on the adequacy of the evaluation function and the efficacy of the search algorithm. This paper investigates how temporal difference learning and genetic algorithms can be used to improve various decisions made during game-tree search. The existent TD algorithms are not directly suitable for learning search decisions. Therefore we propose a modified update rule that uses the TD error of the evaluation function to shorten the lag between two rewards. The genetic algorithms can be applied directly to learn search decisions. For our experiments we selected the problem of time allocation from the set of search decisions. On each move the player can ...
In this paper we present TDLeaf(), a variation on the TD() algorithm that enables it to be used in c...
The game of Go has a high branching factor that defeats the tree search approach used in computer ch...
NeuroDraughts is a draughts playing program similar in approach to NeuroGammon and NeuroChess [Tesau...
© Springer-Verlag Berlin Heidelberg 2001. The strength of a game-playing program is mainly based on ...
Traditional search methods have always suffered from both spatial and temporal expansion, especiall...
Game-tree search is the engine behind many computer game opponents. Traditional game-tree search alg...
In this paper we present TDLeaf(), a variation on the TD() algorithm that enables it to be used in c...
Temporal-difference learning is one of the most successful and broadly applied solutions to the rein...
Reinforcement learning is applied to computer-based playing of 5x5 Go. We have found that incorporat...
Hybridization of global and local search techniques has already produced promising results in the fi...
Computers have developed to the point where searching through a large set of data to find an optimum...
This paper describes a methodology for quickly learning to play games at a strong level. The methodo...
We present an experimental methodology and results for a machine learning approach to learning openi...
Computers have developed to the point where searching through a large set of data to find an optimum...
In this thesis we develop a unified framework for reinforcement learning and simulation-based search...
In this paper we present TDLeaf(), a variation on the TD() algorithm that enables it to be used in c...
The game of Go has a high branching factor that defeats the tree search approach used in computer ch...
NeuroDraughts is a draughts playing program similar in approach to NeuroGammon and NeuroChess [Tesau...
© Springer-Verlag Berlin Heidelberg 2001. The strength of a game-playing program is mainly based on ...
Traditional search methods have always suffered from both spatial and temporal expansion, especiall...
Game-tree search is the engine behind many computer game opponents. Traditional game-tree search alg...
In this paper we present TDLeaf(), a variation on the TD() algorithm that enables it to be used in c...
Temporal-difference learning is one of the most successful and broadly applied solutions to the rein...
Reinforcement learning is applied to computer-based playing of 5x5 Go. We have found that incorporat...
Hybridization of global and local search techniques has already produced promising results in the fi...
Computers have developed to the point where searching through a large set of data to find an optimum...
This paper describes a methodology for quickly learning to play games at a strong level. The methodo...
We present an experimental methodology and results for a machine learning approach to learning openi...
Computers have developed to the point where searching through a large set of data to find an optimum...
In this thesis we develop a unified framework for reinforcement learning and simulation-based search...
In this paper we present TDLeaf(), a variation on the TD() algorithm that enables it to be used in c...
The game of Go has a high branching factor that defeats the tree search approach used in computer ch...
NeuroDraughts is a draughts playing program similar in approach to NeuroGammon and NeuroChess [Tesau...