Trade-Offs in Sampling-Based Adversarial Planning

Ramanujan, Raghuram
Selman, Bart

Open link

Publication date

March 2011

DOI

10.1609/icaps.v21i1.13472

Publisher

Association for the Advancement of Artificial Intelligence

Abstract

The Upper Confidence bounds for Trees (UCT) algorithm has in recent years captured the attention of the planning and game-playing community due to its notable success in the game of Go. However, attempts to reproduce similar levels of performance in domains that are the forte of Minimax-style algorithms have been largely unsuccessful, making any comparative studies of the two hard. In this paper, we study UCT in the game of Mancala, which to our knowledge is the first domain where both search algorithms perform quite well with minimal enhancement. We focus on the three key components of the UCT algorithm in its purest form - targeted node expansion, state value estimation via playouts and averaging backups - and look at their contributions ...

Extracted data

We use cookies to provide a better user experience.

Data Protection

Trade-Offs in Sampling-Based Adversarial Planning

Abstract

Extracted data

Trade-Offs in Sampling-Based Adversarial Planning

Abstract

Extracted data

Related items

Related items