Approximate dynamic programming : projected equation and aggregation methods for Tetris

Hwang, Daw-sen

Publication date

January 2011

Publisher

Massachusetts Institute of Technology

Abstract

Thesis (S.M.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2011.Cataloged from PDF version of thesis.Includes bibliographical references (p. 65-67).In this thesis, we survey approximate dynamic programming (ADP) methods and test the methods with the game of Tetris. We focus on ADP methods where the cost-to- go function J is approximated with [phi]r, where [phi] is some matrix and r is a vector with relatively low dimension. There are two major categories of methods: projected equation methods and aggregation methods. In projected equation methods, the cost-to-go function approximation [phi]r is updated by simulation using one of several policy-updated algorithms such as LSTD([lambda]) [BB96],...

Extracted data

We use cookies to provide a better user experience.

Data Protection

Approximate dynamic programming : projected equation and aggregation methods for Tetris

Abstract

Extracted data

Approximate dynamic programming : projected equation and aggregation methods for Tetris

Abstract

Extracted data

Related items

Related items