Optimization-based Approximate Dynamic Programming

Petrik, Marek

Publication date

September 2010

Publisher

ScholarWorks@UMass Amherst

Language

English

Abstract

Reinforcement learning algorithms hold promise in many complex domains, such as resource management and planning under uncertainty. Most reinforcement learning algorithms are iterative: they successively approximate the solution based on a set of samples and features. Although these iterative algorithms can achieve impressive results in some domains, they are not sufficiently reliable for wide applicability; they often require extensive parameter tweaking to work well and provide only weak guarantees of solution quality. Some of the most interesting reinforcement learning algorithms are based on approximate dynamic programming (ADP). ADP, also known as value function approximation, approximates the value of being in each state. This thesis...
