Reinforcement Learning Algorithms as Function Optimizers

Ronald J. Williams
Jing Peng

Open link

Publication date

January 1989

DOI

10.1109/ijcnn.1989.118683

Publisher

IEEE

Abstract

Any nonassociative reinforcement learning algorithm can be viewed as a method for performing function optimization through (possibly noise-corrupted) sampling of function values. We describe the results of simulations in which the optima of several deterministic functions studied by Ackley [1] were sought using variants of REINFORCE algorithms [19], [20]. Results obtained for certain of these algorithms compare favorably to the best results found by Ackley

Extracted data

We use cookies to provide a better user experience.

Data Protection

Reinforcement Learning Algorithms as Function Optimizers

Abstract

Extracted data

Reinforcement Learning Algorithms as Function Optimizers

Abstract

Extracted data

Related items

Related items