A new reinforcement learning algorithm with fixed exploration for semi-Markov decision processes

Encapera, Angelo Michael

Open PDF

Open link

Publication date

January 2017

Publisher

Scholars\u27 Mine

Language

English

Abstract

Artificial intelligence or machine learning techniques are currently being widely applied for solving problems within the field of data analytics. This work presents and demonstrates the use of a new machine learning algorithm for solving semi-Markov decision processes (SMDPs). SMDPs are encountered in the domain of Reinforcement Learning to solve control problems in discrete-event systems. The new algorithm developed here is called iSMART, an acronym for imaging Semi-Markov Average Reward Technique. The algorithm uses a constant exploration rate, unlike its precursor R-SMART, which required exploration decay. The major difference between R-SMART and iSMART is that the latter uses, in addition to the regular iterates of R-SMART, a set of so...

Extracted data

We use cookies to provide a better user experience.

Data Protection

A new reinforcement learning algorithm with fixed exploration for semi-Markov decision processes

Abstract

Extracted data

A new reinforcement learning algorithm with fixed exploration for semi-Markov decision processes

Abstract

Extracted data

Related items

Related items