In this paper, we propose an approximate policy iteration (API) algorithm for a semiconduc-tor fab-level decision making problem. This problem is formulated as a discounted cost Markov Decision Process (MDP), and we have applied exact policy iteration to solve a simple example in prior work [1]. However, the overwhelming computational requirements of exact policy iter-ation prevent its application for larger problems. Approximate policy iteration overcomes this obstacle by approximating the cost-to-go using function approximation. Numerical simulation on the same example shows that the proposed API algorithm leads to a policy with cost close to that of the optimal policy
In this paper, we give a summary of recent development of simulation-based algorithmsfor average cos...
Solving Markov Decision Processes is a recurrent task in engineering which can be performed efficien...
In this paper, we discuss implementation issues of applying a simulation-based approach to asemicond...
In this paper, we propose an approximate policy iteration (API) algorithm for asemiconductor fab-lev...
There are different timescales of decision making in semiconductor fabs. While decisions on buying/d...
Problems of sequential decision making under uncertainty are common inmanufacturing, computer and co...
In this paper we study a class of modified policy iteration algorithms for solving Markov decision p...
We explore approximate policy iteration, replacing the usual costfunction learning step with a learn...
Absfmct- There are dimerent timescales of decision making in semiconductor fabs. While decisions on ...
We consider the classical finite-state discounted Markovian decision problem, and we introduce a new...
This research focuses on Markov Decision Processes (MDP). MDP is one of the most important and chall...
Markov decision processes (MDP) [1] provide a mathe-matical framework for studying a wide range of o...
AbstractQ-Learning is based on value iteration and remains the most popular choice for solving Marko...
Simulation-based policy iteration (SBPI) is a modification of the policy iteration algorithm for com...
We consider the problem of finding an optimal policy in a Markov decision process that maximises the...
In this paper, we give a summary of recent development of simulation-based algorithmsfor average cos...
Solving Markov Decision Processes is a recurrent task in engineering which can be performed efficien...
In this paper, we discuss implementation issues of applying a simulation-based approach to asemicond...
In this paper, we propose an approximate policy iteration (API) algorithm for asemiconductor fab-lev...
There are different timescales of decision making in semiconductor fabs. While decisions on buying/d...
Problems of sequential decision making under uncertainty are common inmanufacturing, computer and co...
In this paper we study a class of modified policy iteration algorithms for solving Markov decision p...
We explore approximate policy iteration, replacing the usual costfunction learning step with a learn...
Absfmct- There are dimerent timescales of decision making in semiconductor fabs. While decisions on ...
We consider the classical finite-state discounted Markovian decision problem, and we introduce a new...
This research focuses on Markov Decision Processes (MDP). MDP is one of the most important and chall...
Markov decision processes (MDP) [1] provide a mathe-matical framework for studying a wide range of o...
AbstractQ-Learning is based on value iteration and remains the most popular choice for solving Marko...
Simulation-based policy iteration (SBPI) is a modification of the policy iteration algorithm for com...
We consider the problem of finding an optimal policy in a Markov decision process that maximises the...
In this paper, we give a summary of recent development of simulation-based algorithmsfor average cos...
Solving Markov Decision Processes is a recurrent task in engineering which can be performed efficien...
In this paper, we discuss implementation issues of applying a simulation-based approach to asemicond...