Absfmct- There are dimerent timescales of decision making in semiconductor fabs. While decisions on buyingldiscarding of machines are made on the slower timescale, those that deal with capacity allocation and switchover are made on the faster timescale. We formulate this problem along the lines of a recently developed multi-time scale Markov decision pro-cess (MMDP) framework and present numerical experiments wherein we use TD(0) and Q-learning algorithms with linear approximation architecture, and show comparisons of these with the policy iteration algorithm. We show numerical exper-iments under two different scenarios. In the first, transition probabilities are computed and used in the algorithms. In the second, transitions are simulated ...
We present a novel multi-timescale Q-learning algorithm for average cost control in a Markov decisio...
Semi-Markov Decision Problems are continuous time generalizations of discrete time Markov Decision P...
Solving Markov decision processes (MDPs) efficiently is challenging in many cases, for example, when...
There are different timescales of decision making in semiconductor fabs. While decisions on buying/d...
Many computational problems can be solved by multiple algorithms, with different algorithms fastest ...
This paper proposes a simple analytical model called M time-scale MarkovDecision Process (MMDP) for ...
We consider the problem of control of hierarchical Markov decision processes and develop a simulatio...
In this paper, we propose an approximate policy iteration (API) algorithm for a semiconduc-tor fab-l...
We develop extensions of the Simulated Annealing with Multiplicative Weights (SAMW) algorithm that p...
International audience21 st century has seen a lot of progress, especially in robotics. Today, the e...
Problems of sequential decision making under uncertainty are common inmanufacturing, computer and co...
We develop a simulation based algorithm for finite horizon Markov decision processes with finite sta...
This chapter presents an overview of simulation-based techniques useful for solving Markov decision ...
This article proposes several two-timescale simulation-based actor-critic algorithms for solution of...
We consider the classical finite-state discounted Markovian decision problem, and we introduce a new...
We present a novel multi-timescale Q-learning algorithm for average cost control in a Markov decisio...
Semi-Markov Decision Problems are continuous time generalizations of discrete time Markov Decision P...
Solving Markov decision processes (MDPs) efficiently is challenging in many cases, for example, when...
There are different timescales of decision making in semiconductor fabs. While decisions on buying/d...
Many computational problems can be solved by multiple algorithms, with different algorithms fastest ...
This paper proposes a simple analytical model called M time-scale MarkovDecision Process (MMDP) for ...
We consider the problem of control of hierarchical Markov decision processes and develop a simulatio...
In this paper, we propose an approximate policy iteration (API) algorithm for a semiconduc-tor fab-l...
We develop extensions of the Simulated Annealing with Multiplicative Weights (SAMW) algorithm that p...
International audience21 st century has seen a lot of progress, especially in robotics. Today, the e...
Problems of sequential decision making under uncertainty are common inmanufacturing, computer and co...
We develop a simulation based algorithm for finite horizon Markov decision processes with finite sta...
This chapter presents an overview of simulation-based techniques useful for solving Markov decision ...
This article proposes several two-timescale simulation-based actor-critic algorithms for solution of...
We consider the classical finite-state discounted Markovian decision problem, and we introduce a new...
We present a novel multi-timescale Q-learning algorithm for average cost control in a Markov decisio...
Semi-Markov Decision Problems are continuous time generalizations of discrete time Markov Decision P...
Solving Markov decision processes (MDPs) efficiently is challenging in many cases, for example, when...