Problems of sequential decision making under uncertainty are common inmanufacturing, computer and communication systems, and many such problems canbe formulated as Markov Decision Processes (MDPs). Motivated by a capacityexpansion and allocation problem in semiconductor manufacturing, we formulatea fab-level decision making problem using a finite-horizon transient MDPmodel that can integrate life cycle dynamics of the fab and provide atrade-off between immediate and future benefits and costs.However, for large and complicated systems formulated as MDPs, the classicalmethodology to compute optimal policies, dynamic programming, suffers fromthe so-called "curse of dimensionality" (computational requirementincreases exponentially with number o...
We propose various computational schemes for solving Partially Observable Markov Decision Processes...
Wedevelopasimulation-based,two-timescale actorcritic algorithm for infinite horizon Markov decision p...
We develop extensions of the Simulated Annealing with Multiplicative Weights (SAMW) algorithm that p...
In this paper, we give a summary of recent development of simulation-based algorithmsfor average cos...
In this paper, we discuss implementation issues of applying a simulation-based approach to asemicond...
Markov decision process (MDP) models are widely used for modeling sequential decision-making problem...
In Chapter 2, we propose several two-timescale simulation-based actor-critic algorithms for solution...
In this paper, we propose an approximate policy iteration (API) algorithm for asemiconductor fab-lev...
This chapter presents an overview of simulation-based techniques useful for solving Markov decision ...
This article proposes a three-timescale simulation based algorithm for solution of infinite horizon ...
Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer...
This research focuses on Markov Decision Processes (MDP). MDP is one of the most important and chall...
In this paper, we propose an approximate policy iteration (API) algorithm for a semiconduc-tor fab-l...
Cover title. "February 1998."Includes bibliographical references (p. 16-17).Supported by Siemens AG,...
We propose a new method for learning policies for large, partially observable Markov decision proces...
We propose various computational schemes for solving Partially Observable Markov Decision Processes...
Wedevelopasimulation-based,two-timescale actorcritic algorithm for infinite horizon Markov decision p...
We develop extensions of the Simulated Annealing with Multiplicative Weights (SAMW) algorithm that p...
In this paper, we give a summary of recent development of simulation-based algorithmsfor average cos...
In this paper, we discuss implementation issues of applying a simulation-based approach to asemicond...
Markov decision process (MDP) models are widely used for modeling sequential decision-making problem...
In Chapter 2, we propose several two-timescale simulation-based actor-critic algorithms for solution...
In this paper, we propose an approximate policy iteration (API) algorithm for asemiconductor fab-lev...
This chapter presents an overview of simulation-based techniques useful for solving Markov decision ...
This article proposes a three-timescale simulation based algorithm for solution of infinite horizon ...
Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer...
This research focuses on Markov Decision Processes (MDP). MDP is one of the most important and chall...
In this paper, we propose an approximate policy iteration (API) algorithm for a semiconduc-tor fab-l...
Cover title. "February 1998."Includes bibliographical references (p. 16-17).Supported by Siemens AG,...
We propose a new method for learning policies for large, partially observable Markov decision proces...
We propose various computational schemes for solving Partially Observable Markov Decision Processes...
Wedevelopasimulation-based,two-timescale actorcritic algorithm for infinite horizon Markov decision p...
We develop extensions of the Simulated Annealing with Multiplicative Weights (SAMW) algorithm that p...