In this short paper we prove that the classic simplex method with the most-negative-reduced-cost pivoting rule (Dantzig 1947) for solving the Markov decision problem (MDP) with a fixed discount rate is a strongly polynomial-time algorithm. The result seems surprising since this very pivoting rule was shown to be exponential for solving a general linear programming (LP) problem, and the simplex (or simple policy iteration) method with the smallest-index pivoting rule was shown to be expo-nential for solving an MDP problem regardless of discount rates. As a corollary, the policy-iteration method (Howard 1960) is also a strongly polynomial-time algorithm for solving the MDP with a fixed discount rate.
Value iteration is a commonly used and em-pirically competitive method in solving many Markov decisi...
We present a new complexity result on solving the Markov decision problem (MDP) with n states and a ...
The question whether the Simplex method admits a polynomial time pivot rule remains one of the most ...
We prove that the simplex method with the highest gain/most-negative-reduced cost pivoting rule conv...
The simplex method is a well-studied and widely-used pivoting method for solving linear programs. Wh...
The question of knowing whether the Policy Iteration algorithm (PI) for solving Markov Decision Proc...
We study the problem of computing the optimal value function for a Markov decision process with posi...
Abstract: "We study the problem of computing the optimal value function for a Markov decision proces...
We study the problem of computing the optimal value function for a Markov decision process with posi...
We study the problem of computing the optimal value function for a Markov decision process with posi...
The Simplex algorithm is one of the most important algorithms in discrete optimization, and is the m...
We study the problem of computing the optimal value function for a Markov decision process with posi...
The Simplex algorithm is one of the most important algorithms in discrete optimization, and is the m...
The question whether the Simplex method admits a polynomial time pivot rule remains one of the most ...
The question whether the Simplex method admits a polynomial time pivot rule remains one of the most ...
Value iteration is a commonly used and em-pirically competitive method in solving many Markov decisi...
We present a new complexity result on solving the Markov decision problem (MDP) with n states and a ...
The question whether the Simplex method admits a polynomial time pivot rule remains one of the most ...
We prove that the simplex method with the highest gain/most-negative-reduced cost pivoting rule conv...
The simplex method is a well-studied and widely-used pivoting method for solving linear programs. Wh...
The question of knowing whether the Policy Iteration algorithm (PI) for solving Markov Decision Proc...
We study the problem of computing the optimal value function for a Markov decision process with posi...
Abstract: "We study the problem of computing the optimal value function for a Markov decision proces...
We study the problem of computing the optimal value function for a Markov decision process with posi...
We study the problem of computing the optimal value function for a Markov decision process with posi...
The Simplex algorithm is one of the most important algorithms in discrete optimization, and is the m...
We study the problem of computing the optimal value function for a Markov decision process with posi...
The Simplex algorithm is one of the most important algorithms in discrete optimization, and is the m...
The question whether the Simplex method admits a polynomial time pivot rule remains one of the most ...
The question whether the Simplex method admits a polynomial time pivot rule remains one of the most ...
Value iteration is a commonly used and em-pirically competitive method in solving many Markov decisi...
We present a new complexity result on solving the Markov decision problem (MDP) with n states and a ...
The question whether the Simplex method admits a polynomial time pivot rule remains one of the most ...