Address email We present an approximation scheme for solving Markov Decision Processes (MDPs) in which the states are embedded in a metric space. Our algorithm has log log n a time bound of Õ(n ɛ), where n is the number of states and ɛ the approximation factor. This bound is independent of the numerical size of the input and the discount factor and is hence strongly polynomial. We present the result for deterministic MDPs and discuss extensions to stochastic domains. Unlike most MDP algorithms, ours does not make local iterative updates, but uses a divide and conquer approach. It partitions the space into nested regions and considers only policies that transition between partitions through portals. A simple dynamic program over the partitio...
Colloque avec actes et comité de lecture. internationale.International audienceA new algorithm for s...
We provide a novel, flexible, iterative refinement algorithm to automatically construct an approxima...
Increasing attention has been paid to reinforcement learning algorithms in recent years, partly due ...
We present a provably near-optimal algorithm for reinforcement learn-ing in Markov decision processe...
We present metric- � , a provably near-optimal algorithm for reinforcement learning in Markov decisi...
We present a class of metrics, defined on the state space of a finite Markov decision process (MDP)...
We present metric- E3 a provably near-optimal algorithm for reinforcement learning in Markov decisio...
This paper provides new techniques for abstracting the state space of a Markov Decision Process (MD...
Markov decision problems (MDPs) provide the foundations for a number of problems of interest to AI r...
Solving Markov decision processes (MDPs) efficiently is challenging in many cases, for example, when...
This paper is about planning in stochastic domains by means of partially observable Markov decision...
Markov Decision Problems (MDPs) are the foundation for many problems that are of interest to researc...
We introduce a class of Markov decision problems (MDPs) which greatly simplify Reinforcement Learnin...
I hereby declare that I am the sole author of this thesis. This is a true copy of the thesis, includ...
International audienceThe Markov Decision Process (MDP) framework is a tool for the efficient modell...
Colloque avec actes et comité de lecture. internationale.International audienceA new algorithm for s...
We provide a novel, flexible, iterative refinement algorithm to automatically construct an approxima...
Increasing attention has been paid to reinforcement learning algorithms in recent years, partly due ...
We present a provably near-optimal algorithm for reinforcement learn-ing in Markov decision processe...
We present metric- � , a provably near-optimal algorithm for reinforcement learning in Markov decisi...
We present a class of metrics, defined on the state space of a finite Markov decision process (MDP)...
We present metric- E3 a provably near-optimal algorithm for reinforcement learning in Markov decisio...
This paper provides new techniques for abstracting the state space of a Markov Decision Process (MD...
Markov decision problems (MDPs) provide the foundations for a number of problems of interest to AI r...
Solving Markov decision processes (MDPs) efficiently is challenging in many cases, for example, when...
This paper is about planning in stochastic domains by means of partially observable Markov decision...
Markov Decision Problems (MDPs) are the foundation for many problems that are of interest to researc...
We introduce a class of Markov decision problems (MDPs) which greatly simplify Reinforcement Learnin...
I hereby declare that I am the sole author of this thesis. This is a true copy of the thesis, includ...
International audienceThe Markov Decision Process (MDP) framework is a tool for the efficient modell...
Colloque avec actes et comité de lecture. internationale.International audienceA new algorithm for s...
We provide a novel, flexible, iterative refinement algorithm to automatically construct an approxima...
Increasing attention has been paid to reinforcement learning algorithms in recent years, partly due ...