International audiencePolicy Iteration is an algorithm for the exact solving of optimization and game theory problems, formulated as equations on min max affine expressions. It has been shown that the problem of finding the least fixpoint of semantic equations on some abstract domains can be reduced to such optimization problems. This enables the use of Policy Iteration to solve such equations, instead of the traditional Kleene iteration that performs approximations to ensure convergence. We first show in this paper that the concept of Policy Iteration can be integrated into numerical abstract domains in a generic way. This allows to widen considerably its applicability in static analysis. We then consider the verification of programs manip...
The abstract interpretation is a general method to compute automatically program invariants. This me...
Modified policy iteration (MPI) is a dynamic programming (DP) algorithm that contains the two celebr...
Approximate policy iteration is a class of reinforcement learning (RL) algorithms where the policy i...
International audiencePolicy Iteration is an algorithm for the exact solving of optimization and gam...
International audienceStrategy iteration methods are used for solving fixed point equations. It has ...
Abstract. Strategy iteration methods are used for solving fixed point equations. It has been shown t...
Strategy iteration methods are used for solving fixed point equations. It has been shown that they i...
This paper presents a study of the policy improvement step that can be usefully exploited by approxi...
AbstractWe prove in this paper that policy iteration can be generally defined in finite domain of te...
The policy iteration method is a classical algorithm for solving optimal control problems. In this p...
Strategy iteration is a technique frequently used for two-player games in order to determine the win...
We consider the problem of learning discounted-cost optimal control policies for unknown determinist...
We explore approximate policy iteration, replacing the usual costfunction learning step with a learn...
Abstract. We present an accelerated algorithm for the solution of static Hamilton-Jacobi-Bellman equ...
We consider the discrete-time infinite-horizon optimal control problem formalized by Markov de-cisio...
The abstract interpretation is a general method to compute automatically program invariants. This me...
Modified policy iteration (MPI) is a dynamic programming (DP) algorithm that contains the two celebr...
Approximate policy iteration is a class of reinforcement learning (RL) algorithms where the policy i...
International audiencePolicy Iteration is an algorithm for the exact solving of optimization and gam...
International audienceStrategy iteration methods are used for solving fixed point equations. It has ...
Abstract. Strategy iteration methods are used for solving fixed point equations. It has been shown t...
Strategy iteration methods are used for solving fixed point equations. It has been shown that they i...
This paper presents a study of the policy improvement step that can be usefully exploited by approxi...
AbstractWe prove in this paper that policy iteration can be generally defined in finite domain of te...
The policy iteration method is a classical algorithm for solving optimal control problems. In this p...
Strategy iteration is a technique frequently used for two-player games in order to determine the win...
We consider the problem of learning discounted-cost optimal control policies for unknown determinist...
We explore approximate policy iteration, replacing the usual costfunction learning step with a learn...
Abstract. We present an accelerated algorithm for the solution of static Hamilton-Jacobi-Bellman equ...
We consider the discrete-time infinite-horizon optimal control problem formalized by Markov de-cisio...
The abstract interpretation is a general method to compute automatically program invariants. This me...
Modified policy iteration (MPI) is a dynamic programming (DP) algorithm that contains the two celebr...
Approximate policy iteration is a class of reinforcement learning (RL) algorithms where the policy i...