Discounted-sum games provide a formal model for the study of reinforcement learning, where the agent is enticed to get rewards early since later rewards are discounted. When the agent interacts with the environment, she may realize that, with hindsight, she could have increased her reward by playing differently: this difference in outcomes constitutes her regret value. The agent may thus elect to follow a regret- minimal strategy. In this paper, it is shown that (1) there always exist regret-minimal strategies that are admissible—a strategy being inadmissible if there is another strategy that always performs better; (2) computing the minimum possible regret or checking that a strategy is regret-minimal can be done in, disregarding the compu...
We study the problem of online learning with a notion of regret defined with respect to a set of str...
We examine the problem of regret minimization when the learner is involved in a continuous game with...
We propose a novel online learning method for mini-mizing regret in large extensive-form games. The ...
In this paper, we study the problem of minimizing regret in discounted-sum games played on weighted ...
Regret-minimizing strategies for repeated games have been receiving increasing attention in the lite...
AbstractNo-regret is described as one framework that game theorists and computer scientists have con...
Regret-minimizing strategies for repeated games have been receiving increasing attention in the lite...
Many situations involve repeatedly making decisions in an uncertain environment: for instance, decid...
Many situations involve repeatedly making decisions in an uncertain environment: for instance, decid...
International audienceWe examine the problem of regret minimization when the learner is involved in ...
International audienceIn game-theoretic learning, several agents are simultaneously following their ...
International audienceWe examine the problem of regret minimization when the learner is involved in ...
We propose a novel online learning method for minimizing regret in large extensive-form games. The a...
Regret-minimizing strategies for repeated games have been receiving increasing attention in the lite...
Regret-minimizing strategies for repeated games have been receiving increasing attention in the lite...
We study the problem of online learning with a notion of regret defined with respect to a set of str...
We examine the problem of regret minimization when the learner is involved in a continuous game with...
We propose a novel online learning method for mini-mizing regret in large extensive-form games. The ...
In this paper, we study the problem of minimizing regret in discounted-sum games played on weighted ...
Regret-minimizing strategies for repeated games have been receiving increasing attention in the lite...
AbstractNo-regret is described as one framework that game theorists and computer scientists have con...
Regret-minimizing strategies for repeated games have been receiving increasing attention in the lite...
Many situations involve repeatedly making decisions in an uncertain environment: for instance, decid...
Many situations involve repeatedly making decisions in an uncertain environment: for instance, decid...
International audienceWe examine the problem of regret minimization when the learner is involved in ...
International audienceIn game-theoretic learning, several agents are simultaneously following their ...
International audienceWe examine the problem of regret minimization when the learner is involved in ...
We propose a novel online learning method for minimizing regret in large extensive-form games. The a...
Regret-minimizing strategies for repeated games have been receiving increasing attention in the lite...
Regret-minimizing strategies for repeated games have been receiving increasing attention in the lite...
We study the problem of online learning with a notion of regret defined with respect to a set of str...
We examine the problem of regret minimization when the learner is involved in a continuous game with...
We propose a novel online learning method for mini-mizing regret in large extensive-form games. The ...