This paper explores a fundamental connection between computational learning theory and game theory through a property we call no-Φ-regret. Given a set of transformations Φ (i.e., mappings from actions to actions), a learning algorithm is said to exhibit no Φ-regret if an agent experiences no regret for playing the actions the algorithm prescribes, rather than playing the transformed actions prescribed by any of the elements of Φ. The existence of no-Φ-regret learning algorithms is established, for all finite Φ. Analogously, a class of game-theoretic equilibria, called � Φ-equilibria, for � Φ = (Φi)1≤i≤n, is defined (here n is the number of agents/players). The main contribution of this paper is to show that that the empirical distribution o...
We propose a novel online learning method for mini-mizing regret in large extensive-form games. The ...
International audienceUnderstanding the behavior of no-regret dynamics in general N-player games is ...
Abstract. We study one of the main concept of online learning and sequential decision problem known ...
International audienceIn game-theoretic learning, several agents are simultaneously following their ...
International audienceIn game-theoretic learning, several agents are simultaneously following their ...
We propose a novel online learning method for minimizing regret in large extensive-form games. The a...
We study one of the main concept of online learning and sequential decision problem known ...
International audienceWe study one of the main concept of online learning and sequential decision pr...
International audienceWe study one of the main concept of online learning and sequential decision pr...
We study one of the main concept of online learning and sequential decision problem known ...
International audienceWe study one of the main concept of online learning and sequential decision pr...
Many situations involve repeatedly making decisions in an uncertain environment: for instance, decid...
Many situations involve repeatedly making decisions in an uncertain environment: for instance, decid...
International audienceWe study one of the main concept of online learning and sequential decision pr...
International audienceWe study one of the main concept of online learning and sequential decision pr...
We propose a novel online learning method for mini-mizing regret in large extensive-form games. The ...
International audienceUnderstanding the behavior of no-regret dynamics in general N-player games is ...
Abstract. We study one of the main concept of online learning and sequential decision problem known ...
International audienceIn game-theoretic learning, several agents are simultaneously following their ...
International audienceIn game-theoretic learning, several agents are simultaneously following their ...
We propose a novel online learning method for minimizing regret in large extensive-form games. The a...
We study one of the main concept of online learning and sequential decision problem known ...
International audienceWe study one of the main concept of online learning and sequential decision pr...
International audienceWe study one of the main concept of online learning and sequential decision pr...
We study one of the main concept of online learning and sequential decision problem known ...
International audienceWe study one of the main concept of online learning and sequential decision pr...
Many situations involve repeatedly making decisions in an uncertain environment: for instance, decid...
Many situations involve repeatedly making decisions in an uncertain environment: for instance, decid...
International audienceWe study one of the main concept of online learning and sequential decision pr...
International audienceWe study one of the main concept of online learning and sequential decision pr...
We propose a novel online learning method for mini-mizing regret in large extensive-form games. The ...
International audienceUnderstanding the behavior of no-regret dynamics in general N-player games is ...
Abstract. We study one of the main concept of online learning and sequential decision problem known ...