We study one of the main concept of online learning and sequential decision problem known as regret minimization. We investigate three different frameworks, whether data are generated accordingly to some i.i.d. process, or when no assumption whatsoever are made on their generation and, finally, when they are the consequences of some sequential interactions between players. The overall objective is to provide a comprehensive introduction to this domain. In each of these main setups, we define and analyze classical algorithms and we analyze their performances. Finally, we also show that some concepts of equilibria that emerged in game theory are learnable by player...
This paper explores a fundamental connection between computational learning theory and game theory t...
In this paper, we study the learning problem in two-player general-sum Markov Games. We consider the...
In this paper, we study the learning problem in two-player general-sum Markov Games. We consider the...
We study one of the main concept of online learning and sequential decision problem known ...
International audienceWe study one of the main concept of online learning and sequential decision pr...
International audienceWe study one of the main concept of online learning and sequential decision pr...
International audienceWe study one of the main concept of online learning and sequential decision pr...
Abstract. We study one of the main concept of online learning and sequential decision problem known ...
International audienceWe study one of the main concept of online learning and sequential decision pr...
International audienceWe study one of the main concept of online learning and sequential decision pr...
We study the problem of online learning with a notion of regret defined with respect to a set of str...
We propose a novel online learning method for minimizing regret in large extensive-form games. The a...
Many situations involve repeatedly making decisions in an uncertain environment: for instance, decid...
Many situations involve repeatedly making decisions in an uncertain environment: for instance, decid...
We propose a novel online learning method for mini-mizing regret in large extensive-form games. The ...
This paper explores a fundamental connection between computational learning theory and game theory t...
In this paper, we study the learning problem in two-player general-sum Markov Games. We consider the...
In this paper, we study the learning problem in two-player general-sum Markov Games. We consider the...
We study one of the main concept of online learning and sequential decision problem known ...
International audienceWe study one of the main concept of online learning and sequential decision pr...
International audienceWe study one of the main concept of online learning and sequential decision pr...
International audienceWe study one of the main concept of online learning and sequential decision pr...
Abstract. We study one of the main concept of online learning and sequential decision problem known ...
International audienceWe study one of the main concept of online learning and sequential decision pr...
International audienceWe study one of the main concept of online learning and sequential decision pr...
We study the problem of online learning with a notion of regret defined with respect to a set of str...
We propose a novel online learning method for minimizing regret in large extensive-form games. The a...
Many situations involve repeatedly making decisions in an uncertain environment: for instance, decid...
Many situations involve repeatedly making decisions in an uncertain environment: for instance, decid...
We propose a novel online learning method for mini-mizing regret in large extensive-form games. The ...
This paper explores a fundamental connection between computational learning theory and game theory t...
In this paper, we study the learning problem in two-player general-sum Markov Games. We consider the...
In this paper, we study the learning problem in two-player general-sum Markov Games. We consider the...