International audienceWe study one of the main concept of online learning and sequential decision problem known as regret minimization. We investigate three different frameworks, whether data are generated accordingly to some i.i.d. process, or when no assumption whatsoever are made on their generation and, finally, when they are the consequences of some sequential interactions between players. The overall objective is to provide a comprehensive introduction to this domain. In each of these main setups, we define and analyze classical algorithms and we analyze their performances. Finally, we also show that some concepts of equilibria that emerged in game theory are learnable by players using online learning schemes while some other concepts...
This paper explores a fundamental connection between computational learning theory and game theory t...
International audienceIn game-theoretic learning, several agents are simultaneously following their ...
International audienceFrom auctions to traffic planning and the training of adversarial neural nets,...
International audienceWe study one of the main concept of online learning and sequential decision pr...
International audienceWe study one of the main concept of online learning and sequential decision pr...
International audienceWe study one of the main concept of online learning and sequential decision pr...
International audienceWe study one of the main concept of online learning and sequential decision pr...
We study one of the main concept of online learning and sequential decision problem known ...
Abstract. We study one of the main concept of online learning and sequential decision problem known ...
We study one of the main concept of online learning and sequential decision problem known ...
We study the problem of online learning with a notion of regret defined with respect to a set of str...
We propose a novel online learning method for minimizing regret in large extensive-form games. The a...
Many situations involve repeatedly making decisions in an uncertain environment: for instance, decid...
Many situations involve repeatedly making decisions in an uncertain environment: for instance, decid...
We propose a novel online learning method for mini-mizing regret in large extensive-form games. The ...
This paper explores a fundamental connection between computational learning theory and game theory t...
International audienceIn game-theoretic learning, several agents are simultaneously following their ...
International audienceFrom auctions to traffic planning and the training of adversarial neural nets,...
International audienceWe study one of the main concept of online learning and sequential decision pr...
International audienceWe study one of the main concept of online learning and sequential decision pr...
International audienceWe study one of the main concept of online learning and sequential decision pr...
International audienceWe study one of the main concept of online learning and sequential decision pr...
We study one of the main concept of online learning and sequential decision problem known ...
Abstract. We study one of the main concept of online learning and sequential decision problem known ...
We study one of the main concept of online learning and sequential decision problem known ...
We study the problem of online learning with a notion of regret defined with respect to a set of str...
We propose a novel online learning method for minimizing regret in large extensive-form games. The a...
Many situations involve repeatedly making decisions in an uncertain environment: for instance, decid...
Many situations involve repeatedly making decisions in an uncertain environment: for instance, decid...
We propose a novel online learning method for mini-mizing regret in large extensive-form games. The ...
This paper explores a fundamental connection between computational learning theory and game theory t...
International audienceIn game-theoretic learning, several agents are simultaneously following their ...
International audienceFrom auctions to traffic planning and the training of adversarial neural nets,...