Regret minimization is a powerful tool for solving large-scale extensive-form games. State-of-the-art methods rely on minimizing regret locally at each decision point. In this work we derive a new framework for regret minimization on sequential decision problems and extensive-form games with general compact convex sets at each decision point and general convex losses, as opposed to prior work which has been for simplex decision points and linear losses. We call our framework laminar regret decomposition. It generalizes the CFR algorithm to this more general setting. Furthermore, our framework enables a new proof of CFR even in the known setting, which is derived from a perspective of decomposing polytope regret, thereby leading to an arguab...
Sequential decision-making with multiple agents and imperfect information is commonly modeled as an ...
Sequential decision-making with multiple agents and imperfect information is commonly modeled as an ...
Consider the online convex optimization problem, in which a player has to choose ac-tions iterativel...
We propose a novel online learning method for minimizing regret in large extensive-form games. The a...
A natural algorithmic scheme in online game playing is called ‘follow-the-leader’, first proposed by...
Regret matching is a widely-used algorithm for learning how to act. We begin by proving that regrets...
We extend the classic regret minimization framework for approximating equilibria in normal-form game...
Regret matching is a widely-used algorithm for learning how to act. We begin by proving that regrets...
Extensive-form games are a common model for multiagent interactions with imperfect information. In t...
We study the regret of optimal strategies for online convex optimization games. Using von Neumann's ...
We develop an algorithmic framework for solving convex optimization problems using no-regret game dy...
Many situations involve repeatedly making decisions in an uncertain environment: for instance, decid...
Counterfactual regret minimization (CFR) is a family of iterative algorithms that are the most popul...
No-regret algorithms for online convex optimization are potent online learning tools and have been d...
We propose a computationally efficient random walk on a convex body which rapidly mixes to a time-va...
Sequential decision-making with multiple agents and imperfect information is commonly modeled as an ...
Sequential decision-making with multiple agents and imperfect information is commonly modeled as an ...
Consider the online convex optimization problem, in which a player has to choose ac-tions iterativel...
We propose a novel online learning method for minimizing regret in large extensive-form games. The a...
A natural algorithmic scheme in online game playing is called ‘follow-the-leader’, first proposed by...
Regret matching is a widely-used algorithm for learning how to act. We begin by proving that regrets...
We extend the classic regret minimization framework for approximating equilibria in normal-form game...
Regret matching is a widely-used algorithm for learning how to act. We begin by proving that regrets...
Extensive-form games are a common model for multiagent interactions with imperfect information. In t...
We study the regret of optimal strategies for online convex optimization games. Using von Neumann's ...
We develop an algorithmic framework for solving convex optimization problems using no-regret game dy...
Many situations involve repeatedly making decisions in an uncertain environment: for instance, decid...
Counterfactual regret minimization (CFR) is a family of iterative algorithms that are the most popul...
No-regret algorithms for online convex optimization are potent online learning tools and have been d...
We propose a computationally efficient random walk on a convex body which rapidly mixes to a time-va...
Sequential decision-making with multiple agents and imperfect information is commonly modeled as an ...
Sequential decision-making with multiple agents and imperfect information is commonly modeled as an ...
Consider the online convex optimization problem, in which a player has to choose ac-tions iterativel...