A number of learning problems can be cast as an Online Convex Game: on each round, a learner makes a prediction x from a convex set, the environment plays a loss function f, and the learner’s long-term goal is to minimize regret. Algorithms have been proposed by Zinkevich, when f is assumed to be convex, and Hazan et al., when f is assumed to be strongly convex, that have provably low regret. We consider these two settings and analyze such games from a minimax perspective, proving minimax strategies and lower bounds in each case. These results prove that the existing algorithms are essentially optimal
We study the performance of an online learner under a framework in which it receives partial informa...
We study the problem of online learning with a notion of regret defined with respect to a set of str...
We study the rates of growth of the regret in online convex optimization. First, we show that a simp...
A number of learning problems can be cast as an Online Convex Game: on each round, a learner makes a...
We study the regret of optimal strategies for online convex optimization games. Using von Neumann's ...
We study the regret of optimal strategies for online convex optimization games. Using von Neumann's ...
Much of the work in online learning focuses on the study of sublinear upper bounds on the regret. In...
The framework of online learning with memory naturally captures learning problems with temporal effe...
We introduce an online convex optimization algorithm which utilizes projected subgradient descent wi...
This thesis studies three problems in sequential decision making across two different frameworks. T...
We study the rates of growth of the regret in online convex optimization. First, we show that a simp...
© 2017 Neural information processing systems foundation. All rights reserved. We study a variant of ...
We design and analyze minimax-optimal algorithms for online linear optimization games where the play...
We study online optimization in a setting where an online learner seeks to optimize a per-round hitt...
This paper describes a general framework for converting online game playing algorithms into constrai...
We study the performance of an online learner under a framework in which it receives partial informa...
We study the problem of online learning with a notion of regret defined with respect to a set of str...
We study the rates of growth of the regret in online convex optimization. First, we show that a simp...
A number of learning problems can be cast as an Online Convex Game: on each round, a learner makes a...
We study the regret of optimal strategies for online convex optimization games. Using von Neumann's ...
We study the regret of optimal strategies for online convex optimization games. Using von Neumann's ...
Much of the work in online learning focuses on the study of sublinear upper bounds on the regret. In...
The framework of online learning with memory naturally captures learning problems with temporal effe...
We introduce an online convex optimization algorithm which utilizes projected subgradient descent wi...
This thesis studies three problems in sequential decision making across two different frameworks. T...
We study the rates of growth of the regret in online convex optimization. First, we show that a simp...
© 2017 Neural information processing systems foundation. All rights reserved. We study a variant of ...
We design and analyze minimax-optimal algorithms for online linear optimization games where the play...
We study online optimization in a setting where an online learner seeks to optimize a per-round hitt...
This paper describes a general framework for converting online game playing algorithms into constrai...
We study the performance of an online learner under a framework in which it receives partial informa...
We study the problem of online learning with a notion of regret defined with respect to a set of str...
We study the rates of growth of the regret in online convex optimization. First, we show that a simp...