Numerous learning problems that involve exploration, such as experiment design, multi-armed bandits, online routing, search result aggregation and many more, have been studied extensively in isolation. In this paper we consider a generic and efficiently computable method for action space exploration based on convex geometry. We define a novel geometric notion of an exploration mechanism with low variance, called volumetric spanners, and give efficient algorithms to construct such spanners. We describe applications of this mechanism to the problem of optimal experiment design and to bandit linear optimization, a general framework for decision making under uncertainty. For the latter we give an efficient and near-optimal regret algorithm over gener...
We consider the decision-making framework of online convex optimization with a very large number of ...
In this paper we present a new methodology for robot learning that combines ideas from statistical g...
No-regret algorithms for online convex optimization are potent online learning tools and have been d...
Numerous machine learning problems require an exploration basis, a mechanism to explore the action s...
We study the control of an \emph{unknown} linear dynamical system under general convex costs. The ob...
We provide the first algorithm for online bandit linear optimization whose regret after T rounds is ...
Inspired by advertising markets, we consider large-scale sequential decision making problems in whic...
Barycentric spanners have been used as an efficient exploration basis in online linear optimization ...
The world is structured in countless ways. It may be prudent to enforce corresponding structural pro...
This monograph presents the main mathematical ideas in convex optimization. Starting from the funda...
Many important optimization problems, such as the minimum spanning tree and minimum-cost flow, can b...
We consider online bandit learning in which at every time step, an algorithm h...
We consider online convex optimizations in the bandit setting. The decision maker does not know the ...
© 2017 Neural Information Processing Systems Foundation. All rights reserved. We study a variant of ...