Numerous machine learning problems require an exploration basis- a mechanism to explore the action space. We define a novel geometric notion of exploration basis with low variance called volumetric spanners, and give efficient algorithms to construct such bases. We show how efficient volumetric spanners give rise to an efficient and near-optimal regret algorithm for bandit linear optimization over general convex sets. Previously such results were known only for specific convex sets, or under special conditions such as the existence of an efficient self-concordant barrier for the underlying set
The dissertation addresses the research topics of machine learning outlined below. We developed the ...
We consider the decision-making framework of online convex optimization with a very large number of ...
We develop a greedy algorithm for the basis-pursuit problem. Thealgorithm is empirically found to pr...
Numerous machine learning problems require an exploration basis- a mechanism to explore the action s...
Numerous learning problems that contain exploration, such as experiment design, multiarm bandits, on...
We provide the first algorithm for online bandit linear optimization whose regret after T rounds is ...
Barycentric spanners have been used as an efficient exploration basis in online linear optimization ...
Linear bandit algorithms yield $\tilde{\mathcal{O}}(n\sqrt{T})$ pseudo-regret bounds on compact conv...
The world is structured in countless ways. It may be prudent to enforce corresponding structural pro...
Consider the online convex optimization problem, in which a player has to choose ac-tions iterativel...
We study the control of an \emph{unknown} linear dynamical system under general convex costs. The ob...
We study stochastic linear payoff bandit prob-lems and give a simple, computationally ef-ficient alg...
We extend the well-known BFGS quasi-Newton method and its memory-limited variant LBFGS to the optimi...
Inspired by advertising markets, we consider large-scale sequential decision making problems in whic...
Abstract—Linear optimization is many times algorithmi-cally simpler than non-linear convex optimizat...
The dissertation addresses the research topics of machine learning outlined below. We developed the ...
We consider the decision-making framework of online convex optimization with a very large number of ...
We develop a greedy algorithm for the basis-pursuit problem. Thealgorithm is empirically found to pr...
Numerous machine learning problems require an exploration basis- a mechanism to explore the action s...
Numerous learning problems that contain exploration, such as experiment design, multiarm bandits, on...
We provide the first algorithm for online bandit linear optimization whose regret after T rounds is ...
Barycentric spanners have been used as an efficient exploration basis in online linear optimization ...
Linear bandit algorithms yield $\tilde{\mathcal{O}}(n\sqrt{T})$ pseudo-regret bounds on compact conv...
The world is structured in countless ways. It may be prudent to enforce corresponding structural pro...
Consider the online convex optimization problem, in which a player has to choose ac-tions iterativel...
We study the control of an \emph{unknown} linear dynamical system under general convex costs. The ob...
We study stochastic linear payoff bandit prob-lems and give a simple, computationally ef-ficient alg...
We extend the well-known BFGS quasi-Newton method and its memory-limited variant LBFGS to the optimi...
Inspired by advertising markets, we consider large-scale sequential decision making problems in whic...
Abstract—Linear optimization is many times algorithmi-cally simpler than non-linear convex optimizat...
The dissertation addresses the research topics of machine learning outlined below. We developed the ...
We consider the decision-making framework of online convex optimization with a very large number of ...
We develop a greedy algorithm for the basis-pursuit problem. Thealgorithm is empirically found to pr...