We study the control of an \emph{unknown} linear dynamical system under general convex costs. The objective is minimizing regret vs the class of strongly-stable linear policies. In this work, we first consider the case of known cost functions, for which we design the first polynomial-time algorithm with n 3 √ T -regret, where n is the dimension of the state plus the dimension of control input. The √ T -horizon dependence is optimal, and improves upon the previous best known bound of T 2 / 3 . The main component of our algorithm is a novel geometric exploration strategy: we adaptively construct a sequence of barycentric spanners in an over-parameterized policy space. Second, we consider the case of bandit feedback, for which we give the firs...
In the online linear optimization problem, a learner must choose, in each round, a decision from a s...
In this paper, we show that for arbitrary stochastic linear dynamical systems, the problem of optimi...
Thesis (Ph.D.)--University of Washington, 2023This dissertation makes contributions to decision-maki...
We consider the problem of controlling an unknown linear dynamical system under a stochastic convex ...
In the last century, the problem of controlling a dynamical system has been a core component in nume...
We consider the problem of controlling an unknown linear dynamical system under adversarially changi...
We propose an algorithm based on online convex optimization for controlling discrete-time linear dyn...
The field of linear control has seen broad application in fields as diverse as robotics, aviation,...
We provide the first algorithm for online bandit linear optimization whose regret after T rounds is ...
We consider online convex optimizations in the bandit setting. The decision maker does not know the ...
We study the problem of adaptive control in partially observable linear dynamical systems. We propos...
Consider the online convex optimization problem, in which a player has to choose ac-tions iterativel...
Bandit convex optimization is a special case of online convex optimization with partial information....
We consider the fundamental problem of online control of a linear dynamical system from two differen...
In the online linear optimization problem, a learner must choose, in each round, a decision from a s...
In the online linear optimization problem, a learner must choose, in each round, a decision from a s...
In this paper, we show that for arbitrary stochastic linear dynamical systems, the problem of optimi...
Thesis (Ph.D.)--University of Washington, 2023This dissertation makes contributions to decision-maki...
We consider the problem of controlling an unknown linear dynamical system under a stochastic convex ...
In the last century, the problem of controlling a dynamical system has been a core component in nume...
We consider the problem of controlling an unknown linear dynamical system under adversarially changi...
We propose an algorithm based on online convex optimization for controlling discrete-time linear dyn...
The field of linear control has seen broad application in fields as diverse as robotics, aviation,...
We provide the first algorithm for online bandit linear optimization whose regret after T rounds is ...
We consider online convex optimizations in the bandit setting. The decision maker does not know the ...
We study the problem of adaptive control in partially observable linear dynamical systems. We propos...
Consider the online convex optimization problem, in which a player has to choose ac-tions iterativel...
Bandit convex optimization is a special case of online convex optimization with partial information....
We consider the fundamental problem of online control of a linear dynamical system from two differen...
In the online linear optimization problem, a learner must choose, in each round, a decision from a s...
In the online linear optimization problem, a learner must choose, in each round, a decision from a s...
In this paper, we show that for arbitrary stochastic linear dynamical systems, the problem of optimi...
Thesis (Ph.D.)--University of Washington, 2023This dissertation makes contributions to decision-maki...