We study the problem of adaptive control in partially observable linear dynamical systems. We propose a novel algorithm, adaptive control online learning algorithm (AdaptOn), which efficiently explores the environment, estimates the system dynamics episodically and exploits these estimates to design effective controllers to minimize the cumulative costs. Through interaction with the environment, AdaptOn deploys online convex optimization to optimize the controller while simultaneously learning the system dynamics to improve the accuracy of controller updates. We show that when the cost functions are strongly convex, after T times step of agent-environment interaction, AdaptOn achieves regret upper bound of polylog(T). To the best of our kno...
We study the control of an \emph{unknown} linear dynamical system under general convex costs. The ob...
We study the problem of online learning in predictive control of an unknown linear dynamical system ...
We address the problem of simultaneously learning and control in an online receding horizon control ...
We study the problem of adaptive control in partially observable linear dynamical systems. We propos...
The field of linear control has seen broad application in fields as diverse as robotics, aviation,...
We study the problem of adaptive control in partially observable linear quadratic Gaussian control s...
In the last century, the problem of controlling a dynamical system has been a core component in nume...
The study of online control of unknown time varying dynamical systems is a relatively under-explored...
We study the problem of regret minimization in partially observable linear quadratic control systems...
Learning algorithms play an ever increasing role in modern engineering solutions. However, despite m...
We study the average cost Linear Quadratic (LQ) control problem with unknown model parameters, also ...
Optimal controllers are usually designed to minimize cost under the assumption that the disturbance ...
In this work, we study model-based reinforcement learning (RL) in unknown stabilizable linear dynami...
We study the problem of learning decentralized linear quadratic regulator when the system model is u...
We consider the problem of online adaptive control of the linear quadratic regulator, where the true...
We study the control of an \emph{unknown} linear dynamical system under general convex costs. The ob...
We study the problem of online learning in predictive control of an unknown linear dynamical system ...
We address the problem of simultaneously learning and control in an online receding horizon control ...
We study the problem of adaptive control in partially observable linear dynamical systems. We propos...
The field of linear control has seen broad application in fields as diverse as robotics, aviation,...
We study the problem of adaptive control in partially observable linear quadratic Gaussian control s...
In the last century, the problem of controlling a dynamical system has been a core component in nume...
The study of online control of unknown time varying dynamical systems is a relatively under-explored...
We study the problem of regret minimization in partially observable linear quadratic control systems...
Learning algorithms play an ever increasing role in modern engineering solutions. However, despite m...
We study the average cost Linear Quadratic (LQ) control problem with unknown model parameters, also ...
Optimal controllers are usually designed to minimize cost under the assumption that the disturbance ...
In this work, we study model-based reinforcement learning (RL) in unknown stabilizable linear dynami...
We study the problem of learning decentralized linear quadratic regulator when the system model is u...
We consider the problem of online adaptive control of the linear quadratic regulator, where the true...
We study the control of an \emph{unknown} linear dynamical system under general convex costs. The ob...
We study the problem of online learning in predictive control of an unknown linear dynamical system ...
We address the problem of simultaneously learning and control in an online receding horizon control ...