We study the problem of regret minimization in partially observable linear quadratic control systems when the model dynamics are unknown a priori. We propose ExpCommit, an explore-then-commit algorithm that learns the model Markov parameters and then follows the principle of optimism in the face of uncertainty to design a controller. We propose a novel way to decompose the regret and provide an end-to-end sublinear regret upper bound for partially observable linear quadratic control. Finally, we provide stability guarantees and establish a regret upper bound of O(T^(2/3)) for ExpCommit, where T is the time horizon of the problem
The study of online control of unknown time varying dynamical systems is a relatively under-explored...
Optimal controllers are usually designed to minimize cost under the assumption that the disturbance ...
This paper addresses the optimal control problem known as the linear quadratic regulator in the case...
We study the problem of regret minimization in partially observable linear quadratic control systems...
We study the problem of adaptive control in partially observable linear quadratic Gaussian control s...
We study the problem of adaptive control in partially observable linear dynamical systems. We propos...
We study the average cost Linear Quadratic (LQ) control problem with unknown model parameters, also ...
Stabilizing the unknown dynamics of a control system and minimizing regret in control of an unknown ...
This paper studies online solutions for regret-optimal control in partially observable systems over ...
The field of linear control has seen broad application in fields as diverse as robotics, aviation,...
We present fundamental limitations for the regret of adaptive control of the linear quadratic regula...
We study the problem of learning decentralized linear quadratic regulator when the system model is u...
In this work, we study model-based reinforcement learning (RL) in unknown stabilizable linear dynami...
The Linear Quadratic Regulator (LQR) framework considers the problem of regulating a linear dynamica...
We consider the problem of online adaptive control of the linear quadratic regulator, where the true...
The study of online control of unknown time varying dynamical systems is a relatively under-explored...
Optimal controllers are usually designed to minimize cost under the assumption that the disturbance ...
This paper addresses the optimal control problem known as the linear quadratic regulator in the case...
We study the problem of regret minimization in partially observable linear quadratic control systems...
We study the problem of adaptive control in partially observable linear quadratic Gaussian control s...
We study the problem of adaptive control in partially observable linear dynamical systems. We propos...
We study the average cost Linear Quadratic (LQ) control problem with unknown model parameters, also ...
Stabilizing the unknown dynamics of a control system and minimizing regret in control of an unknown ...
This paper studies online solutions for regret-optimal control in partially observable systems over ...
The field of linear control has seen broad application in fields as diverse as robotics, aviation,...
We present fundamental limitations for the regret of adaptive control of the linear quadratic regula...
We study the problem of learning decentralized linear quadratic regulator when the system model is u...
In this work, we study model-based reinforcement learning (RL) in unknown stabilizable linear dynami...
The Linear Quadratic Regulator (LQR) framework considers the problem of regulating a linear dynamica...
We consider the problem of online adaptive control of the linear quadratic regulator, where the true...
The study of online control of unknown time varying dynamical systems is a relatively under-explored...
Optimal controllers are usually designed to minimize cost under the assumption that the disturbance ...
This paper addresses the optimal control problem known as the linear quadratic regulator in the case...