We consider the problem of online adaptive control of the linear quadratic regulator, where the true system parameters are unknown. We prove new upper and lower bounds demonstrating that the optimal regret scales as $\widetilde{\Theta}({\sqrt{d_{\mathbf{u}}^2 d_{\mathbf{x}} T}})$, where $T$ is the number of time steps, $d_{\mathbf{u}}$ is the dimension of the input space, and $d_{\mathbf{x}}$ is the dimension of the system state. Notably, our lower bounds rule out the possibility of a $\mathrm{poly}(\log{}T)$-regret algorithm, which had been conjectured due to the apparent strong convexity of the problem. Our upper bound is attained by a simple variant of $\textit{{certainty equivalent control}}$, where the learner selects control inputs ac...
Optimal control theory and machine learning techniques are combined to propose and solve in closed f...
In the last century, the problem of controlling a dynamical system has been a core component in nume...
We address the problem of simultaneously learning and control in an online receding horizon control ...
The Linear Quadratic Regulator (LQR) framework considers the problem of regulating a linear dynamica...
The field of linear control has seen broad application in fields as diverse as robotics, aviation,...
We study the problem of learning decentralized linear quadratic regulator when the system model is u...
In this work, we study model-based reinforcement learning (RL) in unknown stabilizable linear dynami...
Stabilizing the unknown dynamics of a control system and minimizing regret in control of an unknown ...
We study the impact of predictions in online Linear Quadratic Regulator control with both stochastic...
We study the average cost Linear Quadratic (LQ) control problem with unknown model parameters, also ...
We study the problem of adaptive control in partially observable linear quadratic Gaussian control s...
Reinforcement learning (RL) has demonstrated impressive performance in various domains such as video...
This paper addresses the optimal control problem known as the linear quadratic regulator in the case...
We study the problem of adaptive control in partially observable linear dynamical systems. We propos...
In this paper, we study the statistical difficulty of learning to control linear systems. We focus o...
Optimal control theory and machine learning techniques are combined to propose and solve in closed f...
In the last century, the problem of controlling a dynamical system has been a core component in nume...
We address the problem of simultaneously learning and control in an online receding horizon control ...
The Linear Quadratic Regulator (LQR) framework considers the problem of regulating a linear dynamica...
The field of linear control has seen broad application in fields as diverse as robotics, aviation,...
We study the problem of learning decentralized linear quadratic regulator when the system model is u...
In this work, we study model-based reinforcement learning (RL) in unknown stabilizable linear dynami...
Stabilizing the unknown dynamics of a control system and minimizing regret in control of an unknown ...
We study the impact of predictions in online Linear Quadratic Regulator control with both stochastic...
We study the average cost Linear Quadratic (LQ) control problem with unknown model parameters, also ...
We study the problem of adaptive control in partially observable linear quadratic Gaussian control s...
Reinforcement learning (RL) has demonstrated impressive performance in various domains such as video...
This paper addresses the optimal control problem known as the linear quadratic regulator in the case...
We study the problem of adaptive control in partially observable linear dynamical systems. We propos...
In this paper, we study the statistical difficulty of learning to control linear systems. We focus o...
Optimal control theory and machine learning techniques are combined to propose and solve in closed f...
In the last century, the problem of controlling a dynamical system has been a core component in nume...
We address the problem of simultaneously learning and control in an online receding horizon control ...