We study the average cost Linear Quadratic (LQ) control problem with unknown model parameters, also known as the adaptive control problem in the control community. We design an algorithm and prove that apart from logarithmic factors its regret up to time T is O( T). Unlike previous approaches that use a forced-exploration scheme, we construct a high-probability confidence set around the model parameters and design an algorithm that plays optimistically with respect to this confidence set. The construction of the confidence set is based on the recent results from online least-squares estimation and leads to improved worst-case regret bound for the proposed algorithm. To the best of our knowledge this is the the first time that a regret bound...
The Linear Quadratic Regulator (LQR) framework considers the problem of regulating a linear dynamica...
Learning algorithms play an ever increasing role in modern engineering solutions. However, despite m...
In this work, we study model-based reinforcement learning (RL) in unknown stabilizable linear dynami...
We present fundamental limitations for the regret of adaptive control of the linear quadratic regula...
Optimal control for the canonical model of systems with linear dynamics and quadratic operating cost...
Stabilizing the unknown dynamics of a control system and minimizing regret in control of an unknown ...
We study the problem of adaptive control in partially observable linear quadratic Gaussian control s...
The field of linear control has seen broad application in fields as diverse as robotics, aviation,...
We study the problem of regret minimization in partially observable linear quadratic control systems...
We study the problem of adaptive control in partially observable linear dynamical systems. We propos...
We consider the problem of online adaptive control of the linear quadratic regulator, where the true...
International audienceWe consider the exploration-exploitation tradeoff in linear quadratic (LQ) con...
©1999 IEEE. Personal use of this material is permitted. However, permission to reprint/republish thi...
A novel method of an adaptive linear quadratic (LQ) regulation of uncertain continuous linear time-i...
Optimal controllers are usually designed to minimize cost under the assumption that the disturbance ...
The Linear Quadratic Regulator (LQR) framework considers the problem of regulating a linear dynamica...
Learning algorithms play an ever increasing role in modern engineering solutions. However, despite m...
In this work, we study model-based reinforcement learning (RL) in unknown stabilizable linear dynami...
We present fundamental limitations for the regret of adaptive control of the linear quadratic regula...
Optimal control for the canonical model of systems with linear dynamics and quadratic operating cost...
Stabilizing the unknown dynamics of a control system and minimizing regret in control of an unknown ...
We study the problem of adaptive control in partially observable linear quadratic Gaussian control s...
The field of linear control has seen broad application in fields as diverse as robotics, aviation,...
We study the problem of regret minimization in partially observable linear quadratic control systems...
We study the problem of adaptive control in partially observable linear dynamical systems. We propos...
We consider the problem of online adaptive control of the linear quadratic regulator, where the true...
International audienceWe consider the exploration-exploitation tradeoff in linear quadratic (LQ) con...
©1999 IEEE. Personal use of this material is permitted. However, permission to reprint/republish thi...
A novel method of an adaptive linear quadratic (LQ) regulation of uncertain continuous linear time-i...
Optimal controllers are usually designed to minimize cost under the assumption that the disturbance ...
The Linear Quadratic Regulator (LQR) framework considers the problem of regulating a linear dynamica...
Learning algorithms play an ever increasing role in modern engineering solutions. However, despite m...
In this work, we study model-based reinforcement learning (RL) in unknown stabilizable linear dynami...