We study the problem of adaptive control in partially observable linear quadratic Gaussian control systems, where the model dynamics are unknown a priori. We propose LqgOpt, a novel reinforcement learning algorithm based on the principle of optimism in the face of uncertainty, to effectively minimize the overall control cost. We employ the predictor state evolution representation of the system dynamics and deploy a recently proposed closed-loop system identification method, estimation, and confidence bound construction. LqgOpt efficiently explores the system dynamics, estimates the model parameters up to their confidence interval, and deploys the controller of the most optimistic model for further exploration and exploitation. We provide st...
International audienceWe consider the exploration-exploitation tradeoff in linear quadratic (LQ) con...
We present fundamental limitations for the regret of adaptive control of the linear quadratic regula...
We study the problem of learning decentralized linear quadratic regulator when the system model is u...
We study the problem of adaptive control in partially observable linear quadratic Gaussian control s...
We study the problem of regret minimization in partially observable linear quadratic control systems...
We study the problem of adaptive control in partially observable linear dynamical systems. We propos...
We study the average cost Linear Quadratic (LQ) control problem with unknown model parameters, also ...
Stabilizing the unknown dynamics of a control system and minimizing regret in control of an unknown ...
The field of linear control has seen broad application in fields as diverse as robotics, aviation,...
Optimal control for the canonical model of systems with linear dynamics and quadratic operating cost...
©1999 IEEE. Personal use of this material is permitted. However, permission to reprint/republish thi...
In this work, we study model-based reinforcement learning (RL) in unknown stabilizable linear dynami...
We consider the problem of online adaptive control of the linear quadratic regulator, where the true...
In this paper, the finite-horizon near optimal adaptive regulation of linear discrete-time systems w...
The Linear Quadratic Regulator (LQR) framework considers the problem of regulating a linear dynamica...
International audienceWe consider the exploration-exploitation tradeoff in linear quadratic (LQ) con...
We present fundamental limitations for the regret of adaptive control of the linear quadratic regula...
We study the problem of learning decentralized linear quadratic regulator when the system model is u...
We study the problem of adaptive control in partially observable linear quadratic Gaussian control s...
We study the problem of regret minimization in partially observable linear quadratic control systems...
We study the problem of adaptive control in partially observable linear dynamical systems. We propos...
We study the average cost Linear Quadratic (LQ) control problem with unknown model parameters, also ...
Stabilizing the unknown dynamics of a control system and minimizing regret in control of an unknown ...
The field of linear control has seen broad application in fields as diverse as robotics, aviation,...
Optimal control for the canonical model of systems with linear dynamics and quadratic operating cost...
©1999 IEEE. Personal use of this material is permitted. However, permission to reprint/republish thi...
In this work, we study model-based reinforcement learning (RL) in unknown stabilizable linear dynami...
We consider the problem of online adaptive control of the linear quadratic regulator, where the true...
In this paper, the finite-horizon near optimal adaptive regulation of linear discrete-time systems w...
The Linear Quadratic Regulator (LQR) framework considers the problem of regulating a linear dynamica...
International audienceWe consider the exploration-exploitation tradeoff in linear quadratic (LQ) con...
We present fundamental limitations for the regret of adaptive control of the linear quadratic regula...
We study the problem of learning decentralized linear quadratic regulator when the system model is u...