In this work, we study model-based reinforcement learning (RL) in unknown stabilizable linear dynamical systems. When learning a dynamical system, one needs to stabilize the unknown dynamics in order to avoid system blow-ups. We propose an algorithm that certifies fast stabilization of the underlying system by effectively exploring the environment with an improved exploration strategy. We show that the proposed algorithm attains $\tilde{\mathcal{O}}(\sqrt{T})$ regret after $T$ time steps of agent-environment interaction. We also show that the regret of the proposed algorithm has only a polynomial dependence in the problem dimensions, which gives an exponential improvement over the prior methods. Our improved exploration method is simple, ye...
In this thesis, we study the related problems of reinforcement learning and optimal adaptive control...
The field of linear control has seen broad application in fields as diverse as robotics, aviation,...
We consider the linear quadratic regulation problem when the plant is an unknown linear dynamical sy...
Stabilizing the unknown dynamics of a control system and minimizing regret in control of an unknown ...
We study the problem of adaptive control in partially observable linear quadratic Gaussian control s...
We consider the problem of online adaptive control of the linear quadratic regulator, where the true...
With potential applications as diverse as self-driving cars, medical robots, and network protocols, ...
We study the problem of adaptive control in partially observable linear dynamical systems. We propos...
Model-free reinforcement learning has seen tremendous advances in the last few years, however practi...
Gradient-based methods have been widely used for system design and optimization in diverse applicati...
Thesis (Ph.D.)--University of Washington, 2020In this thesis, we shall study optimal control problem...
Reinforcement learning (RL) has demonstrated impressive performance in various domains such as video...
A novel method of an adaptive linear quadratic (LQ) regulation of uncertain continuous linear time-i...
A reinforcement learning (RL) control policy could fail in a new/perturbed environment that is diffe...
International audienceIn some reinforcement learning problems an agent may be provided with a set of...
In this thesis, we study the related problems of reinforcement learning and optimal adaptive control...
The field of linear control has seen broad application in fields as diverse as robotics, aviation,...
We consider the linear quadratic regulation problem when the plant is an unknown linear dynamical sy...
Stabilizing the unknown dynamics of a control system and minimizing regret in control of an unknown ...
We study the problem of adaptive control in partially observable linear quadratic Gaussian control s...
We consider the problem of online adaptive control of the linear quadratic regulator, where the true...
With potential applications as diverse as self-driving cars, medical robots, and network protocols, ...
We study the problem of adaptive control in partially observable linear dynamical systems. We propos...
Model-free reinforcement learning has seen tremendous advances in the last few years, however practi...
Gradient-based methods have been widely used for system design and optimization in diverse applicati...
Thesis (Ph.D.)--University of Washington, 2020In this thesis, we shall study optimal control problem...
Reinforcement learning (RL) has demonstrated impressive performance in various domains such as video...
A novel method of an adaptive linear quadratic (LQ) regulation of uncertain continuous linear time-i...
A reinforcement learning (RL) control policy could fail in a new/perturbed environment that is diffe...
International audienceIn some reinforcement learning problems an agent may be provided with a set of...
In this thesis, we study the related problems of reinforcement learning and optimal adaptive control...
The field of linear control has seen broad application in fields as diverse as robotics, aviation,...
We consider the linear quadratic regulation problem when the plant is an unknown linear dynamical sy...