We address the problem of simultaneously learning and control in an online receding horizon control setting. We consider the control of an unknown linear dynamical system with general cost functions and affine constraints on the control input. Our goal is to develop an online learning algorithm that minimizes the dynamic regret, which is defined as the difference between the cumulative cost incurred by the algorithm and that of the best policy with full knowledge of the system, cost functions and state and that satisfies the control input constraints. We propose a novel approach to explore in an online receding horizon setting. The key challenge is to ensure that the control generated by the receding horizon controller is persistently excit...
In this paper, we study the problem of efficient online reinforcement learning in the infinite horiz...
We study the online robust control problem for linear dynamical systems with disturbances and uncert...
This paper studies online solutions for regret-optimal control in partially observable systems over ...
In this paper we provide provable regret guarantees for an online meta-learning receding horizon con...
We study the problem of online learning in predictive control of an unknown linear dynamical system ...
In the last century, the problem of controlling a dynamical system has been a core component in nume...
Optimal controllers are usually designed to minimize cost under the assumption that the disturbance ...
We consider the fundamental problem of online control of a linear dynamical system from two differen...
The study of online control of unknown time varying dynamical systems is a relatively under-explored...
We study the impact of predictions in online Linear Quadratic Regulator control with both stochastic...
In this work we address the problem of the online robust control of nonlinear dynamical systems pert...
We study the problem of adaptive control in partially observable linear dynamical systems. We propos...
We consider the problem of online adaptive control of the linear quadratic regulator, where the true...
Recently a line of work has shown the applicability of tools from online optimization for control, l...
We consider the control of linear time-varying dynamical systems from the perspective of regret mini...
In this paper, we study the problem of efficient online reinforcement learning in the infinite horiz...
We study the online robust control problem for linear dynamical systems with disturbances and uncert...
This paper studies online solutions for regret-optimal control in partially observable systems over ...
In this paper we provide provable regret guarantees for an online meta-learning receding horizon con...
We study the problem of online learning in predictive control of an unknown linear dynamical system ...
In the last century, the problem of controlling a dynamical system has been a core component in nume...
Optimal controllers are usually designed to minimize cost under the assumption that the disturbance ...
We consider the fundamental problem of online control of a linear dynamical system from two differen...
The study of online control of unknown time varying dynamical systems is a relatively under-explored...
We study the impact of predictions in online Linear Quadratic Regulator control with both stochastic...
In this work we address the problem of the online robust control of nonlinear dynamical systems pert...
We study the problem of adaptive control in partially observable linear dynamical systems. We propos...
We consider the problem of online adaptive control of the linear quadratic regulator, where the true...
Recently a line of work has shown the applicability of tools from online optimization for control, l...
We consider the control of linear time-varying dynamical systems from the perspective of regret mini...
In this paper, we study the problem of efficient online reinforcement learning in the infinite horiz...
We study the online robust control problem for linear dynamical systems with disturbances and uncert...
This paper studies online solutions for regret-optimal control in partially observable systems over ...