The linear quadratic regulator (LQR) problem has reemerged as an important theoretical benchmark for reinforcement learning-based control of complex dynamical systems with continuous state and action spaces. In contrast with nearly all recent work in this area, we consider multiplicative noise models, which are increasingly relevant because they explicitly incorporate inherent uncertainty and variation in the system dynamics and thereby improve robustness properties of the controller. Robustness is a critical and poorly understood issue in reinforcement learning; existing methods which do not account for uncertainty can converge to fragile policies or fail to converge at all. Additionally, intentional injection of multiplicative noise into ...
This paper considers the Linear Quadratic Regulator problem for linear systems with unknown dynamics...
This paper presents a convex optimization-based solution to the design of state-feedback controllers...
In this thesis, the global convergence of model-based and model-free gradient descent and natural po...
We explore reinforcement learning methods for finding the optimal policy in the linear quadratic reg...
The linear quadratic framework is widely studied in the literature on stochastic control and game th...
Reinforcement learning (RL) has demonstrated impressive performance in various domains such as video...
This paper studies the robustness of reinforcement learning algorithms to errors in the learning pro...
Thesis (Ph.D.)--University of Washington, 2020In this thesis, we shall study optimal control problem...
The Linear Quadratic Regulator (LQR) framework considers the problem of regulating a linear dynamica...
We consider the linear quadratic regulation problem when the plant is an unknown linear dynamical sy...
International audienceOptimal control of nonlinear systems is a difficult problem which has been add...
Robust stability and stochastic stability have separately seen intense study in control theory for m...
In this paper, we will deal with a linear quadratic optimal control problem with unknown dynamics. A...
Gradient-based methods have been widely used for system design and optimization in diverse applicati...
The optimization landscape of optimal control problems plays an important role in the convergence of...
This paper considers the Linear Quadratic Regulator problem for linear systems with unknown dynamics...
This paper presents a convex optimization-based solution to the design of state-feedback controllers...
In this thesis, the global convergence of model-based and model-free gradient descent and natural po...
We explore reinforcement learning methods for finding the optimal policy in the linear quadratic reg...
The linear quadratic framework is widely studied in the literature on stochastic control and game th...
Reinforcement learning (RL) has demonstrated impressive performance in various domains such as video...
This paper studies the robustness of reinforcement learning algorithms to errors in the learning pro...
Thesis (Ph.D.)--University of Washington, 2020In this thesis, we shall study optimal control problem...
The Linear Quadratic Regulator (LQR) framework considers the problem of regulating a linear dynamica...
We consider the linear quadratic regulation problem when the plant is an unknown linear dynamical sy...
International audienceOptimal control of nonlinear systems is a difficult problem which has been add...
Robust stability and stochastic stability have separately seen intense study in control theory for m...
In this paper, we will deal with a linear quadratic optimal control problem with unknown dynamics. A...
Gradient-based methods have been widely used for system design and optimization in diverse applicati...
The optimization landscape of optimal control problems plays an important role in the convergence of...
This paper considers the Linear Quadratic Regulator problem for linear systems with unknown dynamics...
This paper presents a convex optimization-based solution to the design of state-feedback controllers...
In this thesis, the global convergence of model-based and model-free gradient descent and natural po...