Thesis (Ph.D.)--University of Washington, 2022Sequential decision making, especially in the face of uncertainty, is a central challenge in our quest to build increasingly safe, capable, and (seemingly-)intelligent autonomous systems. Whereas supervised learning is concerned with selecting optimal actions in independent interactions with an environment, policy learning studies action selection in sequential, dependent interactions in an environment. Policy learning constitutes an elemental component of many techniques across reinforcement learning, optimal control, and robotics. In this dissertation, a variety of perspectives on policy learning are presented, each taking a slightly different lens to the problem. We analyze theoretical and pr...
International audiencePolicy search is a method for approximately solving an optimal control problem...
In robotics, elementary behaviour patterns often tackle control theoretic problems. Because of incom...
Much of the focus on finding good representations in reinforcement learning has been on learning com...
We present an in-depth survey of policy gradient methods as they are used in the machine learning co...
Policy Learning approaches are among the best suited methods for high-dimensional, continuous contro...
This thesis is mostly focused on reinforcement learning, which is viewed as an optimization problem:...
It is known that existing policy gradient methods (such as vanilla policy gradient, PPO, A2C) may su...
textabstractMany traditional reinforcement-learning algorithms have been designed for problems with ...
Policy gradient methods are a type of reinforcement learning techniques that rely upon optimizing pa...
Policy Gradient (PG) algorithms are among the best candidates for the much-anticipated applications ...
Policy gradient (PG) algorithms are among the best candidates for the much-anticipated applications ...
We study policy gradient (PG) for reinforcement learning in continuous time and space under the regu...
A policy gradient method is a reinforcement learning approach that directly optimizes a parametrized...
Gradient-based methods have been widely used for system design and optimization in diverse applicati...
Thesis (Ph.D.), Computer Science, Washington State UniversityTransfer learning is a method in machin...
International audiencePolicy search is a method for approximately solving an optimal control problem...
In robotics, elementary behaviour patterns often tackle control theoretic problems. Because of incom...
Much of the focus on finding good representations in reinforcement learning has been on learning com...
We present an in-depth survey of policy gradient methods as they are used in the machine learning co...
Policy Learning approaches are among the best suited methods for high-dimensional, continuous contro...
This thesis is mostly focused on reinforcement learning, which is viewed as an optimization problem:...
It is known that existing policy gradient methods (such as vanilla policy gradient, PPO, A2C) may su...
textabstractMany traditional reinforcement-learning algorithms have been designed for problems with ...
Policy gradient methods are a type of reinforcement learning techniques that rely upon optimizing pa...
Policy Gradient (PG) algorithms are among the best candidates for the much-anticipated applications ...
Policy gradient (PG) algorithms are among the best candidates for the much-anticipated applications ...
We study policy gradient (PG) for reinforcement learning in continuous time and space under the regu...
A policy gradient method is a reinforcement learning approach that directly optimizes a parametrized...
Gradient-based methods have been widely used for system design and optimization in diverse applicati...
Thesis (Ph.D.), Computer Science, Washington State UniversityTransfer learning is a method in machin...
International audiencePolicy search is a method for approximately solving an optimal control problem...
In robotics, elementary behaviour patterns often tackle control theoretic problems. Because of incom...
Much of the focus on finding good representations in reinforcement learning has been on learning com...