Policy Learning with Embedded Koopman Optimal Control

Yin, Hang
Welle, Michael C.
Kragic, Danica

Publisher

KTH, Centrum för autonoma system, CAS

Abstract

Embedding an optimization process has been explored for imposing efficient and flexible policy structures. Existing work often builds upon nonlinear optimization with explicit unrolling of iteration steps, making policy inference prohibitively expensive for online learning and real-time control. Our approach embeds a linear-quadratic-regulator (LQR) formulation with a Koopman representation, thus combining the tractability of a closed-form solution with the richness of a non-convex neural network. We use a few auxiliary objectives and reparameterization to enforce optimality conditions of the policy, which can be easily integrated into standard gradient-based learning. Our approach is shown to be effective for learning policies rendering an op...
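
To make the idea of an LQR embedded in a Koopman (lifted) representation concrete, the following is a minimal sketch and not the authors' implementation. It assumes a hand-written lifting function `lift` in place of the learned neural network, illustrative latent matrices `A`, `B`, and cost weights `Q`, `R` (all hypothetical), and a plain Riccati fixed-point iteration to obtain the feedback gain. It only illustrates why policy inference can be cheap once the dynamics are linear in the lifted state: the control is a single matrix product.

```python
import numpy as np

def lift(x):
    # Hypothetical fixed lifting of a 2-D state into a 3-D latent state.
    # In the paper this role is played by a learned neural network.
    return np.array([x[0], x[1], x[0] ** 2])

def lqr_gain(A, B, Q, R, iters=500):
    # Iterate the discrete-time Riccati recursion towards a fixed point
    # and return the feedback gain K together with the cost matrix P.
    P = Q.copy()
    for _ in range(iters):
        BtP = B.T @ P
        K = np.linalg.solve(R + BtP @ B, BtP @ A)
        P = Q + A.T @ P @ (A - B @ K)
    return K, P

# Assumed latent linear dynamics z_{t+1} = A z_t + B u_t (illustrative values).
A = np.array([[1.0, 0.1, 0.0],
              [0.0, 1.0, 0.1],
              [0.0, 0.0, 0.9]])
B = np.array([[0.0], [0.1], [0.0]])
Q = np.eye(3)            # quadratic cost on the latent state (assumed)
R = np.array([[0.1]])    # quadratic cost on the control (assumed)

K, P = lqr_gain(A, B, Q, R)

def policy(x):
    # Linear feedback in the lifted space: u = -K * lift(x).
    return -K @ lift(x)

u = policy(np.array([1.0, 0.0]))
```

In this sketch the gain `K` is computed once and reused, so evaluating `policy` costs one lifting pass and one matrix-vector product, with no unrolled optimizer iterations at inference time.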
