LRP-based network pruning and policy distillation of robust and non-robust DRL agents for embedded systems

Luan, Siyu
Gu, Zonghua
Xu, Rui
Zhao, Qingling
Chen, Gang

Publication date

January 2023

DOI

Abstract

Reinforcement learning (RL) is an effective approach to developing control policies by maximizing the agent's reward. Deep reinforcement learning uses deep neural networks (DNNs) for function approximation in RL, and has achieved tremendous success in recent years. Large DNNs often incur significant memory size and computational overheads, which may impede their deployment into resource-constrained embedded systems. For deployment of a trained RL agent on embedded systems, it is necessary to compress the policy network of the RL agent to improve its memory and computation efficiency. In this article, we perform model compression of the policy network of an RL agent by leveraging the relevance scores computed by layer-wise relevance propagat...