To fully unlock the potential of deep neural networks (DNNs), various learning algorithms have been developed to improve their generalization ability. Recently, sharpness-aware minimization (SAM) established a generic scheme for improving generalization by minimizing a sharpness measure within a small neighborhood, achieving state-of-the-art performance. However, SAM requires two sequential gradient evaluations to solve its min-max problem and thus inevitably doubles the training time. In this paper, we resort to filter-wise random weight perturbations (RWP) to decouple the nested gradients in SAM. Unlike the small adversarial perturbations in SAM, RWP is softer and allows a much larger perturbation magnitude....
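The contrast between the two schemes can be made concrete with a minimal PyTorch sketch. The SAM step below follows the standard published two-pass scheme (ascend to the adversarial point, then take the gradient there); the RWP step is one plausible reading of "filter-wise random weight perturbations," where each output filter is jittered by Gaussian noise scaled to that filter's own norm. The function names and the `rho`/`gamma` hyperparameters are illustrative assumptions, not the paper's exact implementation.

```python
import torch

def sam_step(model, loss_fn, x, y, rho=0.05):
    """One SAM update direction: two sequential gradient evaluations."""
    params = list(model.parameters())
    # First gradient pass: g = dL/dw at the current weights.
    loss = loss_fn(model(x), y)
    grads = torch.autograd.grad(loss, params)
    grad_norm = torch.sqrt(sum((g ** 2).sum() for g in grads))
    # Adversarial perturbation eps = rho * g / ||g||.
    eps = [rho * g / (grad_norm + 1e-12) for g in grads]
    with torch.no_grad():
        for p, e in zip(params, eps):
            p.add_(e)
    # Second gradient pass at the perturbed weights w + eps.
    loss_adv = loss_fn(model(x), y)
    sam_grads = torch.autograd.grad(loss_adv, params)
    with torch.no_grad():
        for p, e in zip(params, eps):
            p.sub_(e)  # restore the original weights
    return sam_grads

def rwp_step(model, loss_fn, x, y, gamma=0.01):
    """One RWP update direction: a single gradient evaluation.

    `gamma` is an assumed name for the perturbation strength.
    """
    params = list(model.parameters())
    noises = []
    with torch.no_grad():
        for p in params:
            if p.dim() > 1:  # conv/linear weights: scale noise per output filter
                filter_norms = p.view(p.size(0), -1).norm(dim=1)
                shape = (p.size(0),) + (1,) * (p.dim() - 1)
                n = gamma * filter_norms.view(shape) * torch.randn_like(p)
            else:            # biases etc.: plain scaled Gaussian noise
                n = gamma * torch.randn_like(p)
            p.add_(n)
            noises.append(n)
    # Single gradient pass at the randomly perturbed weights.
    loss = loss_fn(model(x), y)
    rwp_grads = torch.autograd.grad(loss, params)
    with torch.no_grad():
        for p, n in zip(params, noises):
            p.sub_(n)  # restore the original weights
    return rwp_grads
```

Because `rwp_step` needs only one forward-backward pass per batch, its per-iteration cost matches plain SGD, which is the source of the claimed speedup over SAM's two-pass update.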