This paper describes the principle of "General Cyclical Training" in machine learning, where training starts and ends with "easy training" and the "hard training" happens during the middle epochs. We propose several manifestations for training neural networks, including algorithmic examples (via hyper-parameters and loss functions), data-based examples, and model-based examples. Specifically, we introduce several novel techniques: cyclical weight decay, cyclical batch size, cyclical focal loss, cyclical softmax temperature, cyclical data augmentation, cyclical gradient clipping, and cyclical semi-supervised learning. In addition, we demonstrate that cyclical weight decay, cyclical softmax temperature, and cyclical gradient clipping (as thre...
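The abstract names the cyclical schedules but not their exact form. As a minimal sketch, assuming a triangular easy-hard-easy schedule over epochs, one scalar hyper-parameter such as weight decay could be driven as follows; the function cyclical_value, the endpoint values, and the choice of "larger weight decay = harder training" are illustrative assumptions, not the authors' implementation:

def cyclical_value(epoch, total_epochs, easy_value, hard_value):
    # Triangular easy-hard-easy schedule: equals easy_value at the first
    # and last epoch and peaks at hard_value at the midpoint of training.
    # Illustrative sketch only; the paper's exact schedules may differ.
    t = epoch / max(total_epochs - 1, 1)   # fraction of training in [0, 1]
    ramp = 1.0 - abs(2.0 * t - 1.0)        # 0 at the ends, 1 at the midpoint
    return easy_value + (hard_value - easy_value) * ramp

# Example: cyclical weight decay, treating stronger regularization as
# "harder" training during the middle epochs (an assumption here).
for epoch in (0, 25, 50, 75, 99):
    wd = cyclical_value(epoch, total_epochs=100,
                        easy_value=1e-4, hard_value=1e-2)
    print(f"epoch {epoch:3d}: weight_decay = {wd:.5f}")

Under the same assumption, the identical scalar schedule could drive a cyclical batch size, softmax temperature, or gradient-clipping threshold by substituting appropriate easy and hard endpoint values.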
Existing metrics for the learning performance of feed-forward neural networks do not provide a satis...
Attractor properties of a popular discrete-time neural network model are illustrated through numeric...
This thesis presents a new theory of generalization in neural network types of learning machines. Th...
The paper first summarizes a general approach to the training of recurrent neural networks by gradie...
Two backpropagation algorithms with momentum for feedforward neural networks with a singl...
Learning curves show how a neural network is improved as the number of training examples increases a...
The importance of the problem of designing learning machines rests on the promise of one day deliver...
There are many types of activity which are commonly known as ‘learning’. Here, we shall discuss a ma...
Constructive algorithms have proved to be powerful methods for training feedforward neural networks....
This thesis is divided into two parts: the first examines various extensions to Cascade-Correlation,...
The cross-entropy softmax loss is the primary loss function used to train deep neural networks. On t...
In this chapter, we describe the basic concepts behind the functioning of recurrent neural networks ...
This paper presents a novel approach to feeding data to a Convolutional Neural Network (CNN) wh...
Neural network modeling typically ignores the role of knowledge in learning by starting from random ...
It is often difficult to predict the optimal neural network size for a particular application. Const...