Large-batch SGD is important for scaling training of deep neural networks. However, without fine-tuning hyperparameter schedules, the generalization of the model may be hampered. We propose to use batch augmentation: replicating instances of samples within the same batch with different data augmentations. Batch augmentation acts as a regularizer and an accelerator, increasing both generalization and performance scaling for a fixed budget of optimization steps. We analyze the effect of batch augmentation on gradient variance and show that it empirically improves convergence for a wide variety of networks and datasets. Our results show that batch augmentation reduces the number of necessary SGD updates to achieve the same accuracy as the stat...
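To make the replication step concrete, below is a minimal sketch of batch augmentation in PyTorch: each sample in the batch appears M times, and every copy receives an independently drawn random augmentation. The function name, the default M=4, and the generic `transform` callable are illustrative assumptions, not the authors' implementation.

```python
import torch

def batch_augment(images, labels, transform, m=4):
    """Replicate each sample `m` times with independent random augmentations.

    images: tensor of shape (B, C, H, W)
    labels: tensor of shape (B,)
    transform: any random per-image augmentation that maps a (C, H, W)
        tensor to a (C, H, W) tensor (e.g. a torchvision v2 transform).
    Returns an (m * B, C, H, W) batch and the matching (m * B,) labels.
    """
    copies = []
    for _ in range(m):
        # a fresh random augmentation is sampled for every replica
        copies.append(torch.stack([transform(img) for img in images]))
    aug_images = torch.cat(copies, dim=0)   # m * B augmented views
    aug_labels = labels.repeat(m)           # labels are simply repeated
    return aug_images, aug_labels
```

The augmented batch is M times larger while still drawing on only B distinct samples, so a fixed budget of optimization steps processes M times more augmented views of each example.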
Training Deep Neural Networks is complicated by the fact that the distribution of each layer’s input...
Neural networks possess an ability to generalize well to data distribution, to an extent that they a...
Importance sampling, a variant of online sampling, is often used in neural network training to impro...
Deep learning networks are typically trained by Stochastic Gradient Descent (SGD) methods that itera...
Mini-batch stochastic gradient methods (SGD) are state of the art for distributed training of deep n...
Batch Normalization (BN) has been a standard component in designing deep neural networks (DNNs). Alt...
Batch Normalization (BatchNorm) is a widely adopte...
Utilizing recently introduced concepts from statistics and quantitative risk management, we present ...
Batch Normalization (BN) (Ioffe and Szegedy 2015) normalizes the features of an input image via stat...
In this work, we propose to progressively increase the training difficulty during learning a neural ...
We present a comprehensive framework of search methods, such as simulated annealing and batch traini...
In distributed training of deep neural networks, parallel minibatch SGD is widely used to speed up t...
Despite the significant success of deep learning in computer vision tasks, cross-domain tasks still ...
We propose a metric for evaluating the generalization ability of deep neural networks trained with m...
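Several of the snippets above describe Batch Normalization as normalizing each feature via statistics computed over a batch of images. For reference, a minimal training-mode sketch of that per-channel computation (running statistics and inference behaviour are omitted, and the names are illustrative):

```python
import torch

def batch_norm_2d(x, gamma, beta, eps=1e-5):
    """Training-mode batch normalization for an NCHW tensor.

    x: tensor of shape (B, C, H, W)
    gamma, beta: learnable per-channel scale and shift, shape (C,)
    Each channel is normalized with the mean and variance computed over
    the batch and spatial dimensions, then rescaled by gamma and beta.
    """
    mean = x.mean(dim=(0, 2, 3), keepdim=True)                 # per-channel mean
    var = x.var(dim=(0, 2, 3), unbiased=False, keepdim=True)   # per-channel variance
    x_hat = (x - mean) / torch.sqrt(var + eps)                 # normalize
    return gamma.view(1, -1, 1, 1) * x_hat + beta.view(1, -1, 1, 1)
```

At inference time, implementations typically substitute running averages of the mean and variance accumulated during training for the per-batch statistics.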