Deep neural networks (DNNs) for supervised learning can be viewed as a pipeline of a feature extractor (i.e., the last hidden layer) and a linear classifier (i.e., the output layer) that are trained jointly with stochastic gradient descent (SGD) on a loss function (e.g., cross-entropy). In each iteration, the true gradient of the loss function is estimated using a mini-batch sampled from the training set, and the model parameters are then updated with the mini-batch gradient. Although the mini-batch gradient is an unbiased estimate of the true gradient, it is subject to substantial variance, which depends on the size and composition of the sampled mini-batches, leading to noisy and jumpy updates. To stabilize such undesirable variance in estimating the true gradients, we pr...
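As a concrete illustration of the unbiased-but-noisy behavior described above, the following minimal NumPy sketch (our own illustration, not code from the paper) compares mini-batch gradients of a squared-error loss for a linear model against the full-batch gradient. The model, loss, batch size, and all variable names are illustrative assumptions:

    # Sketch: mini-batch gradients are unbiased but high-variance estimates
    # of the full-batch ("true") gradient. Linear model with squared error
    # and batch size 32 are assumptions chosen for illustration.
    import numpy as np

    rng = np.random.default_rng(0)

    # Synthetic regression data: y = X @ w_true + noise
    n, d = 10_000, 20
    X = rng.normal(size=(n, d))
    w_true = rng.normal(size=d)
    y = X @ w_true + 0.1 * rng.normal(size=n)

    w = np.zeros(d)  # current model parameters

    def gradient(Xb, yb, w):
        # Gradient of 0.5 * mean((Xb @ w - yb)**2) with respect to w.
        return Xb.T @ (Xb @ w - yb) / len(yb)

    full_grad = gradient(X, y, w)  # the "true" gradient over the whole set

    # Draw many mini-batches and compare their gradients to the full gradient.
    batch_size, trials = 32, 1_000
    mb_grads = np.empty((trials, d))
    for t in range(trials):
        idx = rng.choice(n, size=batch_size, replace=False)
        mb_grads[t] = gradient(X[idx], y[idx], w)

    bias = np.linalg.norm(mb_grads.mean(axis=0) - full_grad)  # ~0: unbiased
    spread = np.linalg.norm(mb_grads.std(axis=0))             # > 0: noisy
    print(f"bias of mini-batch gradient estimate:  {bias:.4f}")
    print(f"per-coordinate std of the estimates:   {spread:.4f}")

Running this shows a near-zero bias but a clearly nonzero spread; increasing the batch size shrinks the spread roughly as one over the square root of the batch size, which is the variance the abstract refers to.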
Unsupervised joint alignment of images has been demonstrated to improve performance on recognition t...
While the backpropagation of error algorithm allowed for a rapid rise in the development and deploym...
Learning in deep neural networks takes place by minimizing a nonconvex high-dimensional loss functio...
Learning with noisy labels is one of the most practical but challenging tasks in deep learning. One ...
Direct feedback alignment (DFA) is emerging as an efficient and biologically plausible alternative t...
The performance of deep neural networks (DNNs) critically relies on high-quality annotations, while ...
Training with the true labels of a dataset as opposed to randomized labels leads to faster optimizat...
Inspired by two basic mechanisms in animal visual systems, we introduce a feature transform techniqu...
Adversarial examples easily mislead vision systems based on deep neural networks (DNNs) trained with...
Although deep neural networks have proven effective in many applications, they are data-hungry,...
Recent research reveals that deep neural networks are sensitive to label noise, hence leading to po...
Recent advances in large pre-trained models have shown promising results in few-shot learning. However, ...
Despite being robust to small amounts of label noise, convolutional neural networks trained with sto...