We introduce token-consistent stochastic layers into vision transformers without causing any severe drop in performance. The added stochasticity improves network calibration and robustness while strengthening privacy. Inside the multilayer perceptron blocks we use linear layers with token-consistent stochastic parameters, leaving the architecture of the transformer unchanged. The stochastic parameters are sampled from a uniform distribution during both training and inference. The applied linear operations preserve the topological structure formed by the set of tokens passing through the shared multilayer perceptron. This operation encourages the learning of the recognition task to rely on the topological structures of the tokens, instead of thei...
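As a rough illustration of the mechanism described above, the PyTorch-style sketch below applies multiplicative uniform noise to the weights of the MLP's linear layers, drawing one noise sample per forward pass so that every token is transformed by the same stochastic linear map, at training and inference alike. The class names, the multiplicative form of the noise, and the noise bounds (0.8, 1.2) are assumptions made for illustration, not the exact formulation used in the paper.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class TokenConsistentStochasticLinear(nn.Module):
    """Linear layer whose weights are perturbed by multiplicative noise drawn
    from a uniform distribution. One noise sample is drawn per forward pass and
    shared by every token, so all tokens see the same stochastic linear map.
    (Sketch only; the noise form and bounds are assumptions.)"""

    def __init__(self, in_features: int, out_features: int,
                 noise_range: tuple = (0.8, 1.2)):
        super().__init__()
        self.linear = nn.Linear(in_features, out_features)
        # Bounds of the uniform noise are an assumption of this sketch.
        self.low, self.high = noise_range

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x has shape (batch, num_tokens, in_features).
        # Noise is sampled both at training and at inference time, and is
        # shared across all tokens (here also across the batch, for simplicity).
        noise = torch.empty_like(self.linear.weight).uniform_(self.low, self.high)
        return F.linear(x, self.linear.weight * noise, self.linear.bias)


class StochasticMLPBlock(nn.Module):
    """Transformer MLP block with its dense layers swapped for the
    token-consistent stochastic variant; the block structure is unchanged."""

    def __init__(self, dim: int, hidden_dim: int):
        super().__init__()
        self.fc1 = TokenConsistentStochasticLinear(dim, hidden_dim)
        self.act = nn.GELU()
        self.fc2 = TokenConsistentStochasticLinear(hidden_dim, dim)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.fc2(self.act(self.fc1(x)))


# Usage: 2 images, 197 tokens (196 patches + class token), embedding dim 768.
tokens = torch.randn(2, 197, 768)
block = StochasticMLPBlock(dim=768, hidden_dim=3072)
print(block(tokens).shape)  # torch.Size([2, 197, 768])
```

Because the same stochastic parameters act on every token within a pass, the relative geometry of the token set is preserved even though the individual outputs vary from pass to pass.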