Self-attention mechanisms model long-range context by using pairwise attention between all input tokens. In doing so, they assume a fixed attention granularity defined by the individual tokens (e.g., text characters or image pixels), which may not be optimal for modeling complex dependencies at higher levels. In this paper, we propose ContextPool to address this problem by adapting the attention granularity for each token. Inspired by the success of ConvNets that are combined with pooling to capture long-range dependencies, we learn to pool neighboring features for each token before computing attention in a given attention layer. The pooling weights and support size are adaptively determined, allowing the pooled features to encode meaningfu...
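The mechanism sketched in this abstract (per-token pooling of neighboring features, with learned pooling weights and support size, applied before attention) can be illustrated with a minimal PyTorch-style sketch. The module name ContextPool, the helper predict_pool, and the Gaussian-window parameterization of the support size are illustrative assumptions here, not the paper's exact formulation.

# Hypothetical sketch of per-token context pooling before attention (PyTorch).
# The Gaussian window and the per-token importance weight are assumptions made
# for illustration; the pooled output would feed the Q/K/V projections of a
# standard attention layer.
import torch
import torch.nn as nn

class ContextPool(nn.Module):
    def __init__(self, dim, max_support=9):
        super().__init__()
        self.max_support = max_support
        # Predict, for each token, a pooling weight (importance) and a support size.
        self.predict_pool = nn.Linear(dim, 2)

    def forward(self, x):
        # x: (batch, seq_len, dim) token features entering an attention layer
        b, n, d = x.shape
        params = self.predict_pool(x)                 # (b, n, 2)
        weight = torch.sigmoid(params[..., 0:1])      # per-token importance in [0, 1]
        support = torch.sigmoid(params[..., 1:2])     # relative support size in (0, 1)

        # Build a Gaussian window per token whose width follows the predicted support.
        offsets = torch.arange(-(self.max_support // 2), self.max_support // 2 + 1,
                               device=x.device, dtype=x.dtype)           # (k,)
        sigma = support * (self.max_support / 2) + 1e-4                  # (b, n, 1)
        window = torch.exp(-(offsets ** 2) / (2 * sigma ** 2))           # (b, n, k)
        window = window / window.sum(dim=-1, keepdim=True)

        # Gather weighted neighbors and pool them for every token position.
        idx = torch.arange(n, device=x.device).unsqueeze(-1) + offsets.long()  # (n, k)
        idx = idx.clamp(0, n - 1)
        neighbors = (x * weight)[:, idx]              # (b, n, k, d)
        pooled = (window.unsqueeze(-1) * neighbors).sum(dim=2)           # (b, n, d)
        return pooled                                 # input to Q/K/V projections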
Large language models (LLMs) exhibit remarkable performance improvement through in-context learning ...
Recurrent n...
The capability of the self-attention mechanism to model the long-range dependencies has catapulted i...
The self-attention model has shown its flexibility in parallel computation and its effectiveness in mode...
In this paper, we aim to enhance self-attention (SA) mechanism for deep metric learning in visual pe...
Many NLP tasks require processing long contexts beyond the length limit of pretrained models. In ord...
The paper presents a scalable approach for learning distributed representations over individual toke...
In NLP, convolutional neural networks (CNNs) have benefited less than recurrent neural networks (RNN...
Deep convolutional neural networks (CNNs) have shown a strong ability in mining discriminative objec...
Large pre-trained vision-language models like CLIP have shown great potential in learning representa...
Object detection is an important component of computer vision. Most of the recent successfu...
We address the problem of learning on sets of features, motivated by the need ...
Self-attention, an architectural motif designed to model long-range interactions in sequential data,...
The Squeeze-and-Excitation (SE) block presents a channel attention mechanism for modeling global context...
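For reference, the SE recipe mentioned above (squeeze via global average pooling, excitation via a bottleneck MLP with sigmoid gating, then channel-wise rescaling) can be written as a short sketch; the reduction ratio of 16 is the commonly used default and is assumed here for illustration.

# Minimal sketch of a Squeeze-and-Excitation block (PyTorch).
import torch
import torch.nn as nn

class SEBlock(nn.Module):
    def __init__(self, channels, reduction=16):
        super().__init__()
        # Bottleneck MLP that maps the channel summary to per-channel gates.
        self.fc = nn.Sequential(
            nn.Linear(channels, channels // reduction),
            nn.ReLU(inplace=True),
            nn.Linear(channels // reduction, channels),
            nn.Sigmoid(),
        )

    def forward(self, x):
        # x: (batch, channels, height, width)
        b, c, _, _ = x.shape
        # Squeeze: global average pooling summarizes each channel into one value.
        s = x.mean(dim=(2, 3))                # (b, c)
        # Excitation: produce per-channel gates in [0, 1].
        gates = self.fc(s).view(b, c, 1, 1)
        # Recalibrate the feature map channel-wise.
        return x * gates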
Many natural language processing tasks solely rely on sparse dependencies between a few tokens in a ...