Transformer models have achieved promising results on natural language processing (NLP) tasks, including extractive question answering (QA). Common Transformer encoders used in NLP tasks process the hidden states of all input tokens in the context paragraph throughout all layers. However, unlike tasks such as sequence classification, answering the raised question does not necessarily require all the tokens in the context paragraph. Following this motivation, we propose Block-Skim, which learns to skim unnecessary context in higher hidden layers to improve and accelerate Transformer performance. The key idea of Block-Skim is to identify the context blocks that must be further processed and those that can be safely discarded early on...
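The early-discarding idea above can be illustrated with a minimal sketch. This is not the paper's actual implementation; the function name, array shapes, and threshold are all assumptions chosen for illustration. It shows the core operation: given per-block relevance scores (which Block-Skim would predict from attention or a learned head), keep only the context blocks that pass a threshold so later layers process fewer tokens.

```python
import numpy as np

def block_skim_step(hidden_states, block_scores, threshold=0.5):
    """Hypothetical sketch of a Block-Skim-style skimming step.

    hidden_states: (num_blocks, block_len, dim) per-block token states
    block_scores:  (num_blocks,) predicted relevance of each block
    Returns the retained blocks and the boolean keep-mask.
    """
    keep = block_scores > threshold          # blocks worth processing further
    return hidden_states[keep], keep         # discarded blocks skip later layers

# Example: 4 context blocks, 8 tokens each, hidden size 16
h = np.zeros((4, 8, 16))
scores = np.array([0.9, 0.1, 0.7, 0.2])
kept, mask = block_skim_step(h, scores)
```

In practice the keep-mask would also be applied to the attention mask so that surviving tokens never attend to discarded blocks, which is where the speedup comes from.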
Sparse Transformers have surpassed Graph Neural Networks (GNNs) as the state-of-the-art architecture...
A long-term ambition of information seeking QA systems is to reason over multi-modal contexts and ge...
In Natural Language Processing (NLP), Automatic Question Generation (AQG) is an important task that ...
We propose TandA, an effective technique for fine-tuning pre-trained Transformer models for natural ...
Given a large Transformer model, how can we obtain a small and computationally efficient model which...
Given a large Transformer model, how can we obtain a small and computationally efficient model which...
Transformer models cannot easily scale to long sequences due to their O(N^2) time and space complexi...
An important task for designing QA systems is answer sentence selection (AS2): selecting the sentenc...
The goal of this article is to develop a multiple-choice questions generation system that has a numb...
Transformer models, trained and publicly released over the last couple of years, have proved effecti...
Transformers are powerful for sequence modeling. Nearly all state-of-the-art language models and pre...
Large transformer models can highly improve Answer Sentence Selection (AS2) tasks, but their high co...
State space models (SSMs) have shown impressive results on tasks that require modeling long-range de...
Theoretical thesis. Bibliography: pages 49-57. 1 Introduction -- 2 Background and literature review -- ...
Retrieval augmented language models have recently become the standard for knowledge intensive tasks....