Attention mechanisms have played a crucial role in the development of complex architectures such as Transformers in natural language processing. However, Transformers remain hard to interpret and are considered black boxes. In this paper we assess how attention coefficients from Transformers, when properly aggregated, can provide classifier interpretability. We propose a fast and easy-to-implement way of aggregating attention to build local feature importance. A human-grounded experiment is conducted to evaluate and compare this approach with other common interpretability methods. The experimental protocol relies on the capacity of an interpretability method to provide explanations in line with human reasoning. Exp...
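To make the idea of attention aggregation concrete, below is a minimal sketch of one plausible scheme: averaging the attention that the [CLS] token pays to each input token, across all heads and layers, to obtain a per-token importance score. The model name, the use of the [CLS] row, and the mean over heads and layers are illustrative assumptions for this sketch, not the paper's exact aggregation recipe.

```python
import torch
from transformers import AutoModelForSequenceClassification, AutoTokenizer

# Assumption: any encoder classifier that exposes attention weights works here.
model_name = "bert-base-uncased"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSequenceClassification.from_pretrained(
    model_name, output_attentions=True
)
model.eval()

def attention_importance(text: str) -> list[tuple[str, float]]:
    """Return (token, importance) pairs from aggregated attention weights."""
    inputs = tokenizer(text, return_tensors="pt")
    with torch.no_grad():
        outputs = model(**inputs)
    # outputs.attentions: tuple of (batch, heads, seq, seq), one tensor per layer
    attn = torch.stack(outputs.attentions)   # (layers, batch, heads, seq, seq)
    cls_attn = attn[:, 0, :, 0, :]           # attention from [CLS] to every token
    scores = cls_attn.mean(dim=(0, 1))       # average over layers and heads
    tokens = tokenizer.convert_ids_to_tokens(inputs["input_ids"][0])
    return list(zip(tokens, scores.tolist()))

for tok, score in attention_importance("The movie was surprisingly good."):
    print(f"{tok:>12s}  {score:.3f}")
```

This kind of aggregation is fast because it reuses attention weights already computed in a single forward pass, in contrast to perturbation-based methods such as LIME that require many model evaluations per explanation.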