Attention-based neural networks have become pervasive in many AI tasks. Despite their excellent algorithmic performance, the attention mechanism and feed-forward network (FFN) demand excessive computational and memory resources, which often compromises their hardware performance. Although various sparse variants have been introduced, most approaches focus only on mitigating the quadratic scaling of attention at the algorithm level, without explicitly considering the efficiency of mapping them onto real hardware designs. Furthermore, most efforts target either the attention mechanism or the FFNs, without jointly optimizing both parts, so most current designs lack scalability when dealing with diffe...
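For reference, below is a minimal NumPy sketch (not this work's accelerator design) of standard single-head scaled dot-product attention followed by an FFN block. It illustrates the quadratic-in-sequence-length attention cost and the weight-dominated FFN cost referred to above; all dimensions and names here are illustrative assumptions.

```python
# Minimal sketch: single-head scaled dot-product attention + ReLU FFN.
# The (n, n) score matrix is the source of the quadratic scaling in
# sequence length n; the FFN cost grows linearly with n but is dominated
# by the d_model x d_ffn weight matrices. Sizes below are illustrative.
import numpy as np

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def attention(x, Wq, Wk, Wv):
    q, k, v = x @ Wq, x @ Wk, x @ Wv          # each: (n, d)
    scores = q @ k.T / np.sqrt(q.shape[-1])   # (n, n): the O(n^2) term
    return softmax(scores) @ v                # (n, d)

def ffn(x, W1, b1, W2, b2):
    return np.maximum(0.0, x @ W1 + b1) @ W2 + b2   # O(n * d * d_ffn)

rng = np.random.default_rng(0)
n, d, d_ffn = 128, 64, 256                    # sequence length, model dim, FFN dim
x = rng.standard_normal((n, d))
Wq, Wk, Wv = (rng.standard_normal((d, d)) for _ in range(3))
W1, b1 = rng.standard_normal((d, d_ffn)), np.zeros(d_ffn)
W2, b2 = rng.standard_normal((d_ffn, d)), np.zeros(d)

y = ffn(attention(x, Wq, Wk, Wv), W1, b1, W2, b2)
print(y.shape)  # (128, 64)
```

Doubling n quadruples the size of the attention score matrix, while the FFN work only doubles, which is why algorithm-level sparsification alone addresses just part of the overall cost.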
Doctor of Philosophy, Department of Computer Science, Arslan Munir. Deep neural networks (DNNs) have gaine...
The attention mechanism is the key to many state-of-the-art transformer-based models in Natural Lang...
The optical neural network (ONN) is a promising hardware platform for next-generation neurocomputing...
This repo contains the artifacts for our MICRO'22 paper titled "Adaptable Butterfly Accelerator for ...
The study of specialized accelerators tailored for neural networks is becoming a promising topic in ...
Spiking Neural Networks (SNNs) are bio-plausible models that hold great potential for realizing ener...
Overparameterized neural networks generalize well but are expensive to train. Ideally, one would lik...
Spiking neural networks (SNNs) have achieved orders of magnitude improvement in terms of energy cons...
The self-attention mechanism is rapidly emerging as one of the most important key primit...
Spiking Neural Networks (SNN) are an emerging type of biologically plausible and efficient Artificia...
Sparsity – the presence of many zero values – is a pervasive property of modern deep neural networks...
Deep neural networks virtually dominate the domain of most modern vision systems, providing high per...
Implementing embedded neural network processing at the edge requires efficient hardware acceleration...
Compiler frameworks are crucial for the widespread use of FPGA-based deep learning accelerators. The...
Long Short-Term Memory (LSTM) recurrent networks are frequently used for tasks involving time-sequen...