Characterizing the implicit structure of the computation within neural networks is a foundational problem in the area of deep learning interpretability. Can their inner decision process be captured symbolically in some familiar logic? We show that any transformer neural network can be translated into an equivalent fixed-size first-order logic formula which may also use majority quantifiers. The idea is to simulate transformers with highly uniform threshold circuits and leverage known theoretical connections between circuits and logic. Our findings also reveal the surprising fact that the entire transformer computation can be reduced merely to the division of two (large) integers. While our results are most pertinent for transformers, they a...
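As a toy illustration of the circuits-logic connection invoked above, the sketch below evaluates a one-quantifier FO(M) sentence over a binary string in two equivalent ways: directly via a majority quantifier, and as a single threshold gate. This is a minimal sketch of the general correspondence between majority quantifiers and threshold circuits, not the paper's construction; the helper names (`Q`, `majority`, `threshold_gate`) and the example string are illustrative assumptions.

```python
# Toy FO(M) evaluation (illustrative; not the paper's construction).
# FO(M) extends first-order logic with a majority quantifier
# "M i. phi(i)", true iff phi(i) holds at more than half the positions.

def Q(w: str, symbol: str, i: int) -> bool:
    """Atomic predicate Q_symbol(i): position i of w holds `symbol`."""
    return w[i] == symbol

def majority(w: str, phi) -> bool:
    """Majority quantifier M i. phi(i): phi holds at > n/2 positions."""
    n = len(w)
    return sum(phi(i) for i in range(n)) * 2 > n

def threshold_gate(inputs, k: int) -> bool:
    """Threshold gate: fires iff at least k of its inputs are 1.
    Majority over n inputs is the special case k = n // 2 + 1, which is
    one direction of the interconversion between majority quantifiers
    and threshold circuits."""
    return sum(inputs) >= k

w = "1101001101"
# The sentence "M i. Q_1(i)": a majority of positions carry symbol 1.
as_logic = majority(w, lambda i: Q(w, "1", i))
# The same sentence computed as a single threshold gate over indicator bits.
as_circuit = threshold_gate([int(c == "1") for c in w], len(w) // 2 + 1)
assert as_logic == as_circuit
print(as_logic)  # True: six of the ten positions hold 1
```

At this toy scale the translation is a single gate; the result stated above scales the same idea up, simulating an entire transformer with a highly uniform constant-depth threshold circuit and then converting that circuit into a fixed-size FO(M) formula.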