Reliable generalization lies at the heart of safe ML and AI. However, understanding when and how neural networks generalize remains one of the most important unsolved problems in the field. In this work, we conduct an extensive empirical study (2200 models, 16 tasks) to investigate whether insights from the theory of computation can predict the limits of neural network generalization in practice. We demonstrate that grouping tasks according to the Chomsky hierarchy allows us to forecast whether certain architectures will be able to generalize to out-of-distribution inputs. This includes negative results where even extensive amounts of data and training time never led to any non-trivial generalization, despite models having sufficient capaci...
Deep learning has transformed computer vision, natural language processing, and speech recognition. ...
A key feature of intelligent behaviour is the ability to learn abstract strategies that scale and tr...
The power of human language and thought arises from systematic compositionality—the algebraic abilit...
This thesis presents a new theory of generalization in neural network types of learning machines. Th...
With a direct analysis of neural networks, this paper presents a mathematically tight generalization...
By making assumptions on the probability distribution of the potentials in a feed-forward neural net...
We present a unified framework for a number of different ways of failing to generalize properly. Du...
Over-parameterized neural models have become dominant in Natural Language Processing. Increasing the...
Generalization is a central aspect of learning theory. Here, we propose a framework that explores an...
For decades research has pursued the ambitious goal of designing computer models that learn to solve...
We describe a series of careful numerical experiments which measure the average generalization cap...
Artificial neural networks have become highly effective at performing specific, challenging tasks by...
Usually, generalization is considered as a function of learning from a set of examples. In the present w...
Recurrent Neural Networks (RNNs) are theoretically Turing-complete and established themselves as a d...