Basic binary relations such as equality and inequality are fundamental to relational data structures. Neural networks should learn such relations and generalise to new unseen data. We show in this study, however, that this generalisation fails with standard feed-forward networks on binary vectors. Even when trained with maximal training data, standard networks do not reliably detect equality. We introduce differential rectifier (DR) units that we add to the network in different configurations. The DR units create an inductive bias in the networks, so that they do learn to generalise, even from small numbers of examples and we have not found any negative effect of their inclusion in the network. Given the fundamental nature of these relat...
This research demonstrates a method of discriminating the numerical relationships of neural network ...
Recurrent neural networks are capable of learning context-free tasks, however learning performance i...
We study the structural and statistical properties of $\mathcal{R}$-norm minimizing interpolants of ...
Many researchers implicitly assume that neural networks learn relations and generalise them to new u...
While modern deep neural architectures generalise well when test data is sampled from the same distr...
Learning abstract and systematic relations has been an open issue in neural network learning for ove...
Animals receive noisy and incomplete information, from which we must learn how to react in novel sit...
Relational data is ubiquitous in modern-day computing, and drives several key applications across mu...
The ability to learn abstractions and generalise is seen as the essence of human intelligence.7 Sinc...
In this paper, we show that standard feed-forward and recurrent neural networks fail to learn abstra...
Despite enormous progress in machine learning, artificial neural networks still lag behind brains in...
Deep neural networks have been widely used for various applications and have produced state-of-the-a...
We show that deep networks trained to satisfy demographic parity often do so through a form of race ...
Current deep learning approaches have shown good in-distribution generalization performance, but str...
This electronic version was submitted by the student author. The certified thesis is available in th...
This research demonstrates a method of discriminating the numerical relationships of neural network ...
Recurrent neural networks are capable of learning context-free tasks, however learning performance i...
We study the structural and statistical properties of $\mathcal{R}$-norm minimizing interpolants of ...
Many researchers implicitly assume that neural networks learn relations and generalise them to new u...
While modern deep neural architectures generalise well when test data is sampled from the same distr...
Learning abstract and systematic relations has been an open issue in neural network learning for ove...
Animals receive noisy and incomplete information, from which we must learn how to react in novel sit...
Relational data is ubiquitous in modern-day computing, and drives several key applications across mu...
The ability to learn abstractions and generalise is seen as the essence of human intelligence.7 Sinc...
In this paper, we show that standard feed-forward and recurrent neural networks fail to learn abstra...
Despite enormous progress in machine learning, artificial neural networks still lag behind brains in...
Deep neural networks have been widely used for various applications and have produced state-of-the-a...
We show that deep networks trained to satisfy demographic parity often do so through a form of race ...
Current deep learning approaches have shown good in-distribution generalization performance, but str...
This electronic version was submitted by the student author. The certified thesis is available in th...
This research demonstrates a method of discriminating the numerical relationships of neural network ...
Recurrent neural networks are capable of learning context-free tasks, however learning performance i...
We study the structural and statistical properties of $\mathcal{R}$-norm minimizing interpolants of ...