Scaling model capacity has been vital in the success of deep learning. For a typical network, necessary compute resources and training time grow dramatically with model size. Conditional computation is a promising way to increase the number of parameters with a relatively small increase in resources. We propose a training algorithm that flexibly chooses neural modules based on the data to be processed. Both the decomposition and modules are learned end-to-end. In contrast to existing approaches, training does not rely on regularization to enforce diversity in module use. We apply modular networks both to image recognition and language modeling tasks, where we achieve superior performance compared to several baselines. Introspection reveals ...
Abstract—A novel modular perceptron network (MPN) and divide-and-conquer learning (DCL) schemes for ...
A long-standing goal in artificial intelligence is creating agents that can learn a variety of diffe...
Artificial neural networks are deep machine learning models that excel at complex artificial intelli...
Problem description. The learning of monolithic neural networks becomes harder with growing network ...
Neural networks that learn the What and Where task perform better if they possess a modular architec...
Artificial intelligence is a kind of technology that simulates human intelligence. It uses computer...
International audienceA long-standing goal in artificial intelligence is creating agents that can le...
Pattern recognition has become an accessible tool in developing advanced adaptive products. The need...
The brain can be viewed as a complex modular structure with features of information processing throu...
We present a neural network architecture and a training algorithm designed to enable very rapid trai...
AbstractLearning of large-scale neural networks suffers from computational cost and the local minima...
As the size of deep learning models continues to grow, finding optimal models under memory and compu...
Modular neural networks (MNNs) are increasingly popular models for dealing with complex problems con...
One of the obstacles that hinder the development of Artificial Neural Networks (ANNs) is the heavy c...
Modular connectionist systems comprise autonomous, communicating modules, achieving a behaviour more...
Abstract—A novel modular perceptron network (MPN) and divide-and-conquer learning (DCL) schemes for ...
A long-standing goal in artificial intelligence is creating agents that can learn a variety of diffe...
Artificial neural networks are deep machine learning models that excel at complex artificial intelli...
Problem description. The learning of monolithic neural networks becomes harder with growing network ...
Neural networks that learn the What and Where task perform better if they possess a modular architec...
Artificial intelligence is a kind of technology that simulates human intelligence. It uses computer...
International audienceA long-standing goal in artificial intelligence is creating agents that can le...
Pattern recognition has become an accessible tool in developing advanced adaptive products. The need...
The brain can be viewed as a complex modular structure with features of information processing throu...
We present a neural network architecture and a training algorithm designed to enable very rapid trai...
AbstractLearning of large-scale neural networks suffers from computational cost and the local minima...
As the size of deep learning models continues to grow, finding optimal models under memory and compu...
Modular neural networks (MNNs) are increasingly popular models for dealing with complex problems con...
One of the obstacles that hinder the development of Artificial Neural Networks (ANNs) is the heavy c...
Modular connectionist systems comprise autonomous, communicating modules, achieving a behaviour more...
Abstract—A novel modular perceptron network (MPN) and divide-and-conquer learning (DCL) schemes for ...
A long-standing goal in artificial intelligence is creating agents that can learn a variety of diffe...
Artificial neural networks are deep machine learning models that excel at complex artificial intelli...