Recent work has shown potential in using Mixed Integer Programming (MIP) solvers to optimize certain aspects of neural networks (NNs). However the intriguing approach of training NNs with MIP solvers is under-explored. State-of-the-art-methods to train NNs are typically gradient-based and require significant data, computation on GPUs, and extensive hyper-parameter tuning. In contrast, training with MIP solvers does not require GPUs or heavy hyper-parameter tuning, but currently cannot handle anything but small amounts of data. This article builds on recent advances that train binarized NNs using MIP solvers. We go beyond current work by formulating new MIP models which improve training efficiency and which can train the important class of i...
In this paper, low end Digital Signal Processors (DSPs) are applied to accelerate integer neural net...
In this work we present neural network train-ing algorithms, which are based on the differ-ential ev...
We consider the algorithmic problem of finding the optimal weights and biases for a two-layer fully ...
Recent work has shown potential in using Mixed Integer Programming (MIP) solvers to optimize certain...
Artificial Neural Networks (ANNs) are prevalent machine learning models that are applied across vari...
Mixed-integer programming (MIP) technology offers a generic way of formulating and solving combinato...
The literature has shown how to optimize and analyze the parameters of different types of neural net...
Discrete black-box optimization problems are challenging for model-based optimization (MBO) algorith...
Large Neighborhood Search (LNS) is a combinatorial optimization heuristic that starts with an assign...
Recent deep learning models are difficult to train using a large batch size, because commodity machi...
Spiking neural networks (SNNs) have achieved orders of magnitude improvement in terms of energy cons...
We consider the algorithmic problem of finding the optimal weights and biases for a two-layer fully ...
Due to limited size, cost and power, embedded devices do not offer the same computational throughput...
The conventional multilayer feedforward network having continuous-weights is expensive to implement...
End-to-end training of neural network solvers for graph combinatorial optimization problems such as ...
In this paper, low end Digital Signal Processors (DSPs) are applied to accelerate integer neural net...
In this work we present neural network train-ing algorithms, which are based on the differ-ential ev...
We consider the algorithmic problem of finding the optimal weights and biases for a two-layer fully ...
Recent work has shown potential in using Mixed Integer Programming (MIP) solvers to optimize certain...
Artificial Neural Networks (ANNs) are prevalent machine learning models that are applied across vari...
Mixed-integer programming (MIP) technology offers a generic way of formulating and solving combinato...
The literature has shown how to optimize and analyze the parameters of different types of neural net...
Discrete black-box optimization problems are challenging for model-based optimization (MBO) algorith...
Large Neighborhood Search (LNS) is a combinatorial optimization heuristic that starts with an assign...
Recent deep learning models are difficult to train using a large batch size, because commodity machi...
Spiking neural networks (SNNs) have achieved orders of magnitude improvement in terms of energy cons...
We consider the algorithmic problem of finding the optimal weights and biases for a two-layer fully ...
Due to limited size, cost and power, embedded devices do not offer the same computational throughput...
The conventional multilayer feedforward network having continuous-weights is expensive to implement...
End-to-end training of neural network solvers for graph combinatorial optimization problems such as ...
In this paper, low end Digital Signal Processors (DSPs) are applied to accelerate integer neural net...
In this work we present neural network train-ing algorithms, which are based on the differ-ential ev...
We consider the algorithmic problem of finding the optimal weights and biases for a two-layer fully ...