LGJun 10, 2022

Training Neural Networks using SAT solvers

arXiv:2206.04833v11.8h-index: 31

Originality Incremental advance

AI Analysis

This addresses a fundamental limitation in neural network training for specific mathematical functions where gradient methods fail, though it appears incremental for broader applications.

The authors tackled the problem of gradient-based optimization getting stuck in local optima for certain learning tasks like parity functions by proposing a global optimization method using SAT solvers to train neural networks. They demonstrated effectiveness against ADAM optimizer in parity learning tasks, though performance was unsatisfactory on MNIST image classification.

We propose an algorithm to explore the global optimization method, using SAT solvers, for training a neural net. Deep Neural Networks have achieved great feats in tasks like-image recognition, speech recognition, etc. Much of their success can be attributed to the gradient-based optimisation methods, which scale well to huge datasets while still giving solutions, better than any other existing methods. However, there exist learning problems like the parity function and the Fast Fourier Transform, where a neural network using gradient-based optimisation algorithm can not capture the underlying structure of the learning task properly. Thus, exploring global optimisation methods is of utmost interest as the gradient-based methods get stuck in local optima. In the experiments, we demonstrate the effectiveness of our algorithm against the ADAM optimiser in certain tasks like parity learning. However, in the case of image classification on the MNIST Dataset, the performance of our algorithm was less than satisfactory. We further discuss the role of the size of the training dataset and the hyper-parameter settings in keeping things scalable for a SAT solver.

View on arXiv PDF

Similar