LGNEFeb 8, 2016

Binarized Neural Networks

arXiv:1602.02505v31334 citations
Originality Incremental advance
AI Analysis

This work addresses efficiency challenges in deploying neural networks on resource-constrained devices, offering a novel approach that is not incremental but introduces a new paradigm for model optimization.

The paper tackles the problem of reducing computational and memory costs in neural networks by introducing Binarized Neural Networks (BNNs) with binary weights and activations, achieving nearly state-of-the-art results on datasets like MNIST, CIFAR-10, and SVHN, and demonstrating a 7x speedup on MNIST with optimized GPU kernels without accuracy loss.

We introduce a method to train Binarized Neural Networks (BNNs) - neural networks with binary weights and activations at run-time and when computing the parameters' gradient at train-time. We conduct two sets of experiments, each based on a different framework, namely Torch7 and Theano, where we train BNNs on MNIST, CIFAR-10 and SVHN, and achieve nearly state-of-the-art results. During the forward pass, BNNs drastically reduce memory size and accesses, and replace most arithmetic operations with bit-wise operations, which might lead to a great increase in power-efficiency. Last but not least, we wrote a binary matrix multiplication GPU kernel with which it is possible to run our MNIST BNN 7 times faster than with an unoptimized GPU kernel, without suffering any loss in classification accuracy. The code for training and running our BNNs is available.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes