CVFeb 8, 2022

Binary Neural Networks as a general-propose compute paradigm for on-device computer vision

arXiv:2202.03716v11 citations
Originality Highly original
AI Analysis

This work addresses the problem of efficient on-device computer vision for mobile and embedded systems, offering a potentially transformative solution with broad applicability across tasks like classification and detection.

The paper tackles the challenge of making binary neural networks (BNNs) a mainstream on-device computer vision algorithm by proposing a framework that achieves a superior speed-vs-accuracy tradeoff compared to 8-bit quantization, with BNNs retaining accuracy levels and showing 1.3-2.4× faster FPS on mobile CPUs and 2.8-7× fewer execution cycles on AI accelerators.

For binary neural networks (BNNs) to become the mainstream on-device computer vision algorithm, they must achieve a superior speed-vs-accuracy tradeoff than 8-bit quantization and establish a similar degree of general applicability in vision tasks. To this end, we propose a BNN framework comprising 1) a minimalistic inference scheme for hardware-friendliness, 2) an over-parameterized training scheme for high accuracy, and 3) a simple procedure to adapt to different vision tasks. The resultant framework overtakes 8-bit quantization in the speed-vs-accuracy tradeoff for classification, detection, segmentation, super-resolution and matching: our BNNs not only retain the accuracy levels of their 8-bit baselines but also showcase 1.3-2.4$\times$ faster FPS on mobile CPUs. Similar conclusions can be drawn for prototypical systolic-array-based AI accelerators, where our BNNs promise 2.8-7$\times$ fewer execution cycles than 8-bit and 2.1-2.7$\times$ fewer cycles than alternative BNN designs. These results suggest that the time for large-scale BNN adoption could be upon us.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes