LGIVOct 8, 2021

Dynamic Binary Neural Network by learning channel-wise thresholds

arXiv:2110.05185v123 citations
Originality Incremental advance
AI Analysis

This work addresses performance limitations in binary neural networks for efficient deployment on portable devices, representing an incremental improvement over existing methods.

The paper tackled the problem of binary neural networks' sensitivity to activation distribution by introducing dynamic learnable channel-wise thresholds, achieving top-1 accuracy improvements of 1.8% and 1.5% on ImageNet with MobileNetV1 and ResNet18 backbones, respectively.

Binary neural networks (BNNs) constrain weights and activations to +1 or -1 with limited storage and computational cost, which is hardware-friendly for portable devices. Recently, BNNs have achieved remarkable progress and been adopted into various fields. However, the performance of BNNs is sensitive to activation distribution. The existing BNNs utilized the Sign function with predefined or learned static thresholds to binarize activations. This process limits representation capacity of BNNs since different samples may adapt to unequal thresholds. To address this problem, we propose a dynamic BNN (DyBNN) incorporating dynamic learnable channel-wise thresholds of Sign function and shift parameters of PReLU. The method aggregates the global information into the hyper function and effectively increases the feature expression ability. The experimental results prove that our method is an effective and straightforward way to reduce information loss and enhance performance of BNNs. The DyBNN based on two backbones of ReActNet (MobileNetV1 and ResNet18) achieve 71.2% and 67.4% top1-accuracy on ImageNet dataset, outperforming baselines by a large margin (i.e., 1.8% and 1.5% respectively).

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes