LGNEMay 12, 2023

$\partial\mathbb{B}$ nets: learning discrete functions by gradient descent

arXiv:2305.07315v1
Originality Incremental advance
AI Analysis

This provides a method for creating compact and interpretable neural networks for machine learning applications, though it appears incremental as it builds on neural network binarization approaches.

The paper tackles the problem of learning discrete boolean-valued functions by gradient descent, introducing $\partial\mathbb{B}$ nets that achieve comparable performance to standard methods while using compact 1-bit weights and offering interpretability through logical functions.

$\partial\mathbb{B}$ nets are differentiable neural networks that learn discrete boolean-valued functions by gradient descent. $\partial\mathbb{B}$ nets have two semantically equivalent aspects: a differentiable soft-net, with real weights, and a non-differentiable hard-net, with boolean weights. We train the soft-net by backpropagation and then `harden' the learned weights to yield boolean weights that bind with the hard-net. The result is a learned discrete function. `Hardening' involves no loss of accuracy, unlike existing approaches to neural network binarization. Preliminary experiments demonstrate that $\partial\mathbb{B}$ nets achieve comparable performance on standard machine learning problems yet are compact (due to 1-bit weights) and interpretable (due to the logical nature of the learnt functions).

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes