LGARDCMLNov 28, 2025

Accelerated Execution of Bayesian Neural Networks using a Single Probabilistic Forward Pass and Code Generation

arXiv:2511.23440v1
Originality Highly original
AI Analysis

This enables efficient deployment of BNNs on resource-constrained embedded systems for safety-critical applications where uncertainty handling is essential.

The paper tackles the high computational cost of Bayesian neural networks (BNNs) by proposing a Probabilistic Forward Pass (PFP) method that replaces sampling with a single deterministic forward pass, achieving speedups of up to 4200x while matching traditional BNNs in accuracy, uncertainty estimation, and out-of-domain detection on Dirty-MNIST.

Machine learning models perform well across domains such as diagnostics, weather forecasting, NLP, and autonomous driving, but their limited uncertainty handling restricts use in safety-critical settings. Traditional neural networks often fail to detect out-of-domain (OOD) data and may output confident yet incorrect predictions. Bayesian neural networks (BNNs) address this by providing probabilistic estimates, but incur high computational cost because predictions require sampling weight distributions and multiple forward passes. The Probabilistic Forward Pass (PFP) offers a highly efficient approximation to Stochastic Variational Inference (SVI) by assuming Gaussian-distributed weights and activations, enabling fully analytic uncertainty propagation and replacing sampling with a single deterministic forward pass. We present an end-to-end pipeline for training, compiling, optimizing, and deploying PFP-based BNNs on embedded ARM CPUs. Using the TVM deep learning compiler, we implement a dedicated library of Gaussian-propagating operators for multilayer perceptrons and convolutional neural networks, combined with manual and automated tuning strategies. Ablation studies show that PFP consistently outperforms SVI in computational efficiency, achieving speedups of up to 4200x for small mini-batches. PFP-BNNs match SVI-BNNs on Dirty-MNIST in accuracy, uncertainty estimation, and OOD detection while greatly reducing compute cost. These results highlight the potential of combining Bayesian approximations with code generation to enable efficient BNN deployment on resource-constrained systems.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes