CVMar 21

IBCapsNet: Information Bottleneck Capsule Network for Noise-Robust Representation Learning

arXiv:2603.2068216.3h-index: 5Has Code

AI Analysis

This work addresses robustness and efficiency issues in capsule networks for computer vision tasks, offering an incremental improvement with specific gains in noise handling and speed.

The paper tackled the problems of high computational cost and poor robustness in capsule networks by proposing IBCapsNet, which uses an information bottleneck principle and variational aggregation, resulting in matching clean-data accuracy (e.g., 99.41% on MNIST) while improving noise robustness by up to +17.10% and achieving 2.54x faster training.

Capsule networks (CapsNets) are superior at modeling hierarchical spatial relationships but suffer from two critical limitations: high computational cost due to iterative dynamic routing and poor robustness under input corruptions. To address these issues, we propose IBCapsNet, a novel capsule architecture grounded in the Information Bottleneck (IB) principle. Instead of iterative routing, IBCapsNet employs a one-pass variational aggregation mechanism, where primary capsules are first compressed into a global context representation and then processed by class-specific variational autoencoders (VAEs) to infer latent capsules regularized by the KL divergence. This design enables efficient inference while inherently filtering out noise. Experiments on MNIST, Fashion-MNIST, SVHN and CIFAR-10 show that IBCapsNet matches CapsNet in clean-data accuracy (achieving 99.41% on MNIST and 92.01% on SVHN), yet significantly outperforms it under four types of synthetic noise - demonstrating average improvements of +17.10% and +14.54% for clamped additive and multiplicative noise, respectively. Moreover, IBCapsNet achieves 2.54x faster training and 3.64x higher inference throughput compared to CapsNet, while reducing model parameters by 4.66%. Our work bridges information-theoretic representation learning with capsule networks, offering a principled path toward robust, efficient, and interpretable deep models. Code is available at https://github.com/cxiang26/IBCapsnet

View on arXiv PDF Code

Similar