CVAug 7, 2024

PushPull-Net: Inhibition-driven ResNet robust to image corruptions

arXiv:2408.04077v24 citationsh-index: 27
AI Analysis

This work addresses robustness issues for image classification models, particularly in handling corrupted data, though it appears incremental as it builds on existing ResNet frameworks.

The paper tackled the problem of improving neural network robustness to image corruptions by introducing a novel computational unit, PushPull-Conv, in ResNet architectures, resulting in a new benchmark with an mCE of 49.95% on ImageNet-C when combined with PRIME augmentation.

We introduce a novel computational unit, termed PushPull-Conv, in the first layer of a ResNet architecture, inspired by the anti-phase inhibition phenomenon observed in the primary visual cortex. This unit redefines the traditional convolutional layer by implementing a pair of complementary filters: a trainable push kernel and its counterpart, the pull kernel. The push kernel (analogous to traditional convolution) learns to respond to specific stimuli, while the pull kernel reacts to the same stimuli but of opposite contrast. This configuration enhances stimulus selectivity and effectively inhibits response in regions lacking preferred stimuli. This effect is attributed to the push and pull kernels, which produce responses of comparable magnitude in such regions, thereby neutralizing each other. The incorporation of the PushPull-Conv into ResNets significantly increases their robustness to image corruption. Our experiments with benchmark corruption datasets show that the PushPull-Conv can be combined with other data augmentation techniques to further improve model robustness. We set a new robustness benchmark on ResNet50 achieving an $mCE$ of 49.95$\%$ on ImageNet-C when combining PRIME augmentation with PushPull inhibition.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes