LG CVNov 5, 2023

Hierarchical Simplicity Bias of Neural Networks

arXiv:2311.02622v2h-index: 2

Originality Incremental advance

AI Analysis

This work addresses the problem of understanding implicit biases in neural networks for researchers, revealing incremental insights into how networks learn features hierarchically.

The paper investigates hierarchical simplicity bias in neural networks, showing that networks prioritize simpler features over more complex ones, even when both are equally predictive, and demonstrates that last-layer retraining fails to recover core features when spurious features correlate perfectly with labels in synthetic datasets.

Neural networks often exhibit simplicity bias, favoring simpler features over more complex ones, even when both are equally predictive. We introduce a novel method called imbalanced label coupling to explore and extend this simplicity bias across multiple hierarchical levels. Our approach demonstrates that trained networks sequentially consider features of increasing complexity based on their correlation with labels in the training set, regardless of their actual predictive power. For example, in CIFAR-10, simple spurious features can cause misclassifications where most cats are predicted as dogs and most trucks as automobiles. We empirically show that last-layer retraining with target data distribution \citep{kirichenko2022last} is insufficient to fully recover core features when spurious features perfectly correlate with target labels in our synthetic datasets. Our findings deepen the understanding of the implicit biases inherent in neural networks.

View on arXiv PDF

Similar