LGSep 30, 2023

Mitigating the Effect of Incidental Correlations on Part-based Learning

Gaurav Bhatt, Deepayan Das, Leonid Sigal, Vineeth N Balasubramanian

arXiv:2310.00377v19.87 citationsh-index: 9Has Code

Originality Incremental advance

AI Analysis

This work addresses a key limitation in part-based learning for AI systems, improving interpretability and generalization with limited data, though it is incremental as it builds on existing regularization techniques.

The paper tackles the problem of incidental correlations in part-based learning, which hinder generalization and interpretability, by introducing two regularization methods that achieve state-of-the-art performance on few-shot learning benchmarks like MiniImagenet, TieredImageNet, and FC100, and show improved generalization under domain shifts and data corruption on ImageNet-9.

Intelligent systems possess a crucial characteristic of breaking complicated problems into smaller reusable components or parts and adjusting to new tasks using these part representations. However, current part-learners encounter difficulties in dealing with incidental correlations resulting from the limited observations of objects that may appear only in specific arrangements or with specific backgrounds. These incidental correlations may have a detrimental impact on the generalization and interpretability of learned part representations. This study asserts that part-based representations could be more interpretable and generalize better with limited data, employing two innovative regularization methods. The first regularization separates foreground and background information's generative process via a unique mixture-of-parts formulation. Structural constraints are imposed on the parts using a weakly-supervised loss, guaranteeing that the mixture-of-parts for foreground and background entails soft, object-agnostic masks. The second regularization assumes the form of a distillation loss, ensuring the invariance of the learned parts to the incidental background correlations. Furthermore, we incorporate sparse and orthogonal constraints to facilitate learning high-quality part representations. By reducing the impact of incidental background correlations on the learned parts, we exhibit state-of-the-art (SoTA) performance on few-shot learning tasks on benchmark datasets, including MiniImagenet, TieredImageNet, and FC100. We also demonstrate that the part-based representations acquired through our approach generalize better than existing techniques, even under domain shifts of the background and common data corruption on the ImageNet-9 dataset. The implementation is available on GitHub: https://github.com/GauravBh1010tt/DPViT.git

View on arXiv PDF Code

Similar