CV IVMar 15

Make it SING: Analyzing Semantic Invariants in Classifiers

Harel Yadid, Meir Yossef Levi, Roy Betser, Guy Gilboa

arXiv:2603.1461025.4h-index: 5

AI Analysis

This addresses the challenge of interpreting classifier invariants for researchers and practitioners in machine learning, offering a novel analysis tool.

The paper tackles the problem of understanding semantic invariants in classifiers by introducing SING, a method that maps network features to vision-language models to provide natural language descriptions and visual examples of semantic shifts, revealing that ResNet50 leaks semantic attributes while DinoViT better maintains class semantics.

All classifiers, including state-of-the-art vision models, possess invariants, partially rooted in the geometry of their linear mappings. These invariants, which reside in the null-space of the classifier, induce equivalent sets of inputs that map to identical outputs. The semantic content of these invariants remains vague, as existing approaches struggle to provide human-interpretable information. To address this gap, we present Semantic Interpretation of the Null-space Geometry (SING), a method that constructs equivalent images, with respect to the network, and assigns semantic interpretations to the available variations. We use a mapping from network features to multi-modal vision language models. This allows us to obtain natural language descriptions and visual examples of the induced semantic shifts. SING can be applied to a single image, uncovering local invariants, or to sets of images, allowing a breadth of statistical analysis at the class and model levels. For example, our method reveals that ResNet50 leaks relevant semantic attributes to the null space, whereas DinoViT, a ViT pretrained with self-supervised DINO, is superior in maintaining class semantics across the invariant space.

View on arXiv PDF

Similar