CVJun 15, 2023

Rosetta Neurons: Mining the Common Units in a Model Zoo

Berkeley
arXiv:2306.09346v249 citationsh-index: 111
AI Analysis

This addresses the problem of understanding shared representations in AI models for researchers, though it is incremental in exploring model interpretability.

The paper demonstrates the existence of 'Rosetta Neurons', common features across diverse neural networks for vision tasks, enabling model-to-model translation for manipulations like cross-class alignments without specialized training.

Do different neural networks, trained for various vision tasks, share some common representations? In this paper, we demonstrate the existence of common features we call "Rosetta Neurons" across a range of models with different architectures, different tasks (generative and discriminative), and different types of supervision (class-supervised, text-supervised, self-supervised). We present an algorithm for mining a dictionary of Rosetta Neurons across several popular vision models: Class Supervised-ResNet50, DINO-ResNet50, DINO-ViT, MAE, CLIP-ResNet50, BigGAN, StyleGAN-2, StyleGAN-XL. Our findings suggest that certain visual concepts and structures are inherently embedded in the natural world and can be learned by different models regardless of the specific task or architecture, and without the use of semantic labels. We can visualize shared concepts directly due to generative models included in our analysis. The Rosetta Neurons facilitate model-to-model translation enabling various inversion-based manipulations, including cross-class alignments, shifting, zooming, and more, without the need for specialized training.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes