NC LGMay 13

Feature Visualization Recovers Known Cortical Selectivity from TRIBE v2

arXiv:2605.139041.4

Predicted impact top 93% in NC · last 90 daysOriginality Incremental advance

AI Analysis

For neuroscientists and AI researchers, this provides a qualitative evaluation method for brain encoders, revealing whether they internalize functional brain organization beyond prediction accuracy.

The authors propose feature visualization as an interpretability technique for brain encoder models, demonstrating that it recovers known cortical selectivity (e.g., V1-V4 hierarchy, MT motion streaks, FFA faces, PPA lines) from TRIBE v2 with V-JEPA 2, and that optimized stimuli drive predicted regions ~4x more than natural images.

Brain encoder models predict cortical fMRI responses from the internal activations of pretrained vision and language networks, and are typically evaluated by held-out prediction accuracy. This is a useful signal for training but a poor one for interpretation: it tells us an encoder fits the data without telling us whether it has internalized the functional organization of the brain. We propose feature visualization -- gradient ascent on the encoder's predicted activation for a target region of interest (ROI) -- as a complementary interpretability technique, and apply it to TRIBE v2 composed with V-JEPA 2 (ViT-G, 40 layers), holding both frozen and synthesizing still images for seven regions spanning the ventral and dorsal visual hierarchies. Under identical hyperparameters, the probe recovers a visible progression of increasing spatial scale and feature complexity across V1 to V4, matching the ventral-stream hierarchy. It also produces three distinctive downstream regimes: radial "frozen-motion" streaks for the middle temporal area (MT) despite static-only optimization, face-like features for the fusiform face area (FFA), and consistent rectilinear line patterns for the parahippocampal place area (PPA). Optimized FFA stimuli drive the predicted region ~4x as much as a natural face photograph, consistent with feature visualization producing adversarial super-stimuli rather than canonical exemplars. The probe is simple, differentiable, and applicable to any brain encoder with a differentiable backbone, allowing for qualitative evaluation of brain encoders.

View on arXiv PDF

Similar