NCLGMay 27, 2022

Prune and distill: similar reformatting of image information along rat visual cortex and deep neural networks

arXiv:2205.13816v110 citationsh-index: 55
Originality Synthesis-oriented
AI Analysis

This work identifies shared computational principles between artificial and biological vision systems, which is incremental as it builds on prior observations in both fields.

The study investigated common principles in visual information reformatting between deep convolutional neural networks (CNNs) and the rat ventral stream, finding that both systems exhibit similar expansion-contraction phases in intrinsic dimensionality and training-induced distillation and pruning of image information to support invariant discrimination.

Visual object recognition has been extensively studied in both neuroscience and computer vision. Recently, the most popular class of artificial systems for this task, deep convolutional neural networks (CNNs), has been shown to provide excellent models for its functional analogue in the brain, the ventral stream in visual cortex. This has prompted questions on what, if any, are the common principles underlying the reformatting of visual information as it flows through a CNN or the ventral stream. Here we consider some prominent statistical patterns that are known to exist in the internal representations of either CNNs or the visual cortex and look for them in the other system. We show that intrinsic dimensionality (ID) of object representations along the rat homologue of the ventral stream presents two distinct expansion-contraction phases, as previously shown for CNNs. Conversely, in CNNs, we show that training results in both distillation and active pruning (mirroring the increase in ID) of low- to middle-level image information in single units, as representations gain the ability to support invariant discrimination, in agreement with previous observations in rat visual cortex. Taken together, our findings suggest that CNNs and visual cortex share a similarly tight relationship between dimensionality expansion/reduction of object representations and reformatting of image information.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes