Coding schemes in neural networks learning classification tasks

arXiv:2406.16689v119 citations
Originality Incremental advance
AI Analysis

This work addresses the unclear nature of emergent representations in neural networks for researchers in machine learning theory, though it is incremental as it builds on prior findings in feature learning regimes.

The study investigated how neural networks develop internal representations, or 'coding schemes', during classification tasks, finding that the nature of these representations depends critically on neuronal nonlinearity, with linear networks showing analog coding and nonlinear ones exhibiting redundant or sparse coding due to symmetry breaking.

Neural networks posses the crucial ability to generate meaningful representations of task-dependent features. Indeed, with appropriate scaling, supervised learning in neural networks can result in strong, task-dependent feature learning. However, the nature of the emergent representations, which we call the `coding scheme', is still unclear. To understand the emergent coding scheme, we investigate fully-connected, wide neural networks learning classification tasks using the Bayesian framework where learning shapes the posterior distribution of the network weights. Consistent with previous findings, our analysis of the feature learning regime (also known as `non-lazy', `rich', or `mean-field' regime) shows that the networks acquire strong, data-dependent features. Surprisingly, the nature of the internal representations depends crucially on the neuronal nonlinearity. In linear networks, an analog coding scheme of the task emerges. Despite the strong representations, the mean predictor is identical to the lazy case. In nonlinear networks, spontaneous symmetry breaking leads to either redundant or sparse coding schemes. Our findings highlight how network properties such as scaling of weights and neuronal nonlinearity can profoundly influence the emergent representations.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes