LGSep 22, 2024

From Lazy to Rich: Exact Learning Dynamics in Deep Linear Networks

arXiv:2409.14623v244 citationsh-index: 32
Originality Incremental advance
AI Analysis

This provides incremental theoretical insights into learning regimes, relevant for neuroscience and applications like continual learning.

The paper tackled how weight initialization influences learning dynamics in deep linear neural networks, deriving exact solutions for lambda-balanced initializations that capture representation evolution from rich to lazy regimes.

Biological and artificial neural networks develop internal representations that enable them to perform complex tasks. In artificial networks, the effectiveness of these models relies on their ability to build task specific representation, a process influenced by interactions among datasets, architectures, initialization strategies, and optimization algorithms. Prior studies highlight that different initializations can place networks in either a lazy regime, where representations remain static, or a rich/feature learning regime, where representations evolve dynamically. Here, we examine how initialization influences learning dynamics in deep linear neural networks, deriving exact solutions for lambda-balanced initializations-defined by the relative scale of weights across layers. These solutions capture the evolution of representations and the Neural Tangent Kernel across the spectrum from the rich to the lazy regimes. Our findings deepen the theoretical understanding of the impact of weight initialization on learning regimes, with implications for continual learning, reversal learning, and transfer learning, relevant to both neuroscience and practical applications.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes