LGDSCDOct 19, 2021

Expressivity of Neural Networks via Chaotic Itineraries beyond Sharkovsky's Theorem

arXiv:2110.10295v11 citations
Originality Highly original
AI Analysis

This work addresses expressivity tradeoffs in neural networks for researchers in machine learning theory, providing foundational insights beyond prior incremental analyses.

The paper tackles the problem of determining the required size of neural networks to approximate target functions, showing that considering chaotic itineraries instead of just periodic points leads to stronger exponential depth-width tradeoffs, with nearly-optimal bounds that tighten as period increases and handle constant L1 error.

Given a target function $f$, how large must a neural network be in order to approximate $f$? Recent works examine this basic question on neural network \textit{expressivity} from the lens of dynamical systems and provide novel ``depth-vs-width'' tradeoffs for a large family of functions $f$. They suggest that such tradeoffs are governed by the existence of \textit{periodic} points or \emph{cycles} in $f$. Our work, by further deploying dynamical systems concepts, illuminates a more subtle connection between periodicity and expressivity: we prove that periodic points alone lead to suboptimal depth-width tradeoffs and we improve upon them by demonstrating that certain ``chaotic itineraries'' give stronger exponential tradeoffs, even in regimes where previous analyses only imply polynomial gaps. Contrary to prior works, our bounds are nearly-optimal, tighten as the period increases, and handle strong notions of inapproximability (e.g., constant $L_1$ error). More broadly, we identify a phase transition to the \textit{chaotic regime} that exactly coincides with an abrupt shift in other notions of function complexity, including VC-dimension and topological entropy.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes