CVAIApr 8

Training Deep Visual Networks Beyond Loss and Accuracy Through a Dynamical Systems Approach

arXiv:2604.0971610.3h-index: 5
Predicted impact top 99% in CV · last 90 daysOriginality Incremental advance
AI Analysis

For researchers studying deep learning training dynamics, this provides an exploratory framework to understand internal representation changes beyond standard loss/accuracy metrics.

This paper introduces dynamical systems measures (integration, metastability, stability index) from layer activations to monitor internal representation changes during training of deep visual networks. Applied to nine model-dataset combinations, the measures distinguish CIFAR-10 from CIFAR-100, may signal convergence before accuracy plateaus, and reveal different training behaviors.

Deep visual recognition models are usually trained and evaluated using metrics such as loss and accuracy. While these measures show whether a model is improving, they reveal very little about how its internal representations change during training. This paper introduces a complementary way to study that process by examining training through the lens of dynamical systems. Drawing on ideas from signal analysis originally used to study biological neural activity, we define three measures from layer activations collected across training epochs: an integration score that reflects long-range coordination across layers, a metastability score that captures how flexibly the network shifts between more and less synchronised states, and a combined dynamical stability index. We apply this framework to nine combinations of model architecture and dataset, including several ResNet variants, DenseNet-121, MobileNetV2, VGG-16, and a pretrained Vision Transformer on CIFAR-10 and CIFAR-100. The results suggest three main patterns. First, the integration measure consistently distinguishes the easier CIFAR-10 setting from the more difficult CIFAR-100 setting. Second, changes in the volatility of the stability index may provide an early sign of convergence before accuracy fully plateaus. Third, the relationship between integration and metastability appears to reflect different styles of training behaviour. Overall, this study offers an exploratory but promising new way to understand deep visual training beyond loss and accuracy.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes