LG CV MLOct 15, 2019

The Local Elasticity of Neural Networks

arXiv:1910.06943v218.053 citationsHas Code

Originality Incremental advance

AI Analysis

This work provides insights into the behavior of neural networks, potentially aiding in understanding their generalization and robustness, though it is incremental in building on existing concepts like the neural tangent kernel.

The paper introduces 'local elasticity', a phenomenon where neural network predictions at dissimilar feature vectors remain stable after SGD updates, and demonstrates it through simulations on real and synthetic datasets, showing it does not occur in linear classifiers.

This paper presents a phenomenon in neural networks that we refer to as \textit{local elasticity}. Roughly speaking, a classifier is said to be locally elastic if its prediction at a feature vector $\bx'$ is \textit{not} significantly perturbed, after the classifier is updated via stochastic gradient descent at a (labeled) feature vector $\bx$ that is \textit{dissimilar} to $\bx'$ in a certain sense. This phenomenon is shown to persist for neural networks with nonlinear activation functions through extensive simulations on real-life and synthetic datasets, whereas this is not observed in linear classifiers. In addition, we offer a geometric interpretation of local elasticity using the neural tangent kernel \citep{jacot2018neural}. Building on top of local elasticity, we obtain pairwise similarity measures between feature vectors, which can be used for clustering in conjunction with $K$-means. The effectiveness of the clustering algorithm on the MNIST and CIFAR-10 datasets in turn corroborates the hypothesis of local elasticity of neural networks on real-life data. Finally, we discuss some implications of local elasticity to shed light on several intriguing aspects of deep neural networks.

View on arXiv PDF Code

Similar