LG SCJan 22

Analyzing Neural Network Information Flow Using Differential Geometry

Shuhang Tan, Jayson Sia, Paul Bogdan, Radoslav Ivanov

arXiv:2601.16366v11.4h-index: 7

Originality Incremental advance

AI Analysis

This provides a tool for symbolic neural network analysis, such as robustness analysis or model repair, by offering a novel method to understand data flow, though it is incremental as it applies an existing graph theory concept to neural networks.

This paper tackles the problem of identifying important connections in neural networks for model analysis and repair by using graph curvature, specifically Ollivier-Ricci curvature, to rank edges based on importance. The results show that removing negative-curvature edges degrades performance, and the method identifies more unimportant edges than state-of-the-art pruning methods on image datasets like MNIST, CIFAR-10, and CIFAR-100.

This paper provides a fresh view of the neural network (NN) data flow problem, i.e., identifying the NN connections that are most important for the performance of the full model, through the lens of graph theory. Understanding the NN data flow provides a tool for symbolic NN analysis, e.g.,~robustness analysis or model repair. Unlike the standard approach to NN data flow analysis, which is based on information theory, we employ the notion of graph curvature, specifically Ollivier-Ricci curvature (ORC). The ORC has been successfully used to identify important graph edges in various domains such as road traffic analysis, biological and social networks. In particular, edges with negative ORC are considered bottlenecks and as such are critical to the graph's overall connectivity, whereas positive-ORC edges are not essential. We use this intuition for the case of NNs as well: we 1)~construct a graph induced by the NN structure and introduce the notion of neural curvature (NC) based on the ORC; 2)~calculate curvatures based on activation patterns for a set of input examples; 3)~aim to demonstrate that NC can indeed be used to rank edges according to their importance for the overall NN functionality. We evaluate our method through pruning experiments and show that removing negative-ORC edges quickly degrades the overall NN performance, whereas positive-ORC edges have little impact. The proposed method is evaluated on a variety of models trained on three image datasets, namely MNIST, CIFAR-10 and CIFAR-100. The results indicate that our method can identify a larger number of unimportant edges as compared to state-of-the-art pruning methods.

View on arXiv PDF

Similar