LGATMLAug 9, 2023

Decorrelating neurons using persistence

arXiv:2308.04870v12 citationsh-index: 70
Originality Incremental advance
AI Analysis

This work addresses generalization issues in deep learning models for tasks like classification and regression, offering a novel regularization approach, though it appears incremental as it builds on existing topological persistence concepts.

The authors tackled the problem of improving generalization in deep learning by reducing high correlations between neurons, proposing two novel regularization terms based on topological persistence that outperform existing methods and achieve higher accuracies than naive correlation minimization.

We propose a novel way to improve the generalisation capacity of deep learning models by reducing high correlations between neurons. For this, we present two regularisation terms computed from the weights of a minimum spanning tree of the clique whose vertices are the neurons of a given network (or a sample of those), where weights on edges are correlation dissimilarities. We provide an extensive set of experiments to validate the effectiveness of our terms, showing that they outperform popular ones. Also, we demonstrate that naive minimisation of all correlations between neurons obtains lower accuracies than our regularisation terms, suggesting that redundancies play a significant role in artificial neural networks, as evidenced by some studies in neuroscience for real networks. We include a proof of differentiability of our regularisers, thus developing the first effective topological persistence-based regularisation terms that consider the whole set of neurons and that can be applied to a feedforward architecture in any deep learning task such as classification, data generation, or regression.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes