LG AT MLAug 9, 2023

Decorrelating neurons using persistence

Rubén Ballester, Carles Casacuberta, Sergio Escalera

arXiv:2308.04870v13.82 citationsh-index: 70Has Code

Originality Incremental advance

AI Analysis

This work addresses generalization issues in deep learning models for tasks like classification and regression, offering a novel regularization approach, though it appears incremental as it builds on existing topological persistence concepts.

The authors tackled the problem of improving generalization in deep learning by reducing high correlations between neurons, proposing two novel regularization terms based on topological persistence that outperform existing methods and achieve higher accuracies than naive correlation minimization.

We propose a novel way to improve the generalisation capacity of deep learning models by reducing high correlations between neurons. For this, we present two regularisation terms computed from the weights of a minimum spanning tree of the clique whose vertices are the neurons of a given network (or a sample of those), where weights on edges are correlation dissimilarities. We provide an extensive set of experiments to validate the effectiveness of our terms, showing that they outperform popular ones. Also, we demonstrate that naive minimisation of all correlations between neurons obtains lower accuracies than our regularisation terms, suggesting that redundancies play a significant role in artificial neural networks, as evidenced by some studies in neuroscience for real networks. We include a proof of differentiability of our regularisers, thus developing the first effective topological persistence-based regularisation terms that consider the whole set of neurons and that can be applied to a feedforward architecture in any deep learning task such as classification, data generation, or regression.

View on arXiv PDF Code

Similar