LGMLDec 11, 2024

GLL: A Differentiable Graph Learning Layer for Neural Networks

arXiv:2412.08016v1h-index: 4
Originality Incremental advance
AI Analysis

This work addresses the need for better integration of graph-based learning in neural networks for classification tasks, though it is incremental as it builds on prior heuristic combinations.

The authors tackled the problem of standard deep learning architectures not leveraging relational information between samples by integrating a differentiable graph learning layer into neural networks, resulting in improved robustness to adversarial attacks, generalization, and training dynamics compared to softmax-based methods.

Standard deep learning architectures used for classification generate label predictions with a projection head and softmax activation function. Although successful, these methods fail to leverage the relational information between samples in the batch for generating label predictions. In recent works, graph-based learning techniques, namely Laplace learning, have been heuristically combined with neural networks for both supervised and semi-supervised learning (SSL) tasks. However, prior works approximate the gradient of the loss function with respect to the graph learning algorithm or decouple the processes; end-to-end integration with neural networks is not achieved. In this work, we derive backpropagation equations, via the adjoint method, for inclusion of a general family of graph learning layers into a neural network. This allows us to precisely integrate graph Laplacian-based label propagation into a neural network layer, replacing a projection head and softmax activation function for classification tasks. Using this new framework, our experimental results demonstrate smooth label transitions across data, improved robustness to adversarial attacks, improved generalization, and improved training dynamics compared to the standard softmax-based approach.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes