LGDec 14, 2020

Graphs for deep learning representations

arXiv:2012.07439v12 citations
AI Analysis

This work aims to improve the interpretability and efficiency of deep learning models by providing a structured way to analyze their internal representations.

This thesis introduces a graph formalism, based on Graph Signal Processing, to represent the latent spaces of deep neural networks. This approach allows for the investigation of deep learning architectures to address issues such as generalization, design choices, robustness to perturbations, and computational complexity.

In recent years, Deep Learning methods have achieved state of the art performance in a vast range of machine learning tasks, including image classification and multilingual automatic text translation. These architectures are trained to solve machine learning tasks in an end-to-end fashion. In order to reach top-tier performance, these architectures often require a very large number of trainable parameters. There are multiple undesirable consequences, and in order to tackle these issues, it is desired to be able to open the black boxes of deep learning architectures. Problematically, doing so is difficult due to the high dimensionality of representations and the stochasticity of the training process. In this thesis, we investigate these architectures by introducing a graph formalism based on the recent advances in Graph Signal Processing (GSP). Namely, we use graphs to represent the latent spaces of deep neural networks. We showcase that this graph formalism allows us to answer various questions including: ensuring generalization abilities, reducing the amount of arbitrary choices in the design of the learning process, improving robustness to small perturbations added to the inputs, and reducing computational complexity

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes