LGAIMLMar 9, 2023

TANGOS: Regularizing Tabular Neural Networks through Gradient Orthogonalization and Specialization

Cambridge
arXiv:2303.05506v136 citationsh-index: 74
Originality Incremental advance
AI Analysis

This addresses the challenge of applying deep learning effectively to structured tabular data, offering a novel regularization approach that can be combined with existing techniques for incremental gains.

The paper tackles the problem of overfitting in deep neural networks for tabular data by introducing TANGOS, a regularization framework based on gradient orthogonalization and specialization, which improves out-of-sample generalization performance compared to other methods.

Despite their success with unstructured data, deep neural networks are not yet a panacea for structured tabular data. In the tabular domain, their efficiency crucially relies on various forms of regularization to prevent overfitting and provide strong generalization performance. Existing regularization techniques include broad modelling decisions such as choice of architecture, loss functions, and optimization methods. In this work, we introduce Tabular Neural Gradient Orthogonalization and Specialization (TANGOS), a novel framework for regularization in the tabular setting built on latent unit attributions. The gradient attribution of an activation with respect to a given input feature suggests how the neuron attends to that feature, and is often employed to interpret the predictions of deep networks. In TANGOS, we take a different approach and incorporate neuron attributions directly into training to encourage orthogonalization and specialization of latent attributions in a fully-connected network. Our regularizer encourages neurons to focus on sparse, non-overlapping input features and results in a set of diverse and specialized latent units. In the tabular domain, we demonstrate that our approach can lead to improved out-of-sample generalization performance, outperforming other popular regularization methods. We provide insight into why our regularizer is effective and demonstrate that TANGOS can be applied jointly with existing methods to achieve even greater generalization performance.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes