LGMLSep 13, 2019

Neural Oblivious Decision Ensembles for Deep Learning on Tabular Data

arXiv:1909.06312v2412 citationsHas Code
Originality Incremental advance
AI Analysis

This addresses the challenge of applying deep learning effectively to heterogeneous tabular data, which is crucial for domains relying on tabular datasets, though it appears incremental as it builds on existing tree-based methods.

The authors tackled the problem of deep learning underperforming on tabular data compared to gradient boosting decision trees (GBDT) by introducing Neural Oblivious Decision Ensembles (NODE), a new architecture that outperformed leading GBDT packages on most tasks in extensive experiments.

Nowadays, deep neural networks (DNNs) have become the main instrument for machine learning tasks within a wide range of domains, including vision, NLP, and speech. Meanwhile, in an important case of heterogenous tabular data, the advantage of DNNs over shallow counterparts remains questionable. In particular, there is no sufficient evidence that deep learning machinery allows constructing methods that outperform gradient boosting decision trees (GBDT), which are often the top choice for tabular problems. In this paper, we introduce Neural Oblivious Decision Ensembles (NODE), a new deep learning architecture, designed to work with any tabular data. In a nutshell, the proposed NODE architecture generalizes ensembles of oblivious decision trees, but benefits from both end-to-end gradient-based optimization and the power of multi-layer hierarchical representation learning. With an extensive experimental comparison to the leading GBDT packages on a large number of tabular datasets, we demonstrate the advantage of the proposed NODE architecture, which outperforms the competitors on most of the tasks. We open-source the PyTorch implementation of NODE and believe that it will become a universal framework for machine learning on tabular data.

Code Implementations5 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes