LGOct 21, 2025

Empowering Decision Trees via Shape Function Branching

arXiv:2510.19040v1h-index: 2
Originality Highly original
AI Analysis

This work addresses the problem of interpretability and efficiency in decision trees for users in tabular data analysis, representing a novel method for a known bottleneck.

The paper tackles the limitation of decision trees in capturing non-linear feature effects by proposing Shape Generalized Trees (SGTs), which use learnable shape functions for richer partitioning, resulting in superior performance and reduced model size compared to traditional trees.

Decision trees are prized for their interpretability and strong performance on tabular data. Yet, their reliance on simple axis-aligned linear splits often forces deep, complex structures to capture non-linear feature effects, undermining human comprehension of the constructed tree. To address this limitation, we propose a novel generalization of a decision tree, the Shape Generalized Tree (SGT), in which each internal node applies a learnable axis-aligned shape function to a single feature, enabling rich, non-linear partitioning in one split. As users can easily visualize each node's shape function, SGTs are inherently interpretable and provide intuitive, visual explanations of the model's decision mechanisms. To learn SGTs from data, we propose ShapeCART, an efficient induction algorithm for SGTs. We further extend the SGT framework to bivariate shape functions (S$^2$GT) and multi-way trees (SGT$_K$), and present Shape$^2$CART and ShapeCART$_K$, extensions to ShapeCART for learning S$^2$GTs and SGT$_K$s, respectively. Experiments on various datasets show that SGTs achieve superior performance with reduced model size compared to traditional axis-aligned linear trees.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes