DSLGMay 21, 2025

A Simple Approximation Algorithm for Optimal Decision Tree

arXiv:2505.15641v14 citationsh-index: 1
Originality Synthesis-oriented
AI Analysis

This work offers a simpler solution for applications like active learning and medical diagnosis, but it is incremental as it builds on existing approximation bounds.

The paper tackles the NP-hard optimal decision tree problem by providing a simple algorithm with an approximation ratio of 8 ln m, improving on the complexity and constant factors of prior O(ln m) approximations.

Optimal decision tree (\odt) is a fundamental problem arising in applications such as active learning, entity identification, and medical diagnosis. An instance of \odt is given by $m$ hypotheses, out of which an unknown ``true'' hypothesis is drawn according to some probability distribution. An algorithm needs to identify the true hypothesis by making queries: each query incurs a cost and has a known response for each hypothesis. The goal is to minimize the expected query cost to identify the true hypothesis. We consider the most general setting with arbitrary costs, probabilities and responses. \odt is NP-hard to approximate better than $\ln m$ and there are $O(\ln m)$ approximation algorithms known for it. However, these algorithms and/or their analyses are quite complex. Moreover, the leading constant factors are large. We provide a simple algorithm and analysis for \odt, proving an approximation ratio of $8 \ln m$.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes