DS LGMay 21, 2025

A Simple Approximation Algorithm for Optimal Decision Tree

arXiv:2505.15641v15.14 citationsh-index: 1

Originality Synthesis-oriented

AI Analysis

This work offers a simpler solution for applications like active learning and medical diagnosis, but it is incremental as it builds on existing approximation bounds.

The paper tackles the NP-hard optimal decision tree problem by providing a simple algorithm with an approximation ratio of 8 ln m, improving on the complexity and constant factors of prior O(ln m) approximations.

Optimal decision tree (\odt) is a fundamental problem arising in applications such as active learning, entity identification, and medical diagnosis. An instance of \odt is given by $m$ hypotheses, out of which an unknown ``true'' hypothesis is drawn according to some probability distribution. An algorithm needs to identify the true hypothesis by making queries: each query incurs a cost and has a known response for each hypothesis. The goal is to minimize the expected query cost to identify the true hypothesis. We consider the most general setting with arbitrary costs, probabilities and responses. \odt is NP-hard to approximate better than $\ln m$ and there are $O(\ln m)$ approximation algorithms known for it. However, these algorithms and/or their analyses are quite complex. Moreover, the leading constant factors are large. We provide a simple algorithm and analysis for \odt, proving an approximation ratio of $8 \ln m$.

View on arXiv PDF

Similar