DS LGNov 16, 2022

On the Power of Learning-Augmented Search Trees

Jingbang Chen, Xinyuan Cao, Alicia Stepin, Li Chen

arXiv:2211.09251v34 citationsh-index: 4

Originality Incremental advance

AI Analysis

This work addresses the efficiency of search trees in data structures and algorithms, offering a robust and generalizable solution with incremental advancements over previous learning-augmented approaches.

The paper tackles the problem of designing learning-augmented binary search trees (BSTs) that achieve static optimality for arbitrary input distributions, extending prior work limited to Zipfian distributions, and demonstrates empirical performance improvements over existing methods.

We study learning-augmented binary search trees (BSTs) via Treaps with carefully designed priorities. The result is a simple search tree in which the depth of each item $x$ is determined by its predicted weight $w_x$. Specifically, each item $x$ is assigned a composite priority of $-\lfloor\log\log(1/w_x)\rfloor + U(0, 1)$ where $U(0, 1)$ is the uniform random variable. By choosing $w_x$ as the relative frequency of $x$, the resulting search trees achieve static optimality. This approach generalizes the recent learning-augmented BSTs [Lin-Luo-Woodruff ICML '22], which only work for Zipfian distributions, by extending them to arbitrary input distributions. Furthermore, we demonstrate that our method can be generalized to a B-Tree data structure using the B-Treap approach [Golovin ICALP '09]. Our search trees are also capable of leveraging localities in the access sequence through online self-reorganization, thereby achieving the working-set property. Additionally, they are robust to prediction errors and support dynamic operations, such as insertions, deletions, and prediction updates. We complement our analysis with an empirical study, demonstrating that our method outperforms prior work and classic data structures.

View on arXiv PDF

Similar