LGNEDec 13, 2021

On the Choice of General Purpose Classifiers in Learned Bloom Filters: An Initial Analysis Within Basic Filters

arXiv:2112.06563v111 citations
Originality Synthesis-oriented
AI Analysis

This work addresses a specific gap in the design of learned data structures for practitioners, but it is incremental as it builds on existing learned Bloom filter frameworks.

The paper tackles the problem of selecting appropriate classifiers for Learned Bloom Filters, a key component affecting performance, by conducting an initial analysis and providing guidelines on choosing among five classic classification paradigms.

Bloom Filters are a fundamental and pervasive data structure. Within the growing area of Learned Data Structures, several Learned versions of Bloom Filters have been considered, yielding advantages over classic Filters. Each of them uses a classifier, which is the Learned part of the data structure. Although it has a central role in those new filters, and its space footprint as well as classification time may affect the performance of the Learned Filter, no systematic study of which specific classifier to use in which circumstances is available. We report progress in this area here, providing also initial guidelines on which classifier to choose among five classic classification paradigms.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes