LG NEDec 13, 2021

On the Choice of General Purpose Classifiers in Learned Bloom Filters: An Initial Analysis Within Basic Filters

Giacomo Fumagalli, Davide Raimondi, Raffaele Giancarlo, Dario Malchiodi, Marco Frasca

arXiv:2112.06563v13.111 citations

Originality Synthesis-oriented

AI Analysis

This work addresses a specific gap in the design of learned data structures for practitioners, but it is incremental as it builds on existing learned Bloom filter frameworks.

The paper tackles the problem of selecting appropriate classifiers for Learned Bloom Filters, a key component affecting performance, by conducting an initial analysis and providing guidelines on choosing among five classic classification paradigms.

Bloom Filters are a fundamental and pervasive data structure. Within the growing area of Learned Data Structures, several Learned versions of Bloom Filters have been considered, yielding advantages over classic Filters. Each of them uses a classifier, which is the Learned part of the data structure. Although it has a central role in those new filters, and its space footprint as well as classification time may affect the performance of the Learned Filter, no systematic study of which specific classifier to use in which circumstances is available. We report progress in this area here, providing also initial guidelines on which classifier to choose among five classic classification paradigms.

View on arXiv PDF

Similar