LGDec 14, 2021

Federated Nearest Neighbor Classification with a Colony of Fruit-Flies: With Supplement

arXiv:2112.07157v16 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of efficient and private nearest neighbor classification in federated learning for distributed data scenarios, representing an incremental advancement.

The paper tackles the problem of performing nearest neighbor classification in federated learning without sharing data, by reprogramming a fruit-fly-inspired hash and bloom filter to create FlyNN, which matches the accuracy of the canonical method across 70 datasets and achieves up to 8x speedup with 16 parties.

The mathematical formalization of a neurological mechanism in the olfactory circuit of a fruit-fly as a locality sensitive hash (Flyhash) and bloom filter (FBF) has been recently proposed and "reprogrammed" for various machine learning tasks such as similarity search, outlier detection and text embeddings. We propose a novel reprogramming of this hash and bloom filter to emulate the canonical nearest neighbor classifier (NNC) in the challenging Federated Learning (FL) setup where training and test data are spread across parties and no data can leave their respective parties. Specifically, we utilize Flyhash and FBF to create the FlyNN classifier, and theoretically establish conditions where FlyNN matches NNC. We show how FlyNN is trained exactly in a FL setup with low communication overhead to produce FlyNNFL, and how it can be differentially private. Empirically, we demonstrate that (i) FlyNN matches NNC accuracy across 70 OpenML datasets, (ii) FlyNNFL training is highly scalable with low communication overhead, providing up to $8\times$ speedup with $16$ parties.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes