LGAICRJul 19, 2023

Eliminating Label Leakage in Tree-Based Vertical Federated Learning

arXiv:2307.10318v28 citationsh-index: 65
Originality Incremental advance
AI Analysis

This addresses security vulnerabilities in federated learning for parties with disjoint features, though it is incremental as it builds on existing tree-based models and defense techniques.

The paper tackled the problem of label leakage in tree-based vertical federated learning by introducing a novel attack called ID2Graph that deduces private training labels, and proposed defense mechanisms like Grafting-LDP and ID-LMID to mitigate this risk, with comprehensive experiments showing significant risks and effective mitigation.

Vertical federated learning (VFL) enables multiple parties with disjoint features of a common user set to train a machine learning model without sharing their private data. Tree-based models have become prevalent in VFL due to their interpretability and efficiency. However, the vulnerability of tree-based VFL has not been sufficiently investigated. In this study, we first introduce a novel label inference attack, ID2Graph, which utilizes the sets of record IDs assigned to each node (i.e., instance space)to deduce private training labels. ID2Graph attack generates a graph structure from training samples, extracts communities from the graph, and clusters the local dataset using community information. To counteract label leakage from the instance space, we propose two effective defense mechanisms, Grafting-LDP, which improves the utility of label differential privacy with post-processing, and andID-LMID, which focuses on mutual information regularization. Comprehensive experiments on various datasets reveal that ID2Graph presents significant risks to tree-based models such as RandomForest and XGBoost. Further evaluations of these benchmarks demonstrate that our defense methods effectively mitigate label leakage in such instances

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes