LGApr 1, 2025

Benchmarking Federated Machine Unlearning methods for Tabular Data

Chenguang Xiao, Abhirup Ghosh, Han Wu, Shuo Wang, Diederick van Thiel

arXiv:2504.00921v14.1h-index: 2

Originality Incremental advance

AI Analysis

This work addresses privacy and efficiency challenges in federated learning for tabular data, providing a foundational benchmark for incremental improvements in machine unlearning.

The paper benchmarks machine unlearning methods in federated learning for tabular data, finding that tree-based models achieve high certifiability for exact unlearning while gradient-based methods offer better computational efficiency.

Machine unlearning, which enables a model to forget specific data upon request, is increasingly relevant in the era of privacy-centric machine learning, particularly within federated learning (FL) environments. This paper presents a pioneering study on benchmarking machine unlearning methods within a federated setting for tabular data, addressing the unique challenges posed by cross-silo FL where data privacy and communication efficiency are paramount. We explore unlearning at the feature and instance levels, employing both machine learning, random forest and logistic regression models. Our methodology benchmarks various unlearning algorithms, including fine-tuning and gradient-based approaches, across multiple datasets, with metrics focused on fidelity, certifiability, and computational efficiency. Experiments demonstrate that while fidelity remains high across methods, tree-based models excel in certifiability, ensuring exact unlearning, whereas gradient-based methods show improved computational efficiency. This study provides critical insights into the design and selection of unlearning algorithms tailored to the FL environment, offering a foundation for further research in privacy-preserving machine learning.

View on arXiv PDF

Similar