CVSep 5, 2022

Federated Zero-Shot Learning for Visual Recognition

Zhi Chen, Yadan Luo, Sen Wang, Jingjing Li, Zi Huang

arXiv:2209.01994v17.35 citationsh-index: 42

Originality Incremental advance

AI Analysis

This work addresses privacy concerns in zero-shot learning for organizations by enabling decentralized training without data sharing, though it is incremental as it builds on existing federated and zero-shot learning methods.

The paper tackles the problem of zero-shot learning for visual recognition while preserving data privacy by proposing a Federated Zero-Shot Learning (FedZSL) framework that learns from decentralized data on edge devices with non-overlapping classes, and it demonstrates effectiveness and robustness through experiments on three benchmark datasets.

Zero-shot learning is a learning regime that recognizes unseen classes by generalizing the visual-semantic relationship learned from the seen classes. To obtain an effective ZSL model, one may resort to curating training samples from multiple sources, which may inevitably raise the privacy concerns about data sharing across different organizations. In this paper, we propose a novel Federated Zero-Shot Learning FedZSL framework, which learns a central model from the decentralized data residing on edge devices. To better generalize to previously unseen classes, FedZSL allows the training data on each device sampled from the non-overlapping classes, which are far from the i.i.d. that traditional federated learning commonly assumes. We identify two key challenges in our FedZSL protocol: 1) the trained models are prone to be biased to the locally observed classes, thus failing to generalize to the unseen classes and/or seen classes appeared on other devices; 2) as each category in the training data comes from a single source, the central model is highly vulnerable to model replacement (backdoor) attacks. To address these issues, we propose three local objectives for visual-semantic alignment and cross-device alignment through relation distillation, which leverages the normalized class-wise covariance to regularize the consistency of the prediction logits across devices. To defend against the backdoor attacks, a feature magnitude defending technique is proposed. As malicious samples are less correlated to the given semantic attributes, the visual features of low magnitude will be discarded to stabilize model updates. The effectiveness and robustness of FedZSL are demonstrated by extensive experiments conducted on three zero-shot benchmark datasets.

View on arXiv PDF

Similar