A Prototype-Oriented Clustering for Domain Shift with Source Privacy
This addresses domain shift in clustering with privacy protection, but it is incremental as it builds on existing methods.
The paper tackles unsupervised clustering under domain shift by introducing Prototype-oriented Clustering with Distillation (PCD), which improves performance and applicability while protecting source domain privacy, with experiments showing effectiveness across multiple benchmarks.
Unsupervised clustering under domain shift (UCDS) studies how to transfer the knowledge from abundant unlabeled data from multiple source domains to learn the representation of the unlabeled data in a target domain. In this paper, we introduce Prototype-oriented Clustering with Distillation (PCD) to not only improve the performance and applicability of existing methods for UCDS, but also address the concerns on protecting the privacy of both the data and model of the source domains. PCD first constructs a source clustering model by aligning the distributions of prototypes and data. It then distills the knowledge to the target model through cluster labels provided by the source model while simultaneously clustering the target data. Finally, it refines the target model on the target domain data without guidance from the source model. Experiments across multiple benchmarks show the effectiveness and generalizability of our source-private clustering method.