LGDec 17, 2024

Open-Set Heterogeneous Domain Adaptation: Theoretical Analysis and Algorithm

arXiv:2412.13036v11 citationsh-index: 18Has CodeAAAI
Originality Incremental advance
AI Analysis

This addresses a real-world limitation in domain adaptation for scenarios with heterogeneous and open-set data, though it is incremental as it builds on existing heterogeneous DA methods.

The paper tackles the problem of domain adaptation when source and target domains have mismatched feature and label spaces, proposing a method that handles heterogeneity and identifies novel classes, with experiments showing effectiveness across text, image, and clinical data.

Domain adaptation (DA) tackles the issue of distribution shift by learning a model from a source domain that generalizes to a target domain. However, most existing DA methods are designed for scenarios where the source and target domain data lie within the same feature space, which limits their applicability in real-world situations. Recently, heterogeneous DA (HeDA) methods have been introduced to address the challenges posed by heterogeneous feature space between source and target domains. Despite their successes, current HeDA techniques fall short when there is a mismatch in both feature and label spaces. To address this, this paper explores a new DA scenario called open-set HeDA (OSHeDA). In OSHeDA, the model must not only handle heterogeneity in feature space but also identify samples belonging to novel classes. To tackle this challenge, we first develop a novel theoretical framework that constructs learning bounds for prediction error on target domain. Guided by this framework, we propose a new DA method called Representation Learning for OSHeDA (RL-OSHeDA). This method is designed to simultaneously transfer knowledge between heterogeneous data sources and identify novel classes. Experiments across text, image, and clinical data demonstrate the effectiveness of our algorithm. Model implementation is available at \url{https://github.com/pth1993/OSHeDA}.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes