LG AI MLSep 14, 2023

Multi-Source Domain Adaptation meets Dataset Distillation through Dataset Dictionary Learning

Eduardo Fernandes Montesuma, Fred Ngolè Mboula, Antoine Souloumiac

arXiv:2309.07666v19.87 citationsh-index: 17

Originality Synthesis-oriented

AI Analysis

This work addresses the challenge of adapting multiple labeled source domains to an unlabeled target domain while synthesizing compact dataset summaries, which is incremental as it builds on prior methods.

The paper tackles the combined problem of multi-source domain adaptation and dataset distillation by proposing a method that adapts existing techniques, achieving state-of-the-art adaptation performance with as little as 1 sample per class on four benchmarks.

In this paper, we consider the intersection of two problems in machine learning: Multi-Source Domain Adaptation (MSDA) and Dataset Distillation (DD). On the one hand, the first considers adapting multiple heterogeneous labeled source domains to an unlabeled target domain. On the other hand, the second attacks the problem of synthesizing a small summary containing all the information about the datasets. We thus consider a new problem called MSDA-DD. To solve it, we adapt previous works in the MSDA literature, such as Wasserstein Barycenter Transport and Dataset Dictionary Learning, as well as DD method Distribution Matching. We thoroughly experiment with this novel problem on four benchmarks (Caltech-Office 10, Tennessee-Eastman Process, Continuous Stirred Tank Reactor, and Case Western Reserve University), where we show that, even with as little as 1 sample per class, one achieves state-of-the-art adaptation performance.

View on arXiv PDF

Similar