LGAIFeb 6, 2025

A High-Dimensional Statistical Method for Optimizing Transfer Quantities in Multi-Source Transfer Learning

arXiv:2502.04242v43 citationsh-index: 3Has Code
Originality Highly original
AI Analysis

This addresses data scarcity in supervised learning by optimizing multi-source transfer learning, representing a novel method for a known bottleneck.

The paper tackles the problem of determining optimal sample quantities from multiple source tasks in transfer learning, developing a theoretical framework and algorithm (OTQMS) that significantly outperforms state-of-the-art methods in accuracy and data efficiency on benchmark datasets.

Multi-source transfer learning provides an effective solution to data scarcity in real-world supervised learning scenarios by leveraging multiple source tasks. In this field, existing works typically use all available samples from sources in training, which constrains their training efficiency and may lead to suboptimal results. To address this, we propose a theoretical framework that answers the question: what is the optimal quantity of source samples needed from each source task to jointly train the target model? Specifically, we introduce a generalization error measure based on K-L divergence, and minimize it based on high-dimensional statistical analysis to determine the optimal transfer quantity for each source task. Additionally, we develop an architecture-agnostic and data-efficient algorithm OTQMS to implement our theoretical results for target model training in multi-source transfer learning. Experimental studies on diverse architectures and two real-world benchmark datasets show that our proposed algorithm significantly outperforms state-of-the-art approaches in both accuracy and data efficiency. The code and supplementary materials are available in https://github.com/zqy0126/OTQMS.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes