MLLGSTMEMar 20, 2024

AdaTrans: Feature-wise and Sample-wise Adaptive Transfer Learning for High-dimensional Regression

arXiv:2403.13565v35 citationsh-index: 5
Originality Incremental advance
AI Analysis

This addresses the problem of efficient knowledge transfer in high-dimensional settings for statisticians and machine learning practitioners, offering incremental improvements over existing methods.

The paper tackles transfer learning for high-dimensional linear regression by proposing adaptive methods (F-AdaTrans and S-AdaTrans) that detect and aggregate transferable structures across features or source samples, achieving convergence rates close to an oracle estimator and near-minimax optimal rates in simulations and real data.

We consider the transfer learning problem in the high dimensional linear regression setting, where the feature dimension is larger than the sample size. To learn transferable information, which may vary across features or the source samples, we propose an adaptive transfer learning method that can detect and aggregate the feature-wise (F-AdaTrans) or sample-wise (S-AdaTrans) transferable structures. We achieve this by employing a fused-penalty, coupled with weights that can adapt according to the transferable structure. To choose the weight, we propose a theoretically informed, data-driven procedure, enabling F-AdaTrans to selectively fuse the transferable signals with the target while filtering out non-transferable signals, and S-AdaTrans to obtain the optimal combination of information transferred from each source sample. We show that, with appropriately chosen weights, F-AdaTrans achieves a convergence rate close to that of an oracle estimator with a known transferable structure, and S-AdaTrans recovers existing near-minimax optimal rates as a special case. The effectiveness of the proposed method is validated using both simulation and real data, demonstrating favorable performance compared to the existing methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes