LGDec 1, 2025

Fantastic Features and Where to Find Them: A Probing Method to combine Features from Multiple Foundation Models

Benjamin Ramtoula, Pierre-Yves Lajoie, Paul Newman, Daniele De Martini

arXiv:2512.01405v19.42 citationsh-index: 9

Originality Incremental advance

AI Analysis

This addresses the challenge of efficiently leveraging complementary strengths across diverse foundation models for practitioners in machine learning, though it is incremental as it builds on existing probing methods.

The paper tackles the problem of combining features from multiple foundation models for downstream tasks, proposing ComBo, a probing-based adapter that compresses activations and processes them with a lightweight transformer. On the VTAB-1k benchmark with 19 tasks, ComBo outperforms previous probing methods and matches or surpasses more expensive alternatives like distillation-based model merging.

Foundation models (FMs) trained with different objectives and data learn diverse representations, making some more effective than others for specific downstream tasks. Existing adaptation strategies, such as parameter-efficient fine-tuning, focus on individual models and do not exploit the complementary strengths across models. Probing methods offer a promising alternative by extracting information from frozen models, but current techniques do not scale well with large feature sets and often rely on dataset-specific hyperparameter tuning. We propose Combined backBones (ComBo), a simple and scalable probing-based adapter that effectively integrates features from multiple models and layers. ComBo compresses activations from layers of one or more FMs into compact token-wise representations and processes them with a lightweight transformer for task-specific prediction. Crucially, ComBo does not require dataset-specific tuning or backpropagation through the backbone models. However, not all models are equally relevant for all tasks. To address this, we introduce a mechanism that leverages ComBo's joint multi-backbone probing to efficiently evaluate each backbone's task-relevance, enabling both practical model comparison and improved performance through selective adaptation. On the 19 tasks of the VTAB-1k benchmark, ComBo outperforms previous probing methods, matches or surpasses more expensive alternatives, such as distillation-based model merging, and enables efficient probing of tuned models. Our results demonstrate that ComBo offers a practical and general-purpose framework for combining diverse representations from multiple FMs.

View on arXiv PDF

Similar