AINEAug 22, 2025

Competition and Attraction Improve Model Fusion

arXiv:2508.16204v16 citationsh-index: 4Has CodeGECCO
Originality Highly original
AI Analysis

This addresses the limitation of fixed parameter groups in model merging for machine learning practitioners, offering a more flexible and efficient approach.

The paper tackles the problem of manually partitioning model parameters in model merging by proposing M2N2, an evolutionary algorithm that dynamically adjusts merging boundaries and uses competition and attraction mechanisms, achieving performance comparable to CMA-ES on MNIST classifiers and state-of-the-art results on language and image generation models.

Model merging is a powerful technique for integrating the specialized knowledge of multiple machine learning models into a single model. However, existing methods require manually partitioning model parameters into fixed groups for merging, which restricts the exploration of potential combinations and limits performance. To overcome these limitations, we propose Model Merging of Natural Niches (M2N2), an evolutionary algorithm with three key features: (1) dynamic adjustment of merging boundaries to progressively explore a broader range of parameter combinations; (2) a diversity preservation mechanism inspired by the competition for resources in nature, to maintain a population of diverse, high-performing models that are particularly well-suited for merging; and (3) a heuristicbased attraction metric to identify the most promising pairs of models for fusion. Our experimental results demonstrate, for the first time, that model merging can be used to evolve models entirely from scratch. Specifically, we apply M2N2 to evolve MNIST classifiers from scratch and achieve performance comparable to CMA-ES, while being computationally more efficient. Furthermore, M2N2 scales to merge specialized language and image generation models, achieving state-of-the-art performance. Notably, it preserves crucial model capabilities beyond those explicitly optimized by the fitness function, highlighting its robustness and versatility. Our code is available at https://github.com/SakanaAI/natural_niches

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes