CVAIMay 20

Beyond Routing: Characterising Expert Tuning and Representation in Vision Mixture-of-Experts

arXiv:2605.2061057.0
Predicted impact top 61% in CV · last 90 daysOriginality Incremental advance
AI Analysis

For researchers interpreting MoE models, this work provides a more nuanced understanding of expert specialization, moving beyond simplistic category-based routing to reveal continuous feature tuning.

The paper analyzes expert specialization in vision Mixture-of-Experts models beyond routing, finding that experts exhibit broad tuning to continuous visual and semantic dimensions, with an animate-inanimate distinction dominating partitioning, and that expert-level analyses reveal richer specialization than category-level routing alone.

Mixture-of-Experts (MoE) models are often interpreted by analysing which categories are routed to which experts. However, routing alone does not reveal what each expert actually encodes. We train sparsely-gated convolutional MoE models with a contrastive objective on natural images and characterise expert specialisation using tools from visual neuroscience. Extending from gating-level to expert-level analyses, we measure per-expert category separability, and per-expert tuning using the most exciting inputs. Extending from category-level to feature-level explanations, we interpret tuning via semantic dimensions derived from a dataset of human behavioural judgements (THINGS). Finally, we use tuning and representational similarity analysis to assess the stability of expertise-allocation across independent initialisations. We find that an animate-inanimate distinction dominates expert partitioning, apparent from gating through to expert readout, and is stable across independently trained models. Although routing statistics suggest relatively sparse, categorical preferences, expert analyses reveal broader tuning to continuous visual and semantic dimensions that extend beyond category boundaries. Experts exhibit similar category-separability to one another, despite distinct feature tuning, demonstrating the explanatory benefits of moving beyond category-level analyses. Together, these results show that expert specialisation in vision MoEs extends well beyond category routing and is better understood by probing fine-grained expert-level tuning and representational structure.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes