LG AIMar 16

Directional Routing in Transformers

arXiv:2603.1492312.4

AI Analysis

This addresses the issue of inefficient attention coordination in transformers for AI researchers, though it is incremental as it builds on existing transformer architectures.

The paper tackled the problem of transformer attention heads lacking coordination by introducing directional routing, a lightweight mechanism that became the model's dominant computational pathway, reducing perplexity by 31-56% relative to a baseline and collapsing factual recall to near-zero probability when disabled.

We introduce directional routing, a lightweight mechanism that gives each transformer attention head learned suppression directions controlled by a shared router, at 3.9% parameter cost. We train a 433M-parameter model alongside an identical baseline in a single run, then trace the resulting circuits through mechanistic interpretability. Routing becomes the model's dominant computational pathway. Disabling it collapses factual recall to near-zero probability across all 8 test prompts and drops induction accuracy from 93.4% to 0.0%. Knocking out individual attention heads has negligible effect: the primary mover head's removal actually increases target probability, and induction heads retain 98.6% accuracy without their strongest member. The coordination mechanism is irreplaceable; the components it coordinates are not. The model also self-organizes, without explicit pressure, into two regimes: domain-adaptive routing in early layers and fixed syntactic pruning in late layers, where the least-varying layer is the most critical (+42.6 PPL when disabled). Routing reduces perplexity 31-56% relative to the baseline, though downstream multiple-choice benchmarks do not yet reflect these gains.

View on arXiv PDF

Similar