Method Drift · Mixture-of-experts routing

Router-aware approach to optimizing importance-sampling weights

Towards Stable and Effective Reinforcement Learning for Mixture-of-Experts

Mixture-of-experts routing · first seen Oct 27, 2025

Current frontier: recent, not yet superseded in the knowledge base

0 papers critique it · 0 beat it on benchmarks

Newer alternatives

Recent methods in the same sub-problem, not yet superseded in the knowledge base.