Method DriftMixture-of-experts routing

Heavily superseded#4 of 1,370 most-superseded

HydraLoRA

HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning

Mixture-of-experts routing · first seen Apr 30, 2024

heavily superseded — a standard baseline that newer methods routinely beat

1 papers critique it · 8 beat it on benchmarks

What papers say

Verbatim critique sentences, each from a paper that cites HydraLoRA as a baseline.

  • Compared to existing state-of-the-art MoE baselines (Switch Transformer, MoLE, HydraLoRA), HiLoMoE consistently shows superior efficiency and effectiveness. On average, it improves AUC by 0.08% and reduces LogLoss by 0.10% compared to the best competing MoE variant (HydraLoRA). At the same time, HiLoMoE reduces parameter count by an average of 4.04K, which is equivalent to a 21.0% reduction relative to the most parameter-efficient MoE competitor (HydraLoRA).
    Hierarchical LoRA MoE for Efficient CTR Model Scaling

Beaten on benchmarks

Head-to-head results where a newer method reports beating HydraLoRA. Values are copied from the source paper's tables — verify against the cited paper.

Newer alternatives

Recent methods in the same sub-problem, not yet superseded in the knowledge base.