Method Drift›LLM reasoning / chain-of-thought
Superseded baseline#468 of 772 most-superseded
MoT (Merge of Thought)
LLM reasoning / chain-of-thought
superseded — cited as a baseline and beaten by newer methods
1 papers critique it · 0 beat it on benchmarks
What papers say
Verbatim critique sentences, each from a paper that cites MoT (Merge of Thought) as a baseline.
“static parameter addition or averaging (e.g., Shen2025MergeofThoughtD) often induces conflicts within the parameter space in distillation tasks. This leads to destructive interference or ``cancellation effects'' among diverse supervision signals, resulting in suboptimal performance.”
— "The Whole Is Greater Than the Sum of Its Parts": A Compatibility-Aware Multi-Teacher CoT Distillation Framework
Newer alternatives
Recent methods in the same sub-problem, not yet superseded in the knowledge base.
- Oct 15, 2025