LGAIMLFeb 19

When to Trust the Cheap Check: Weak and Strong Verification for Reasoning

arXiv:2602.17633v13 citationsh-index: 11
Originality Incremental advance
AI Analysis

This addresses the efficiency-trust trade-off in deploying LLMs for reasoning tasks, offering a formal framework for verification policies, but it is incremental as it builds on existing verification concepts.

The paper tackles the problem of deciding when to trust cheap, noisy verification methods (weak verification) versus relying on costly, reliable human feedback (strong verification) in LLM reasoning systems, showing that optimal policies have a two-threshold structure and developing an online algorithm that provably controls errors without assumptions.

Reasoning with LLMs increasingly unfolds inside a broader verification loop. Internally, systems use cheap checks, such as self-consistency or proxy rewards, which we call weak verification. Externally, users inspect outputs and steer the model through feedback until results are trustworthy, which we call strong verification. These signals differ sharply in cost and reliability: strong verification can establish trust but is resource-intensive, while weak verification is fast and scalable but noisy and imperfect. We formalize this tension through weak--strong verification policies, which decide when to accept or reject based on weak verification and when to defer to strong verification. We introduce metrics capturing incorrect acceptance, incorrect rejection, and strong-verification frequency. Over population, we show that optimal policies admit a two-threshold structure and that calibration and sharpness govern the value of weak verifiers. Building on this, we develop an online algorithm that provably controls acceptance and rejection errors without assumptions on the query stream, the language model, or the weak verifier.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes