CLFeb 24

Learned but Not Expressed: Capability-Expression Dissociation in Large Language Models

arXiv:2603.18013
Originality Incremental advance
AI Analysis

This reveals a systematic dissociation between learned capability and expressed output in LLMs, with implications for generation dynamics and output control, though it is incremental in understanding model behavior.

The study found that large language models (LLMs) can reconstruct learned content under specific conditions but never express non-causal solutions in standard generation tasks, with 0% occurrence across 300 prompts, challenging the link between training data and output probability.

Large language models (LLMs) demonstrate the capacity to reconstruct and trace learned content from their training data under specific elicitation conditions, yet this capability does not manifest in standard generation contexts. This empirical observational study examines the expression of non-causal, non-implementable solution types across 300 prompt-response generations spanning narrative and problem-solving task contexts. Drawing on recent findings regarding memorization contiguity and alignment-induced discourse priors, we document a systematic dissociation between learned capability and expressed output. Across three distinct LLMs, ten task scenarios, and both creative narrative and practical advisory contexts, we documented zero instances of non-causal solution frames in generated outputs (0%, 95% CI: [0%, 1.2%]), despite verified reconstruction capability under conditional extraction. These findings challenge the prevailing assumption that training data presence directly predicts output probability, demonstrating instead that task-conditioned generation policies can comprehensively suppress learned content across diverse contexts. The results offer implications for understanding generation dynamics, output distribution control, and the behavioral boundaries of contemporary LLMs.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes