AILGDec 29, 2025

Agentic Physical AI toward a Domain-Specific Foundation Model for Nuclear Reactor Control

arXiv:2512.23292v22 citationsh-index: 31
Originality Incremental advance
AI Analysis

This addresses safety-critical control in nuclear reactors, offering a novel approach to overcome limitations of general-purpose models, though it is domain-specific and incremental in its application.

The paper tackles the problem of AI for physical systems by proposing Agentic Physical AI, a domain-specific foundation model for nuclear reactor control that uses physics-based validation instead of perceptual inference, achieving a 500x reduction in variance and stabilizing execution-level behavior.

The prevailing paradigm in AI for physical systems, scaling general-purpose foundation models toward universal multimodal reasoning, confronts a fundamental barrier at the control interface. Recent benchmarks show that even frontier vision-language models achieve only 50-53% accuracy on basic quantitative physics tasks, behaving as approximate guessers that preserve semantic plausibility while violating physical constraints. This input unfaithfulness is not a scaling deficiency but a structural limitation. Perception-centric architectures optimize parameter-space imitation, whereas safety-critical control demands outcome-space guarantees over executed actions. Here, we present a fundamentally different pathway toward domain-specific foundation models by introducing compact language models operating as Agentic Physical AI, in which policy optimization is driven by physics-based validation rather than perceptual inference. We train a 360-million-parameter model on synthetic reactor control scenarios, scaling the dataset from 10^3 to 10^5 examples. This induces a sharp phase transition absent in general-purpose models. Small-scale systems exhibit high-variance imitation with catastrophic tail risk, while large-scale models undergo variance collapse exceeding 500x reduction, stabilizing execution-level behavior. Despite balanced exposure to four actuation families, the model autonomously rejects approximately 70% of the training distribution and concentrates 95% of runtime execution on a single-bank strategy. Learned representations transfer across distinct physics and continuous input modalities without architectural modification.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes