LLM-Guided Open Hypothesis Learning from Autonomous Scanning Probe Microscopy Experiments
For materials scientists and microscopists, this framework extends autonomous experimentation from closed-loop optimization to open hypothesis discovery, enabling the generation of new physical models from experimental data without pre-specified hypotheses.
This work introduces an open hypothesis-learning framework that combines symbolic regression with LLM-based physical evaluation for autonomous scanning probe microscopy. Applied to ferroelectric domain switching in PZT thin films, it evolves from five seed measurements toward interpretable voltage-time growth laws consistent with kinetic domain-wall motion.
Autonomous experimentation has transformed microscopy and materials discovery by enabling closed-loop optimization including imaging and spectroscopy tuning, strucutre property relationship discovery, and exploration of combinatorial libraries. However, most current workflows remain limited to selecting measurements within fixed objective or hypothesis spaces, rather than generating new physical models from experimental data. Here, we introduce an open hypothesis-learning framework that combines symbolic regression with large-language-model-based physical evaluation and implement it for autonomous scanning probe microscopy. Symbolic regression generates candidate analytical relationships directly from sparse measurements, while the language-model evaluator ranks these candidates according to physical plausibility, scaling behavior, and consistency with known mechanisms. We demonstrate the approach on autonomous piezoresponse force microscopy measurements of ferroelectric domain switching in a PZT thin film. Starting from five seed measurements, the workflow evolves from physically incomplete candidate expressions toward interpretable voltage-time growth laws consistent with kinetic domain-wall motion. This work extends autonomous microscopy from closed-loop optimization toward open hypothesis discovery, where candidate physical laws emerge from the experiment itself rather than being specified in advance. More broadly, the framework establishes a route for integrating symbolic regression, physical reasoning, and adaptive experimentation into hierarchical autonomous scientific workflows.