LG AI CEMay 16, 2024

LLM and Simulation as Bilevel Optimizers: A New Paradigm to Advance Physical Scientific Discovery

Pingchuan Ma, Tsun-Hsuan Wang, Minghao Guo, Zhiqing Sun, Joshua B. Tenenbaum, Daniela Rus, Chuang Gan, Wojciech Matusik

arXiv:2405.09783v132.585 citationsh-index: 122Has CodeICML

Originality Incremental advance

AI Analysis

This addresses the problem of enhancing scientific discovery in physics and chemistry by bridging abstract reasoning with computational simulation, representing a new paradigm rather than an incremental improvement.

The paper tackles the challenge of integrating Large Language Models (LLMs) with simulations to advance physical scientific discovery, proposing a bilevel optimization framework called Scientific Generative Agent (SGA) that combines LLMs for hypothesis generation with simulations for experimental feedback, resulting in novel solutions in constitutive law discovery and molecular design.

Large Language Models have recently gained significant attention in scientific discovery for their extensive knowledge and advanced reasoning capabilities. However, they encounter challenges in effectively simulating observational feedback and grounding it with language to propel advancements in physical scientific discovery. Conversely, human scientists undertake scientific discovery by formulating hypotheses, conducting experiments, and revising theories through observational analysis. Inspired by this, we propose to enhance the knowledge-driven, abstract reasoning abilities of LLMs with the computational strength of simulations. We introduce Scientific Generative Agent (SGA), a bilevel optimization framework: LLMs act as knowledgeable and versatile thinkers, proposing scientific hypotheses and reason about discrete components, such as physics equations or molecule structures; meanwhile, simulations function as experimental platforms, providing observational feedback and optimizing via differentiability for continuous parts, such as physical parameters. We conduct extensive experiments to demonstrate our framework's efficacy in constitutive law discovery and molecular design, unveiling novel solutions that differ from conventional human expectations yet remain coherent upon analysis.

View on arXiv PDF Code

Similar