ChemATP: A Training-Free Chemical Reasoning Framework for Large Language Models
This addresses the challenge of precise chemical reasoning for molecular science applications, offering an incremental improvement by decoupling knowledge from the reasoning engine.
The paper tackled the problem of large language models struggling with chemical reasoning due to lack of explicit priors, by introducing ChemATP, a training-free framework that uses an atom-level textual knowledge base, resulting in performance that significantly outperforms training-free baselines and rivals state-of-the-art training-based models.
Large Language Models (LLMs) exhibit strong general reasoning but struggle in molecular science due to the lack of explicit chemical priors in standard string representations. Current solutions face a fundamental dilemma. Training-based methods inject priors into parameters, but this static coupling hinders rapid knowledge updates and often compromises the model's general reasoning capabilities. Conversely, existing training-free methods avoid these issues but rely on surface-level prompting, failing to provide the fine-grained atom-level priors essential for precise chemical reasoning. To address this issue, we introduce ChemATP, a framework that decouples chemical knowledge from the reasoning engine. By constructing the first atom-level textual knowledge base, ChemATP enables frozen LLMs to explicitly retrieve and reason over this information dynamically. This architecture ensures interpretability and adaptability while preserving the LLM's intrinsic general intelligence. Experiments show that ChemATP significantly outperforms training-free baselines and rivals state-of-the-art training-based models, demonstrating that explicit prior injection is a competitive alternative to implicit parameter updates.