KALE-LM-Chem: Vision and Practice Toward an AI Brain for Chemistry
This work aims to assist and accelerate scientific discovery in chemistry, but it appears incremental as it builds on existing LLM advancements for a specific domain.
The authors propose a vision for an AI-powered chemical brain with four core capabilities and introduce KALE-LM-Chem models, which achieved outstanding performance in chemistry tasks.
Recent advancements in large language models (LLMs) have demonstrated strong potential for enabling domain-specific intelligence. In this work, we present our vision for building an AI-powered chemical brain, which frames chemical intelligence around four core capabilities: information extraction, semantic parsing, knowledge-based QA, and reasoning & planning. We argue that domain knowledge and logic are essential pillars for enabling such a system to assist and accelerate scientific discovery. To initiate this effort, we introduce our first generation of large language models for chemistry: KALE-LM-Chem and KALE-LM-Chem-1.5, which have achieved outstanding performance in tasks related to the field of chemistry. We hope that our work serves as a strong starting point, helping to realize more intelligent AI and promoting the advancement of human science and technology, as well as societal development.