LGAINEFeb 10, 2025

Physics-Guided Foundation Model for Scientific Discovery: An Application to Aquatic Science

arXiv:2502.06084v116 citationsh-index: 15AAAI
Originality Highly original
AI Analysis

This work addresses the problem of limited applicability of physics-guided machine learning approaches to complex systems, which is significant for scientists and researchers in various fields, particularly aquatic science.

The authors tackled the problem of integrating scientific theories into machine learning models for complex systems, and their proposed Physics-Guided Foundation Model (PGFM) achieved effective modeling of multiple coupled processes, such as water temperature and dissolved oxygen dynamics in lakes. The model demonstrated its ability to adapt to various systems while maintaining consistency with physical theories.

Physics-guided machine learning (PGML) has become a prevalent approach in studying scientific systems due to its ability to integrate scientific theories for enhancing machine learning (ML) models. However, most PGML approaches are tailored to isolated and relatively simple tasks, which limits their applicability to complex systems involving multiple interacting processes and numerous influencing features. In this paper, we propose a \textit{\textbf{P}hysics-\textbf{G}uided \textbf{F}oundation \textbf{M}odel (\textbf{PGFM})} that combines pre-trained ML models and physics-based models and leverages their complementary strengths to improve the modeling of multiple coupled processes. To effectively conduct pre-training, we construct a simulated environmental system that encompasses a wide range of influencing features and various simulated variables generated by physics-based models. The model is pre-trained in this system to adaptively select important feature interactions guided by multi-task objectives. We then fine-tune the model for each specific task using true observations, while maintaining consistency with established physical theories, such as the principles of mass and energy conservation. We demonstrate the effectiveness of this methodology in modeling water temperature and dissolved oxygen dynamics in real-world lakes. The proposed PGFM is also broadly applicable to a range of scientific fields where physics-based models are being used.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes