From Physics to Foundation Models: A Review of AI-Driven Quantitative Remote Sensing Inversion
It provides a systematic overview for researchers and practitioners in remote sensing and AI, focusing on methodological shifts but is incremental as a review paper.
This paper reviews the evolution of quantitative remote sensing inversion techniques, from physics-based models to data-driven and foundation model approaches, highlighting recent advances in self-supervised pretraining and multi-modal integration while addressing challenges like physical interpretability and domain generalization.
Quantitative remote sensing inversion aims to estimate continuous surface variables-such as biomass, vegetation indices, and evapotranspiration-from satellite observations, supporting applications in ecosystem monitoring, carbon accounting, and land management. With the evolution of remote sensing systems and artificial intelligence, traditional physics-based paradigms are giving way to data-driven and foundation model (FM)-based approaches. This paper systematically reviews the methodological evolution of inversion techniques, from physical models (e.g., PROSPECT, SCOPE, DART) to machine learning methods (e.g., deep learning, multimodal fusion), and further to foundation models (e.g., SatMAE, GFM, mmEarth). We compare the modeling assumptions, application scenarios, and limitations of each paradigm, with emphasis on recent FM advances in self-supervised pretraining, multi-modal integration, and cross-task adaptation. We also highlight persistent challenges in physical interpretability, domain generalization, limited supervision, and uncertainty quantification. Finally, we envision the development of next-generation foundation models for remote sensing inversion, emphasizing unified modeling capacity, cross-domain generalization, and physical interpretability.