LGAIMar 20, 2025

Knowledge-guided machine learning for county-level corn yield prediction under drought

arXiv:2503.16328v3h-index: 54
Originality Incremental advance
AI Analysis

This work addresses the challenge of improving crop yield predictions for agricultural stakeholders, though it appears incremental as it builds on existing knowledge-guided methods by specifically integrating soil moisture and drought effects.

The authors tackled the problem of predicting county-level corn yield under drought by developing a knowledge-guided machine learning framework that incorporates soil moisture and a drought-aware loss function, resulting in a model that outperformed traditional machine learning approaches.

Remote sensing (RS) technique, enabling the non-contact acquisition of extensive ground observations, is a valuable tool for crop yield predictions. Traditional process-based models struggle to incorporate large volumes of RS data, and most users lack understanding of crop growth mechanisms. In contrast, machine learning (ML) models are often criticized as "black boxes" due to their limited interpretability. To address these limitations, we utilized Knowledge-Guided Machine Learning (KGML), a framework that leverages the strengths of both process-based and ML models. Existing works have either overlooked the role of soil moisture in corn growth or did not embed this effect into their models. To bridge this gap, we developed the Knowledge-Guided Machine Learning with Soil Moisture (KGML-SM) framework, treating soil moisture as an intermediate variable in corn growth to emphasize its key role in plant development. Additionally, based on the prior knowledge that the model may overestimate under drought conditions, we designed a drought-aware loss function that penalized predicted yield in drought-affected areas. Our experiments showed that the KGML-SM model outperformed other traditional ML models. We explored the relationships between drought, soil moisture, and corn yield prediction by assessing the importance of different features within the model, and analyzing how soil moisture impacts predictions across different regions and time periods. Finally we provided interpretability for prediction errors to guide future model optimization.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes