CV LG MM | Mar 9, 2025

GroMo: Plant Growth Modeling with Multiview Images

Ruchi Bhatt, Shreya Bansal, Amanpreet Chander, Rupinder Kaur, Malya Singh, Mohan Kankanhalli, Abdulmotaleb El Saddik, Mukesh Kumar Saini

arXiv:2503.06608v2 | 5 citations | h-index: 30 | Has Code

Originality: Synthesis-oriented

AI Analysis

This work addresses crop monitoring and precision agriculture for farmers and researchers, but it is incremental because it builds on existing plant phenotyping methods.

The paper tackles plant growth modeling by introducing the GroMo challenge with tasks for age prediction and leaf count estimation, using a new dataset GroMo25 and a Multiview Vision Transformer model that achieves an average MAE of 7.74 for age and 5.52 for leaf count.

Understanding plant growth dynamics is essential for applications in agriculture and plant phenotyping. We present the Growth Modelling (GroMo) challenge, which is designed for two primary tasks: (1) plant age prediction and (2) leaf count estimation, both essential for crop monitoring and precision agriculture. For this challenge, we introduce GroMo25, a dataset with images of four crops: radish, okra, wheat, and mustard. Each crop is represented by multiple plants (p1, p2, ..., pn) captured over different days (d1, d2, ..., dm) and categorized into five levels (L1, L2, L3, L4, L5). Each plant is captured from 24 different angles with a 15-degree gap between consecutive images, covering a full 360-degree rotation. Participants are required to perform both tasks for all four crops using these multiview images. We propose a Multiview Vision Transformer (MVVT) model for the GroMo challenge and evaluate its crop-wise performance on GroMo25. MVVT reports an average MAE of 7.74 for age prediction and an MAE of 5.52 for leaf count. The GroMo Challenge aims to advance plant phenotyping research by encouraging innovative solutions for tracking and predicting plant growth. The GitHub repository is publicly available at https://github.com/mriglab/GroMo-Plant-Growth-Modeling-with-Multiview-Images.
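To make the capture geometry and the evaluation metric concrete, the following is a minimal Python sketch, not the authors' code. It enumerates the 24 view angles at 15-degree spacing and computes a mean absolute error (MAE), then averages per-crop MAEs. The starting angle of 0 degrees, the function names, the simple unweighted mean over crops, and all numeric values in the example are assumptions chosen for illustration; none of them come from the paper or the GroMo repository.

```python
from typing import Dict, List, Sequence

NUM_VIEWS = 24        # views per plant, as stated in the abstract
ANGLE_STEP_DEG = 15   # angular gap between consecutive views, as stated


def view_angles(num_views: int = NUM_VIEWS,
                step_deg: int = ANGLE_STEP_DEG) -> List[int]:
    """Return capture angles in degrees, assuming the first view is at 0."""
    return [i * step_deg for i in range(num_views)]


def mean_absolute_error(y_true: Sequence[float],
                        y_pred: Sequence[float]) -> float:
    """Average of |target - prediction| over all samples."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def average_over_crops(per_crop_mae: Dict[str, float]) -> float:
    """Unweighted mean of per-crop MAEs (an assumed aggregation rule)."""
    return sum(per_crop_mae.values()) / len(per_crop_mae)


angles = view_angles()

# Hypothetical targets and predictions, for illustration only.
true_ages = [10, 20, 30]
pred_ages = [12, 17, 31]
age_mae = mean_absolute_error(true_ages, pred_ages)

# Hypothetical per-crop MAEs, not results reported in the paper.
crop_mae = {"radish": 2.0, "okra": 4.0, "wheat": 6.0, "mustard": 8.0}
overall_mae = average_over_crops(crop_mae)
```

With 24 views at 15-degree steps, the last angle is 345 degrees, so the views cover the full circle without repeating the first one. Whether the reported averages weight the four crops equally is not stated in the abstract, so the unweighted mean here is only one plausible reading.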
