CVSep 11, 2025

Zero-shot Hierarchical Plant Segmentation via Foundation Segmentation Models and Text-to-image Attention

arXiv:2509.09116v2h-index: 8Has Code
Originality Incremental advance
AI Analysis

This addresses the problem of reducing labor-intensive, species-specific annotation needs for plant segmentation in agriculture, though it is incremental as it builds on existing foundation models.

The paper tackles the challenge of hierarchical segmentation of entire plant individuals from top-view images without training, by integrating a foundation segmentation model for leaf extraction and a vision-language model for structural reasoning. The result is ZeroPlantSeg, which outperforms existing zero-shot methods and shows better cross-domain performance than supervised methods on datasets with multiple species, growth stages, and environments.

Foundation segmentation models achieve reasonable leaf instance extraction from top-view crop images without training (i.e., zero-shot). However, segmenting entire plant individuals with each consisting of multiple overlapping leaves remains challenging. This problem is referred to as a hierarchical segmentation task, typically requiring annotated training datasets, which are often species-specific and require notable human labor. To address this, we introduce ZeroPlantSeg, a zero-shot segmentation for rosette-shaped plant individuals from top-view images. We integrate a foundation segmentation model, extracting leaf instances, and a vision-language model, reasoning about plants' structures to extract plant individuals without additional training. Evaluations on datasets with multiple plant species, growth stages, and shooting environments demonstrate that our method surpasses existing zero-shot methods and achieves better cross-domain performance than supervised methods. Implementations are available at https://github.com/JunhaoXing/ZeroPlantSeg.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes