CVJun 24, 2024

GMT: Guided Mask Transformer for Leaf Instance Segmentation

arXiv:2406.17109v33 citationsHas Code
Originality Incremental advance
AI Analysis

This work addresses the problem of accurately segmenting individual leaves for plant monitoring and crop yield estimation, representing an incremental improvement in a domain-specific task.

The paper tackled leaf instance segmentation by proposing the Guided Mask Transformer (GMT), which integrates spatial distribution priors to improve separation of overlapping leaves, achieving state-of-the-art performance on three public plant datasets.

Leaf instance segmentation is a challenging multi-instance segmentation task, aiming to separate and delineate each leaf in an image of a plant. Accurate segmentation of each leaf is crucial for plant-related applications such as the fine-grained monitoring of plant growth and crop yield estimation. This task is challenging because of the high similarity (in shape and colour), great size variation, and heavy occlusions among leaf instances. Furthermore, the typically small size of annotated leaf datasets makes it more difficult to learn the distinctive features needed for precise segmentation. We hypothesise that the key to overcoming the these challenges lies in the specific spatial patterns of leaf distribution. In this paper, we propose the Guided Mask Transformer (GMT), which leverages and integrates leaf spatial distribution priors into a Transformer-based segmentor. These spatial priors are embedded in a set of guide functions that map leaves at different positions into a more separable embedding space. Our GMT consistently outperforms the state-of-the-art on three public plant datasets. Our code is available at https://github.com/vios-s/gmt-leaf-ins-seg.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes