CVSep 11, 2024

PaveSAM Segment Anything for Pavement Distress

arXiv:2409.07295v118 citationsh-index: 21Has Code
Originality Incremental advance
AI Analysis

This work addresses the need for efficient pavement monitoring for infrastructure management, though it is incremental as it adapts an existing model (SAM) to a specific domain.

The paper tackled the problem of costly pixel-level annotations for pavement distress segmentation by proposing PaveSAM, a zero-shot model that segments pavement distresses using bounding box prompts, achieving high performance with only 180 training images and reducing labeling efforts.

Automated pavement monitoring using computer vision can analyze pavement conditions more efficiently and accurately than manual methods. Accurate segmentation is essential for quantifying the severity and extent of pavement defects and consequently, the overall condition index used for prioritizing rehabilitation and maintenance activities. Deep learning-based segmentation models are however, often supervised and require pixel-level annotations, which can be costly and time-consuming. While the recent evolution of zero-shot segmentation models can generate pixel-wise labels for unseen classes without any training data, they struggle with irregularities of cracks and textured pavement backgrounds. This research proposes a zero-shot segmentation model, PaveSAM, that can segment pavement distresses using bounding box prompts. By retraining SAM's mask decoder with just 180 images, pavement distress segmentation is revolutionized, enabling efficient distress segmentation using bounding box prompts, a capability not found in current segmentation models. This not only drastically reduces labeling efforts and costs but also showcases our model's high performance with minimal input, establishing the pioneering use of SAM in pavement distress segmentation. Furthermore, researchers can use existing open-source pavement distress images annotated with bounding boxes to create segmentation masks, which increases the availability and diversity of segmentation pavement distress datasets.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes