ROMar 10

Autonomous Search for Sparsely Distributed Visual Phenomena through Environmental Context Modeling

arXiv:2603.10174v112.91 citationsh-index: 15
Predicted impact top 41% in RO · last 90 daysOriginality Incremental advance
AI Analysis

This addresses the challenge of limited battery life in AUVs for marine ecology surveys, though it appears incremental as it builds on existing one-shot detection and adaptive planning methods.

The paper tackles the problem of efficiently locating sparsely distributed coral species with autonomous underwater vehicles by using visual environmental context as guidance, achieving sampling of 75% of targets in roughly half the time required by exhaustive coverage.

Autonomous underwater vehicles (AUVs) are increasingly used to survey coral reefs, yet efficiently locating specific coral species of interest remains difficult: target species are often sparsely distributed across the reef, and an AUV with limited battery life cannot afford to search everywhere. When detections of the target itself are too sparse to provide directional guidance, the robot benefits from an additional signal to decide where to look next. We propose using the visual environmental context -- the habitat features that tend to co-occur with a target species -- as that signal. Because context features are spatially denser and often vary more smoothly than target detections, we hypothesize that a reward function targeted at broader environmental context will enable adaptive planners to make better decisions on where to go next, even in regions where no target has yet been observed. Starting from a single labeled image, our method uses patch-level DINOv2 embeddings to perform one-shot detections of both the target species and its surrounding context online. We validate our approach using real imagery collected by an AUV at two reef sites in St. John, U.S. Virgin Islands, simulating the robot's motion offline. Our results demonstrate that one-shot detection combined with adaptive context modeling enables efficient autonomous surveying, sampling up to 75$\%$ of the target in roughly half the time required by exhaustive coverage when the target is sparsely distributed, and outperforming search strategies that only use target detections.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes