CV LGApr 20

DiffuSAM: Diffusion Guided Zero-Shot Object Grounding for Remote Sensing Imagery

Geet Sethi, Panav Shah, Ashutosh Gandhe, Soumitra Darshan Nayak

arXiv:2604.1820149.7h-index: 5

AI Analysis

Improves object localization accuracy for remote sensing applications, which is a domain-specific problem.

The paper proposes a hybrid pipeline combining diffusion-based localization cues with segmentation models (RemoteSAM, SAM3) to improve object grounding in remote sensing imagery, achieving over 14% increase in Acc@0.5 over state-of-the-art methods.

Diffusion models have emerged as powerful tools for a wide range of vision tasks, including text-guided image generation and editing. In this work, we explore their potential for object grounding in remote sensing imagery. We propose a hybrid pipeline that integrates diffusion-based localization cues with state-of-the-art segmentation models such as RemoteSAM and SAM3 to obtain more accurate bounding boxes. By leveraging the complementary strengths of generative diffusion models and foundational segmentation models, our approach enables robust and adaptive object localization across complex scenes. Experiments demonstrate that our pipeline significantly improves localization performance, achieving over a 14% increase in Acc@0.5 compared to existing state-of-the-art methods.

View on arXiv PDF

Similar