CV · Dec 3, 2025

MOS: Mitigating Optical-SAR Modality Gap for Cross-Modal Ship Re-Identification

arXiv:2512.03404v1 · 1 citation · h-index: 2
Originality: Incremental advance
AI Analysis

This addresses a critical but underexplored problem in maritime intelligence and surveillance, though it appears incremental as it builds on existing cross-modal ReID approaches.

The paper tackles cross-modal ship re-identification between optical and SAR imagery by proposing the MOS framework to mitigate the modality gap, achieving improvements of +3.0% to +16.4% in R1 accuracy over state-of-the-art methods on the HOSS ReID dataset.
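For readers unfamiliar with the R1 (Rank-1) metric quoted above, the following is a minimal sketch of how it is commonly computed in ReID: a query counts as a hit when its most similar gallery image shares its identity. The function names, the use of cosine similarity, and the toy features are assumptions for illustration; the HOSS ReID evaluation protocol may apply additional gallery filtering that is not described here.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def rank1_accuracy(query_feats, query_ids, gallery_feats, gallery_ids):
    """Fraction of queries whose top-ranked gallery item has the same identity."""
    hits = 0
    for q_feat, q_id in zip(query_feats, query_ids):
        sims = [cosine_similarity(q_feat, g_feat) for g_feat in gallery_feats]
        best = max(range(len(sims)), key=sims.__getitem__)  # index of nearest gallery item
        hits += int(gallery_ids[best] == q_id)
    return hits / len(query_ids)


# Toy "SAR to Optical" setting: SAR queries, optical gallery (hypothetical values).
sar_queries = [[1.0, 0.1], [0.2, 1.0], [0.3, 1.0]]
sar_ids = [0, 1, 0]
optical_gallery = [[1.0, 0.0], [0.0, 1.0]]
optical_ids = [0, 1]

r1 = rank1_accuracy(sar_queries, sar_ids, optical_gallery, optical_ids)
print(f"R1 = {r1:.3f}")
```

In this toy example the third query is closest to the gallery item of a different ship, so two of the three queries are hits.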

Cross-modal ship re-identification (ReID) between optical and synthetic aperture radar (SAR) imagery has recently emerged as a critical yet underexplored task in maritime intelligence and surveillance. However, the substantial modality gap between optical and SAR images poses a major challenge for robust identification. To address this issue, we propose MOS, a novel framework designed to mitigate the optical-SAR modality gap and achieve modality-consistent feature learning for optical-SAR cross-modal ship ReID. MOS consists of two core components: (1) Modality-Consistent Representation Learning (MCRL), which applies SAR image denoising and a class-wise modality alignment loss to align intra-identity feature distributions across modalities; and (2) Cross-modal Data Generation and Feature fusion (CDGF), which leverages a Brownian bridge diffusion model to synthesize cross-modal samples that are then fused with the original features during inference to enhance alignment and discriminability. Extensive experiments on the HOSS ReID dataset demonstrate that MOS significantly surpasses state-of-the-art methods across all evaluation protocols, achieving notable improvements of +3.0%, +6.2%, and +16.4% in R1 accuracy under the ALL to ALL, Optical to SAR, and SAR to Optical settings, respectively. The code and trained models will be released upon publication.
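The abstract does not give the exact form of the class-wise modality alignment loss or of the inference-time fusion, so the following is only a rough sketch of the general idea under stated assumptions. It assumes the loss is the squared distance between per-identity feature centers of the two modalities, averaged over identities seen in both, and that fusion is a weighted average of an original feature and the feature of its synthesized counterpart. The names `classwise_alignment_loss`, `fuse_features`, and `weight` are hypothetical and not taken from the paper.

```python
def mean_vector(vectors):
    """Element-wise mean of a non-empty list of equal-length vectors."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def classwise_alignment_loss(opt_feats, opt_ids, sar_feats, sar_ids):
    """Assumed form: mean squared distance between optical and SAR centers per identity."""
    shared = sorted(set(opt_ids) & set(sar_ids))  # identities present in both modalities
    if not shared:
        return 0.0
    total = 0.0
    for ident in shared:
        opt_center = mean_vector([f for f, i in zip(opt_feats, opt_ids) if i == ident])
        sar_center = mean_vector([f for f, i in zip(sar_feats, sar_ids) if i == ident])
        total += sum((o - s) ** 2 for o, s in zip(opt_center, sar_center))
    return total / len(shared)


def fuse_features(original, generated, weight=0.5):
    """Assumed fusion: convex combination of original and generated-sample features."""
    return [(1 - weight) * o + weight * g for o, g in zip(original, generated)]


# Toy features (hypothetical values): two optical images of ship 0, one of ship 1.
opt_feats = [[1.0, 0.0], [3.0, 0.0], [0.0, 2.0]]
opt_ids = [0, 0, 1]
sar_feats = [[2.0, 1.0], [0.0, 2.0]]
sar_ids = [0, 1]

loss = classwise_alignment_loss(opt_feats, opt_ids, sar_feats, sar_ids)
fused = fuse_features([1.0, 3.0], [3.0, 1.0], weight=0.5)
print(loss, fused)
```

Minimizing a loss of this kind pulls the optical and SAR feature centers of the same ship together, which is one way to read "align intra-identity feature distributions across modalities". The actual MOS objective may differ.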
