CVSep 30, 2025

Anchor-free Cross-view Object Geo-localization with Gaussian Position Encoding and Cross-view Association

arXiv:2509.25623v1h-index: 1

Originality Highly original

AI Analysis

This addresses the problem of object localization across different viewpoints for applications like surveillance or mapping, with incremental improvements over anchor-based methods.

The paper tackles cross-view object geo-localization by proposing an anchor-free method (AFGeo) that directly predicts offsets from pixels, eliminating dependency on predefined anchors, and achieves state-of-the-art performance on benchmark datasets.

Most existing cross-view object geo-localization approaches adopt anchor-based paradigm. Although effective, such methods are inherently constrained by predefined anchors. To eliminate this dependency, we first propose an anchor-free formulation for cross-view object geo-localization, termed AFGeo. AFGeo directly predicts the four directional offsets (left, right, top, bottom) to the ground-truth box for each pixel, thereby localizing the object without any predefined anchors. To obtain a more robust spatial prior, AFGeo incorporates Gaussian Position Encoding (GPE) to model the click point in the query image, mitigating the uncertainty of object position that challenges object localization in cross-view scenarios. In addition, AFGeo incorporates a Cross-view Object Association Module (CVOAM) that relates the same object and its surrounding context across viewpoints, enabling reliable localization under large cross-view appearance gaps. By adopting an anchor-free localization paradigm that integrates GPE and CVOAM with minimal parameter overhead, our model is both lightweight and computationally efficient, achieving state-of-the-art performance on benchmark datasets.

View on arXiv PDF

Similar