CVMar 14

MOGeo: Beyond One-to-One Cross-View Object Geo-localization

Bo Lv, Qingwang Zhang, Le Wu, Yuanyuan Li, Yingying Zhu

arXiv:2603.1384323.3h-index: 2

AI Analysis

This addresses the gap in practical geo-localization for real-world applications with multiple objects, but it is incremental as it extends an existing task without broad SOTA impact.

The paper tackles the problem of cross-view object geo-localization by introducing a new task for multi-object scenarios, as existing methods assume single objects, and constructs a benchmark dataset to validate their proposed method, showing it remains challenging with no specific performance numbers provided.

Cross-View Object Geo-Localization (CVOGL) aims to locate an object of interest in a query image within a corresponding satellite image. Existing methods typically assume that the query image contains only a single object, which does not align with the complex, multi-object geo-localization requirements in real-world applications, making them unsuitable for practical scenarios. To bridge the gap between the realistic setting and existing task, we propose a new task, called Cross-View Multi-Object Geo-Localization (CVMOGL). To advance the CVMOGL task, we first construct a benchmark, CMLocation, which includes two datasets: CMLocation-V1 and CMLocation-V2. Furthermore, we propose a novel cross-view multi-object geo-localization method, MOGeo, and benchmark it against existing state-of-the-art methods. Extensive experiments are conducted under various application scenarios to validate the effectiveness of our method. The results demonstrate that cross-view object geo-localization in the more realistic setting remains a challenging problem, encouraging further research in this area.

View on arXiv PDF

Similar