CVROJul 9, 2018

Dynamic Objects Segmentation for Visual Localization in Urban Environments

arXiv:1807.02996v15 citations
Originality Incremental advance
AI Analysis

This addresses robustness issues in vision-based localization for mobile robotics in dynamic environments like city streets, representing an incremental improvement.

The paper tackles the problem of visual localization in crowded urban environments where dynamic objects cause feature inconsistency, by presenting an approach to automatically detect dynamic object instances using a CNN trained with synthetic and real-world data in a semi-supervised way, demonstrating reliable detection and promising performance on various datasets.

Visual localization and mapping is a crucial capability to address many challenges in mobile robotics. It constitutes a robust, accurate and cost-effective approach for local and global pose estimation within prior maps. Yet, in highly dynamic environments, like crowded city streets, problems arise as major parts of the image can be covered by dynamic objects. Consequently, visual odometry pipelines often diverge and the localization systems malfunction as detected features are not consistent with the precomputed 3D model. In this work, we present an approach to automatically detect dynamic object instances to improve the robustness of vision-based localization and mapping in crowded environments. By training a convolutional neural network model with a combination of synthetic and real-world data, dynamic object instance masks are learned in a semi-supervised way. The real-world data can be collected with a standard camera and requires minimal further post-processing. Our experiments show that a wide range of dynamic objects can be reliably detected using the presented method. Promising performance is demonstrated on our own and also publicly available datasets, which also shows the generalization capabilities of this approach.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes