CVMay 12, 2022

Building Facade Parsing R-CNN

arXiv:2205.05912v11 citationsh-index: 33Has Code
Originality Incremental advance
AI Analysis

This addresses facade parsing for autonomous vehicle perception, but it is incremental as it adapts existing methods to a new viewpoint.

The paper tackles building facade parsing from deformed camera views for autonomous vehicles, proposing Facade R-CNN with a transconv module and convex regularization, which outperforms state-of-the-art models designed for frontal views.

Building facade parsing, which predicts pixel-level labels for building facades, has applications in computer vision perception for autonomous vehicle (AV) driving. However, instead of a frontal view, an on-board camera of an AV captures a deformed view of the facade of the buildings on both sides of the road the AV is travelling on, due to the camera perspective. We propose Facade R-CNN, which includes a transconv module, generalized bounding box detection, and convex regularization, to perform parsing of deformed facade views. Experiments demonstrate that Facade R-CNN achieves better performance than the current state-of-the-art facade parsing models, which are primarily developed for frontal views. We also publish a new building facade parsing dataset derived from the Oxford RobotCar dataset, which we call the Oxford RobotCar Facade dataset. This dataset contains 500 street-view images from the Oxford RobotCar dataset augmented with accurate annotations of building facade objects. The published dataset is available at https://github.com/sijieaaa/Oxford-RobotCar-Facade

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes