CVAIMay 2, 2024

Addressing Diverging Training Costs using BEVRestore for High-resolution Bird's Eye View Map Construction

arXiv:2405.01016v41 citationsh-index: 3IEEE Robot Autom Lett
Originality Incremental advance
AI Analysis

This addresses a critical bottleneck for autonomous driving systems by improving motion planning safety through more accurate BEV maps.

The paper tackles the diverging training costs issue in high-resolution Bird's Eye View (BEV) map construction by proposing BEVRestore, which reduces GPU memory consumption and computing latency, enabling precise mapping of urban scene components like road lanes and sidewalks.

Recent advancements in Bird's Eye View (BEV) fusion for map construction have demonstrated remarkable mapping of urban environments. However, their deep and bulky architecture incurs substantial amounts of backpropagation memory and computing latency. Consequently, the problem poses an unavoidable bottleneck in constructing high-resolution (HR) BEV maps, as their large-sized features cause significant increases in costs including GPU memory consumption and computing latency, named diverging training costs issue. Affected by the problem, most existing methods adopt low-resolution (LR) BEV and struggle to estimate the precise locations of urban scene components like road lanes, and sidewalks. As the imprecision leads to risky motion planning like collision avoidance, the diverging training costs issue has to be resolved. In this paper, we address the issue with our novel BEVRestore mechanism. Specifically, our proposed model encodes the features of each sensor to LR BEV space and restores them to HR space to establish a memory-efficient map constructor. To this end, we introduce the BEV restoration strategy, which restores aliasing, and blocky artifacts of the up-scaled BEV features, and narrows down the width of the labels. Our extensive experiments show that the proposed mechanism provides a plug-and-play, memory-efficient pipeline, enabling an HR map construction with a broad BEV scope.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes